Skip to content
#

preprocessing-data

Here are 240 public repositories matching this topic...

Simple and automatic data cleaning in one line of code! It performs one-hot encoding, date & time casting to datetime dtype, detects binary columns, safely convert non-numeric columns to numeric dtypes, cleaning dirty/empty values, normalizing values and removing unwanted columns all in one line of code. Get your data ready for model training an…

  • Updated May 22, 2021
  • Python

This project develops an activity recognition model for a mobile fitness app using statistical analysis and machine learning. By processing smartphone sensor data, it extracts features to train models that accurately recognize user activities.

  • Updated Aug 6, 2024
  • Jupyter Notebook

Analyzing logistics data to optimize shipment efficiency, reduce delays, and enhance supply chain visibility using Power BI. Insights include top routes, delays, supplier trends, and peak shipments.

  • Updated Dec 23, 2025
  • Jupyter Notebook

This project involves web scraping, data preprocessing, database storage and visualization of IMDb movie data from the last decade (2014-2024). The dataset includes details of 10,000 movies such as name, release year, genre, ratings, metascore and more. The project culminates in an interactive Power BI dashboard for in-depth insights and reporting.

  • Updated Feb 3, 2025
  • Jupyter Notebook

Improve this page

Add a description, image, and links to the preprocessing-data topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the preprocessing-data topic, visit your repo's landing page and select "manage topics."

Learn more