feature-engineering-tutorials

Data Science Feature Engineering and Selection Tutorials (by rasgointelligence)

Feature-engineering-tutorials Alternatives

Similar projects and alternatives to feature-engineering-tutorials

  1. RasgoQL

    Write Python locally, execute SQL in your data warehouse

  2. jupyter-notebook-chatcompletion

    Jupyter Notebook ChatCompletion is a VS Code extension that brings the power of OpenAI's ChatCompletion API to your Jupyter Notebooks!

  3. PRML

    1 feature-engineering-tutorials VS PRML

    PRML algorithms implemented in Python

  4. PyImpetus

    PyImpetus is a Markov Blanket based feature subset selection algorithm that considers features both separately and together as a group in order to provide not just the best set of features but also the best combination of features

  5. FeatureHub

    The most comprehensive library of AI/ML features across multiple domains. Our goal is to create a dataset that serves as a valuable resource for researchers and data scientists worldwide (by FeatureHub-AI)

  6. dtreeviz

    A Python library for decision tree visualization and model interpretation.

  7. MachineLearningNotebooks

    Python notebooks with ML and deep learning examples with Azure Machine Learning Python SDK | Microsoft

  8. ydata-quality

    Data Quality assessment with one line of code

  9. reinforcement_learning_course_materials

    Lecture notes, tutorial tasks including solutions as well as online videos for the reinforcement learning course hosted by Paderborn University

  10. desbordante-core

    Desbordante is a high-performance data profiler that is capable of discovering many different patterns in data using various algorithms. It can also run data-cleaning scenarios using these algorithms. Desbordante has a console version and an easy-to-use web application.

  11. gastrodon

    Visualize RDF data in Jupyter with Pandas

NOTE: The number of mentions on this list counts mentions in common posts plus user-suggested alternatives. Hence, a higher number means a better feature-engineering-tutorials alternative or higher similarity.

feature-engineering-tutorials reviews and mentions

Posts with mentions or reviews of feature-engineering-tutorials. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2022-03-08.
  • How to balance multiple time series data?
    2 projects | /r/datascience | 8 Mar 2022
    I’ve actually solved a similar problem several times in a variety of settings. I’ve had success with boosted trees and feature engineering on the sensor readings over time. I treat each reading as an observation and set the target to be the value I want to forecast (e.g. one hour ahead, the sum over the next day, or the value at the same time the next day). There was a recent paper that compared boosted trees to deep learning techniques and found that the boosted trees performed really well. Next, I perform feature engineering to aggregate the data up to the current time. These features include the current value, lagged values over multiple observations for that sensor, more complicated features from moving statistics over different time scales, etc. I actually wrote a blog post about creating these features using the open-source package RasgoQL and have similar types of features shared in the open-source feature-engineering-tutorials repository. I have also had success creating these sorts of historical features using the tsfresh package. Finally, when evaluating the forecast, use a time-based split so that earlier data is used to train the model and later data to evaluate it.
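
The workflow described in the post above (a forecast target per reading, lagged and moving-window features per sensor, and a time-based train/test split) can be sketched roughly as follows. This is only an illustration using plain pandas, not the RasgoQL or tsfresh code the poster refers to. The column names `sensor`, `timestamp`, and `value`, the helper names `build_features` and `time_based_split`, and the toy data are all assumptions made for the example.

```python
import pandas as pd


def build_features(df, horizon=1, lags=(1, 2), window=3):
    """Add lag, rolling-mean, and target columns for each sensor.

    Assumes hypothetical columns 'sensor', 'timestamp', 'value' and
    evenly spaced readings per sensor.
    """
    df = df.sort_values(["sensor", "timestamp"]).copy()
    grouped = df.groupby("sensor")["value"]

    # Lagged values: the reading 1, 2, ... observations earlier for the same sensor.
    for lag in lags:
        df[f"lag_{lag}"] = grouped.shift(lag)

    # Moving statistic over the current and previous readings only (no future data).
    df[f"roll_mean_{window}"] = grouped.transform(
        lambda s: s.rolling(window, min_periods=window).mean()
    )

    # Target: the value `horizon` readings ahead for the same sensor.
    df["target"] = grouped.shift(-horizon)

    # Drop rows that lack enough history or have no future value yet.
    return df.dropna().reset_index(drop=True)


def time_based_split(df, cutoff):
    """Earlier rows train the model, later rows evaluate it."""
    train = df[df["timestamp"] < cutoff]
    test = df[df["timestamp"] >= cutoff]
    return train, test


# Toy data: two sensors with eight hourly readings each (made up for illustration).
start = pd.Timestamp("2022-03-01")
hours = [start + pd.Timedelta(hours=i) for i in range(8)]
readings = pd.DataFrame({
    "sensor": ["a"] * 8 + ["b"] * 8,
    "timestamp": hours * 2,
    "value": [float(v) for v in range(8)] + [float(v) for v in range(10, 18)],
})

features = build_features(readings)
train, test = time_based_split(features, pd.Timestamp("2022-03-01 05:00"))
```

The resulting `train` and `test` frames could then be passed to any boosted-tree library, with `target` as the label and the remaining numeric columns as inputs. Splitting on the timestamp rather than at random keeps later readings out of the training set.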

Stats

Basic feature-engineering-tutorials repo stats
1
289
0.0
10 days ago

