Jupyter Notebook Natural Language Processing

Open-source Jupyter Notebook projects categorized as Natural Language Processing

Top 23 Jupyter Notebook Natural Language Processing Projects

  1. Made-With-ML

    Learn how to design, develop, deploy and iterate on production-grade ML applications.

  2. nlp-tutorial

    Natural Language Processing Tutorial for Deep Learning Researchers

  3. nlpaug

    Data augmentation for NLP

  4. pytorch-sentiment-analysis

    Tutorials on getting started with PyTorch and TorchText for sentiment analysis.

  5. FLAML

    A fast library for AutoML and tuning. Join our Discord: https://discord.gg/Cppx2vSPVP.

  6. ml-course

    Open Machine Learning course

  7. mlops-course

    Learn how to design, develop, deploy and iterate on production-grade ML applications.

  8. pythoncode-tutorials

    The Python Code Tutorials

  9. EasyEdit

    [ACL 2024] An Easy-to-use Knowledge Editing Framework for LLMs.

  10. ecco

    Explain, analyze, and visualize NLP language models. Ecco creates interactive visualizations directly in Jupyter notebooks explaining the behavior of Transformer-based language models (like GPT2, BERT, RoBERTa, T5, and T0).

  11. bert_score

    BERT score for text generation

  12. awesome-ai-ml-dl

    Awesome Artificial Intelligence, Machine Learning and Deep Learning as we learn it. Study notes and a curated list of awesome resources of such topics.

  13. superlinked

    Superlinked is a Python framework for AI Engineers building high-performance search & recommendation applications that combine structured and unstructured data.

    Project mention: You Don't Need Re-Ranking: Understanding the Superlinked Vector Layer | news.ycombinator.com | 2025-05-23

  14. transformers-interpret

    Model explainability that works seamlessly with 🤗 transformers. Explain your transformers model in just 2 lines of code.

  15. fastText_multilingual

    Multilingual word vectors in 78 languages

  16. question_generation

    Neural question generation using transformers

  17. ThoughtSource

    A central, open resource for data and tools related to chain-of-thought reasoning in large language models. Developed @ Samwald research group: https://samwald.info/

  18. conformal-prediction

    Lightweight, useful implementation of conformal prediction on real data.

  19. hate-speech-and-offensive-language

    Repository for the paper "Automated Hate Speech Detection and the Problem of Offensive Language", ICWSM 2017

  20. PIXIU

    This repository introduces PIXIU, an open-source resource featuring the first financial large language models (LLMs), instruction tuning data, and evaluation benchmarks to holistically assess financial LLMs. Our goal is to continually push forward the open-source development of financial artificial intelligence (AI).

  21. malaya

    Natural Language Toolkit for Malaysian language, https://malaya.readthedocs.io/

  22. fromage

    🧀 Code and models for the ICML 2023 paper "Grounding Language Models to Images for Multimodal Inputs and Outputs".

  23. bert-sklearn

    A sklearn wrapper for Google's BERT model

NOTE: The open-source projects on this list are ordered by number of GitHub stars. The number of mentions indicates repo mentions in the last 12 months or since we started tracking (Dec 2020).

Jupyter Notebook Natural Language Processing related posts

  • Build generative search engine with verifiable answers

    1 project | news.ycombinator.com | 27 Jul 2025
  • Open source expert system with verifiable answers

    1 project | news.ycombinator.com | 20 Jul 2025
  • VerifAI – Generative Search/Productivity engine with Verifiable answers (star)

    1 project | news.ycombinator.com | 15 Jul 2025
  • VerifAI – open-source generative search with verification

    1 project | news.ycombinator.com | 8 May 2025
  • VerifAI – open-source private/organizational gen search

    1 project | news.ycombinator.com | 1 Mar 2025
  • VerifAI – document-based question-answering systems

    1 project | news.ycombinator.com | 21 Feb 2025
  • Generative Search for Everyone

    1 project | news.ycombinator.com | 2 Feb 2025

Index

What are some of the best open-source Natural Language Processing projects in Jupyter Notebook? This list will help you:

# Project Stars
1 Made-With-ML 44,375
2 nlp-tutorial 14,782
3 nlpaug 4,637
4 pytorch-sentiment-analysis 4,571
5 FLAML 4,261
6 ml-course 3,398
7 mlops-course 3,222
8 pythoncode-tutorials 2,955
9 EasyEdit 2,667
10 ecco 2,062
11 bert_score 1,827
12 awesome-ai-ml-dl 1,623
13 superlinked 1,456
14 transformers-interpret 1,392
15 fastText_multilingual 1,199
16 question_generation 1,139
17 ThoughtSource 1,005
18 conformal-prediction 993
19 hate-speech-and-offensive-language 833
20 PIXIU 814
21 malaya 518
22 fromage 482
23 bert-sklearn 302

Did you know that Jupyter Notebook is the 13th most popular programming language based on number of references?