Parsing
A grammar describes the syntax of a programming language, and might be defined in Backus-Naur form (BNF). A lexer performs lexical analysis, turning text into tokens. A parser takes tokens and builds a data structure like an abstract syntax tree (AST). The parser is concerned with context: does the sequence of tokens fit the grammar? A compiler is a combined lexer and parser, built for a specific grammar.
Here are 272 public repositories matching this topic...
Wrangling Untrusted File Formats Safely
- Updated
Jul 19, 2025 - C
Jsmn is a world fastest JSON parser/tokenizer. This is the official repo replacing the old one at Bitbucket
- Updated
Jun 9, 2024 - C
BNF wrangling and railroad diagrams
- Updated
Oct 24, 2025 - C
ClangKit provides an Objective-C frontend to LibClang. Source tokenization, diagnostics and fix-its are actually implemented.
- Updated
Aug 2, 2021 - C
como-lang-ng is now ana-lang, located at https://github.com/analang/ana
- Updated
Aug 8, 2019 - C
Standalone tree sitter bindings for the Lua language
- Updated
Sep 3, 2025 - C
A library for parsing FIX (Financial Information eXchange) protocol messages.
- Updated
May 12, 2025 - C
cparse is an LR(1) and LALR(1) parser generator
- Updated
Sep 23, 2025 - C
non-backtracking NFA-based regular expression library, for C and Python
- Updated
Oct 4, 2020 - C
Core Foundation, libyajl based JSON support.
- Updated
Dec 21, 2021 - C
[WIP] Yet another try at giving org-mode the non-EmacsLisp parser it also deserves
- Updated
Jul 2, 2021 - C
Single-file Parsing Expression Grammars (PEG) runtime engine for C
- Updated
Aug 17, 2025 - C
- Followers
- 11k followers
- Website
- github.com/topics/parsing
- Wikipedia
- Wikipedia