
It’s funny how there is continuous reinvention of parsing approaches.

Why isn't there already a parser generator with vector instructions, PGO, and low stack usage? Instead there are just endless rewrites of recursive descent, with caching optimizations sprinkled in where needed.
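
For a sense of what "vector instructions" could mean here, a minimal sketch (my own illustration, not from any particular library; the function name is made up) of skipping a run of ASCII spaces 16 bytes at a time with SSE2, which is baseline on x86_64:

    #[cfg(target_arch = "x86_64")]
    fn skip_spaces_simd(input: &[u8], mut pos: usize) -> usize {
        use core::arch::x86_64::*;
        // SSE2 is guaranteed on x86_64; the unsafe block is for the
        // intrinsics and the raw pointer load.
        unsafe {
            let spaces = _mm_set1_epi8(b' ' as i8);
            while pos + 16 <= input.len() {
                let chunk = _mm_loadu_si128(input.as_ptr().add(pos) as *const __m128i);
                let mask = _mm_movemask_epi8(_mm_cmpeq_epi8(chunk, spaces)) as u32;
                if mask != 0xFFFF {
                    // First non-space byte inside this 16-byte chunk.
                    return pos + (!mask).trailing_zeros() as usize;
                }
                pos += 16;
            }
        }
        // Scalar tail for the last few bytes.
        while pos < input.len() && input[pos] == b' ' {
            pos += 1;
        }
        pos
    }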



Hardware also changes over time. Something that was initially fast gets tried by people with new hardware, who find it's not so fast for them anymore, so they create their own "fast X". Fast forward 10 more years, someone with newer hardware looks at that and asks "huh, why isn't it using extension Y?", and now we have three libraries all called "Fast X".


Because you have to learn how to use any given parser generator, naive code is easy to write, and there are tons of applications for parsing that aren't really performance critical.
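
To make "naive code is easy to write" concrete, here is a hypothetical minimal recursive descent evaluator for expressions like "2+3*4" (a real parser would also track spans and report errors, but the shape is the same):

    struct Parser<'a> {
        input: &'a [u8],
        pos: usize,
    }

    impl<'a> Parser<'a> {
        // expr := term ('+' term)*
        fn expr(&mut self) -> Option<i64> {
            let mut value = self.term()?;
            while self.eat(b'+') {
                value += self.term()?;
            }
            Some(value)
        }

        // term := number ('*' number)*
        fn term(&mut self) -> Option<i64> {
            let mut value = self.number()?;
            while self.eat(b'*') {
                value *= self.number()?;
            }
            Some(value)
        }

        fn number(&mut self) -> Option<i64> {
            let start = self.pos;
            while self.pos < self.input.len() && self.input[self.pos].is_ascii_digit() {
                self.pos += 1;
            }
            std::str::from_utf8(&self.input[start..self.pos]).ok()?.parse().ok()
        }

        fn eat(&mut self, byte: u8) -> bool {
            if self.input.get(self.pos) == Some(&byte) {
                self.pos += 1;
                true
            } else {
                false
            }
        }
    }

    // Parser { input: b"2+3*4", pos: 0 }.expr() == Some(14)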


I'd say it's because parsing is a very specific kind of work, heavily dependent on the grammar you're dealing with.


A parser spends time:

1. Consuming tokens.

2. Recognizing the grammar.

3. Producing AST nodes.

Steps 1 and 3 are heavily dependent on the data types that make the most sense for the previous (lexing) and next (semantic analysis) phases of the compiler. There is no one Token type that works for every language, nor one AST type.

Recognizing the grammar (step 2) is relatively easy, but since so much of the code consumes and produces data types unique to a given implementation, it's hard to have very high-performance reusable libraries. (See the sketch below.)
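
As a hypothetical illustration of that coupling (a toy language, not any real compiler's types): the Token and Expr types below are shaped entirely by one specific language, and the grammar-recognition step in the middle consumes and produces exactly those implementation-specific types.

    #[allow(dead_code)]
    enum Token {
        Ident(String), // likely interned symbols in a real compiler
        Int(i64),
        Plus,
        LParen,
        RParen,
    }

    #[allow(dead_code)]
    enum Expr {
        Var(String),
        Lit(i64),
        Add(Box<Expr>, Box<Expr>),
    }

    // Step 2 looks like the reusable middle, but its input and output
    // types are these implementation-specific ones.
    fn parse_add(tokens: &[Token]) -> Option<Expr> {
        match tokens {
            [Token::Int(a), Token::Plus, Token::Int(b)] => {
                Some(Expr::Add(Box::new(Expr::Lit(*a)), Box::new(Expr::Lit(*b))))
            }
            _ => None,
        }
    }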


There are good parser generators, but potentially not as Rust libraries.



Meanwhile, C++ has more than a hundred, with a focus on production-ready rather than innovative design patterns.



