SQL. It is a joke, but an SQL engine can be massively parallel. You just don't k...

drivebyhooting · 2025-11-05T21:07:10 1762376830

My issue with SQL is lack of composability and difficulty of debugging intermediate results.

mamcx · 2025-11-06T04:30:51 1762403451

Yes, SQL is poor.

What could be good is a relational + array model. I have some ideas at https://tablam.org, and I think building the optimizer in tandem with the language, not just the language on its own, will be very nice.

oembar4 · 2025-11-06T11:07:04 1762427224

The programming style reminds me of the old days of Clipper and the xBase family, even ABAP. I like the syntax.

zozbot234 · 2025-11-06T17:12:18 1762449138

You can use SQL CTEs and/or VIEWs as a composable abstraction over queries and inspect intermediate results. The language features are there.
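A minimal sketch of that idea, using Python's built-in sqlite3 module so it runs anywhere. The `orders` table, the `customer_totals` view, and the `big` CTE are made-up names for illustration only; other engines should behave similarly, but this is only checked against SQLite.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    -- Hypothetical sample data, purely for illustration.
    CREATE TABLE orders (customer TEXT, amount INTEGER);
    INSERT INTO orders VALUES ('ann', 10), ('ann', 30), ('bob', 5), ('cat', 50);

    -- A VIEW gives an intermediate step a name that other queries can build on.
    CREATE VIEW customer_totals AS
        SELECT customer, SUM(amount) AS total
        FROM orders
        GROUP BY customer;
""")

# Inspect the intermediate result on its own.
totals = conn.execute(
    "SELECT customer, total FROM customer_totals ORDER BY customer"
).fetchall()

# Compose on top of the view, using a CTE as a second named step.
big_spenders = conn.execute("""
    WITH big AS (
        SELECT customer, total FROM customer_totals WHERE total >= 40
    )
    SELECT customer FROM big ORDER BY customer
""").fetchall()

print(totals)
print(big_spenders)
```

Each named step can be queried by itself while debugging, which is roughly the "inspect intermediate results" part.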

kragen · 2025-11-06T11:53:37 1762430017

The standard things that someone should always say when someone brings up this problem are:

• Datalog is much, much better on these axes.

• Tutorial D is also better than SQL.
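For anyone who hasn't seen Datalog, here is a rough sketch of the flavor, written as plain Python rather than real Datalog. The `parent` facts and the `ancestors` helper are hypothetical, and the loop is a naive fixpoint evaluation, not how a production Datalog engine would do it. The two rules being evaluated are shown in the comments.

```python
# Facts: parent(X, Y). Hypothetical sample data.
parent = {("alice", "bob"), ("bob", "carol"), ("carol", "dave")}


def ancestors(parent):
    """Naive fixpoint evaluation of:

    ancestor(X, Y) :- parent(X, Y).
    ancestor(X, Z) :- parent(X, Y), ancestor(Y, Z).
    """
    ancestor = set(parent)  # first rule: every parent is an ancestor
    while True:
        # Second rule: join parent with the ancestor facts derived so far.
        derived = {
            (x, z)
            for (x, y) in parent
            for (y2, z) in ancestor
            if y == y2
        }
        if derived <= ancestor:  # nothing new, fixpoint reached
            return ancestor
        ancestor |= derived


ancestor = ancestors(parent)
print(sorted(ancestor))
```

Every rule defines a named relation, so any intermediate relation can be queried or reused by other rules, which is the composability point.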

Too · 2025-11-06T21:06:29 1762463189

Check out https://prql-lang.org/

It solves all the warts of SQL while still being true to its declarative execution. Trailing commas, from statement first so it reads as a composable pipeline, temporary variables for expressions, intuitive grouping.

asadm · 2025-11-05T21:27:53 1762378073

Is it a language problem though? It's just a lack of tooling.

theLiminator · 2025-11-05T21:38:40 1762378720

The dataframe paradigm (a good example being polars) is another good alternative that's more composable (imo).

fifilura · 2025-11-06T02:30:52 1762396252

It is true. I still hate it. I think because it always offers 10 different ways to do the same thing. So it is just too much to remember.

gnulinux · 2025-11-06T16:18:47 1762445927

Even in this thread people underestimate how good e.g. DuckDB can be if you swallow its quirks. Yeah, SQL has many problems, but with a slightly extended language with QoL features and seamless parallelism, DuckDB is extremely productive if you want to crunch a bunch of numbers in jobs that take on the order of minutes, hours, etc. (not real time).

Sometimes when I have a problem, I just generate a bunch of "possible solutions" with a constraint solver (e.g. MiniZinc), which produces GBs of CSVs describing those solutions, then let DuckDB analyze which ones are suitable. DuckDB is amazing.

taeric · 2025-11-06T00:09:53 1762387793

More generally, the key here is that the more magic you want in the execution of your code, the more declarative you want the code to be. And SQL is pretty much the poster child declarative language out there.

Term rewriting languages probably work better at this than I would expect? It is kind of sad how little experience I have built up with that sort of thing. And I think I'm above a large percentage of developers out there.

dvrp · 2025-11-05T22:03:19 1762380199

If you want to work in data engineering for massive datasets (many petabytes) pls hit me up!

fifilura · 2025-11-06T13:12:51 1762434771

Sorry, wrong continent :)