This all just sounds like the problems we see when building any new feature for customers: a feature is never objectively done, there are many opinions on whether it's good or bad, once it's released its mistakes can stick with it, etc.
If this is a wicked problem, then so is much of other real-world engineering.
To be clear, is AI actually at play here, aside from the fact that the repo is for Gemini? It just looks like two simple rules that interact poorly, that we could've seen in 2015.
Well, it's even more ironic given that AI in general is touted as smart. I'd fully expect such bots to notice they're in a loop and for one to throw in the towel. Still a long way to AGI. And to AI, for that matter.
The results at https://www.mattmahoney.net/dc/text.html explicitly add the size of the compressor itself to the result. Note the "enwik9+prog" column. That's what it's ranked on.
The reason to do this is that it's trivial to create a compressor that 'compresses' a file to 0 bytes: just ship an executable with a copy of enwik9 inside that writes it out given any input. So we always measure what is effectively the Kolmogorov complexity: the data and program as a whole that produce the result we want.
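To make the scoring rule concrete, here's a minimal sketch (all sizes are illustrative round numbers, not the benchmark's actual figures): once program size is counted, the "embed the corpus in the executable" trick scores worse than an honest compressor.

```rust
// The benchmark's scoring rule: compressed output plus the size of the
// compressor itself (the "enwik9+prog" column).
fn score(compressed_bytes: u64, program_bytes: u64) -> u64 {
    compressed_bytes + program_bytes
}

fn scoring_demo() -> bool {
    let corpus = 1_000_000_000u64; // enwik9 is 10^9 bytes

    // Honest adaptive compressor: small program, large-ish output.
    let honest = score(115_000_000, 200_000);

    // "Cheating" compressor: 0-byte output, but the whole 1 GB corpus
    // ships inside the executable, so the program size pays for it.
    let cheat = score(0, corpus + 200_000);

    honest < cheat
}
```

The numbers are made up, but the point holds for any real entry: the dictionary has to live somewhere, and the scoring rule makes sure it's counted.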
So those results add in the compressor size. The programs there generally have no dictionary built in, or, in the case of LLM-based compressors, no pre-trained data. They effectively build the model as they process the data: compressing very little at the start, then better and better as they go. This is why these programs do better with larger data sets. They start with zero knowledge, and after a GB or so they have a very good model of the corpus of human language.
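The "starts with zero knowledge, improves as it goes" behavior can be sketched with the simplest possible adaptive model, an order-0 byte model: each byte costs about -log2(p) bits, where p comes from counts of the bytes seen so far. (This is a toy illustration, not any of the actual benchmark entries.)

```rust
// Order-0 adaptive model: estimate the coding cost in bits of a stream,
// updating byte counts as we go. Early bytes cost ~8 bits; repeated
// bytes get cheaper as their counts accumulate.
fn adaptive_cost_bits(data: &[u8]) -> f64 {
    let mut counts = [1u64; 256]; // Laplace smoothing: start uniform
    let mut total = 256u64;
    let mut bits = 0.0;
    for &b in data {
        let p = counts[b as usize] as f64 / total as f64;
        bits -= p.log2(); // cost of coding this byte under current model
        counts[b as usize] += 1; // learn from it
        total += 1;
    }
    bits
}
```

Run it on a run of identical bytes and the second half costs far fewer bits than the first: the model has learned. A real entry does this with a vastly better model, but the shape is the same, which is why more data helps them so much.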
This program here, however, is pre-trained and ships with a model. It's 150 MB in size! That means it has 150 MB of extra starting knowledge over the models on that list. The top models on that list are the better compressors; they'd quickly outlearn and overtake this compressor, they just don't have that head start.
Of course, to measure fairly, this should be listed with that 150 MB program size added to its results when doing a comparison.
The complexity of a simple Turing machine is itty bitty, and you can bootstrap that into an x86 emulator in a matter of kilobytes, so when we're messing with 100 MB files it's not a big factor.
> Responses to my publication submissions often claimed such problems did not exist
I see this often even in communities of software engineers, where people who are unaware of certain limitations at scale will announce that the research is unnecessary.
Solids and liquids mostly don't compress, so as a general rule they can handle those pressures without experiencing any real mechanical stress: they instantly provide a perfectly matching internal pressure that balances the forces to zero.
It’s mostly things that contain gases that can get crushed by high pressure. Almost any type of closed cell foam for example, will either collapse to a small size or crack and crumble apart depending on how rigid it is.
Living things tend to get harmed by pressure changes because they have compressible gases and/or biological compartments that contain things that experience phase changes between gas and liquid at different pressures.
Even so, wouldn't you expect that you could crush an open empty beer bottle by putting a heavy enough weight on it? A human can't do it, but I would expect an elephant can.
The beer in a full bottle pushes outward with quite a lot of pressure, but in an empty one that little bit of air is compressible, which is probably enough to let it collapse at some point.
I'll be honest; I have no idea how to estimate that. I'm sure there are folks on here who can (and might). It's probably not as deep as you'd think.
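A rough back-of-envelope is actually doable; here's a sketch where every number is an assumption: an elephant putting ~1000 kg of its weight on one foot, bearing straight down on the bottle's glass cross-section (an annulus of outer diameter ~26 mm with ~3 mm walls).

```rust
// Average compressive stress (MPa) on an annular glass cross-section
// under a vertical load. All inputs are rough guesses, not measurements.
fn stress_mpa(load_kg: f64, outer_d_mm: f64, wall_mm: f64) -> f64 {
    let force_n = load_kg * 9.81; // weight of the load in newtons
    let r_o = outer_d_mm / 2.0 / 1000.0; // outer radius in metres
    let r_i = r_o - wall_mm / 1000.0; // inner radius in metres
    let area_m2 = std::f64::consts::PI * (r_o * r_o - r_i * r_i);
    force_n / area_m2 / 1.0e6 // pascals -> megapascals
}
```

With those guesses this comes out around 45 MPa. Soda-lime glass is very strong in pure compression (hundreds of MPa), so a perfectly axial load might survive, but real bottles fail from bending and surface flaws at far lower tensile stresses, so an uneven elephant step would very likely crack it.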
What does this sort of language complexity mean for future changes in Rust? In C++, its existing complexity makes new changes so much more difficult. Is Rust reaching a similar place?
Can you be more specific? What language complexity are you referring to? Plenty of things in this post are hypothetical, not actual features being surfaced by the language.
Rust only supports & and &mut references. This was a deliberate choice early in the development of the language.
&own, &pin, and &uninit are proposals for additional pointer types. They don't actually exist in the type system right now, but other parts of the compiler do have to care about them. Another blog post that floated around here about a month ago called these "inconceivable types"[0]; adding them to the type system would allow formally extending these behaviors across function boundaries.
Like, right now, implementers of Drop can't actually move anything out of the value that's about to be destroyed. The Drop trait gets a &mut, but what we really want is to say "destroy this value over there". Rust's type system cannot understand that you own the value but not the place it lives in. What you need is an "owned reference" - i.e. &own, where the borrow checker knows that you can safely move out of it because it's going to get destroyed anyway.
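A small sketch of the limitation (the `Channel`/`Guard` names are made up for illustration): because `Drop::drop` only gets `&mut self`, the usual workaround is to wrap the field in an `Option` purely so it can be `take`n out, paying for an extra runtime state that `&own` would make unnecessary.

```rust
struct Channel;

impl Channel {
    // Consumes the channel by value -- a "destroy this value" operation.
    fn send(self) -> &'static str {
        "sent"
    }
}

struct Guard {
    // Option is here solely so Drop can move the Channel out via take().
    channel: Option<Channel>,
}

impl Drop for Guard {
    fn drop(&mut self) {
        // Through &mut self we cannot write `self.channel.send()` if the
        // field were a bare Channel -- that would be a move out of a
        // borrowed place. take() smuggles the value out instead.
        if let Some(ch) = self.channel.take() {
            let _ = ch.send();
        }
    }
}

fn drop_demo() -> bool {
    let g = Guard { channel: Some(Channel) };
    drop(g); // runs Drop::drop, which consumes the inner Channel
    true
}
```

With a hypothetical `&own self`, the borrow checker would know the value is about to die and allow the move directly, with no `Option` and no `None` state to represent.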
Rust also can't support constructors, for the same reason. What we really have are factory functions: you call them, they return a value, you put it somewhere. This is good enough that Rust users just treat factory functions as if they were constructors, but we can't do "placement new" type construction with them, or partial initialization. At least not in a way the type system can actually check.
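To illustrate the gap (with a made-up `Config` type): a factory function builds the value and returns it, while today's closest thing to placement/partial initialization goes through `MaybeUninit` and raw pointers, which the type system cannot check; nothing stops you from forgetting a field, which is roughly what `&uninit` would formalize.

```rust
use std::mem::MaybeUninit;
use std::ptr::addr_of_mut;

#[derive(Debug, PartialEq)]
struct Config {
    retries: u32,
    verbose: bool,
}

impl Config {
    // A factory function: builds the value locally and returns it; the
    // caller then moves it wherever it should live.
    fn new() -> Config {
        Config { retries: 3, verbose: false }
    }
}

// "Placement"-style construction today: initialize field by field into
// caller-provided storage. Entirely unchecked -- the compiler would not
// complain if a field write below were deleted.
fn init_in_place(slot: &mut MaybeUninit<Config>) -> &mut Config {
    let p = slot.as_mut_ptr();
    unsafe {
        addr_of_mut!((*p).retries).write(3);
        addr_of_mut!((*p).verbose).write(false);
        slot.assume_init_mut() // we promise every field is initialized
    }
}
```

The factory path is safe but always constructs-then-moves; the in-place path avoids the move but trades away all the checking, which is the tradeoff the comment is describing.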
&pin is a first-class version of Pin<T>. Rust was originally designed under the assumption that any type can be memcpy'd at any time; but it turns out not being able to move types is actually super useful. Fortunately, it also turned out you could use smart pointers to pin types, which was 'good enough' for what it was being used for - async code.
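The smart-pointer mechanism looks like this in practice, as a minimal sketch: `PhantomPinned` opts the type out of `Unpin`, and once it's behind `Pin<Box<T>>`, safe code can never get a bare `&mut T` to move it again.

```rust
use std::marker::PhantomPinned;
use std::pin::Pin;

struct SelfRef {
    data: String,
    // Opting out of Unpin: moving this value after pinning would be
    // unsound if it held pointers into itself (as generated futures do).
    _pin: PhantomPinned,
}

fn pin_demo() -> String {
    let pinned: Pin<Box<SelfRef>> = Box::pin(SelfRef {
        data: "hello".to_string(),
        _pin: PhantomPinned,
    });
    // Safe code can get Pin<&SelfRef> or &SelfRef from here, but never a
    // bare &mut SelfRef, so the value can never be moved out again.
    pinned.as_ref().get_ref().data.clone()
}
```

This is the library-level workaround; the `&pin` proposal would make "a reference you cannot move out of" a first-class reference type instead of a wrapper.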
Actually, the blog post that coined "inconceivable types" was specifically talking about writing async functions without async. It turns out Future impls encode a lot of details Rust's type system can't handle - notably, self-borrows. If a value is borrowed across an await, what's the type of the variable that got borrowed from? It's really a negative type: borrowing T to get &T also turns T into !'a T that you can't access until 'a ends. Each borrowed reference is paired to a debt that needs to be paid back, and to do that you need lifetime variables and syntax to explicitly say "pay back this debt by ending this borrow's lifetime".
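The borrow-as-debt view shows up even in ordinary Rust, as a tiny sketch: while a borrow of `owner` is live, `owner` is effectively that inaccessible `!'a T`; only when the borrow's lifetime ends is the "debt" repaid and the value usable again. What futures need, and the type system can't express, is naming that in-debt state across an `.await`.

```rust
fn debt_demo() -> String {
    let mut owner = String::from("data");
    {
        let borrowed: &String = &owner; // owner is now "in debt" to 'a
        // owner.push('x'); // ERROR while `borrowed` is live: debt unpaid
        let _ = borrowed.len();
    } // the borrow ends here: debt repaid, owner is accessible again
    owner.push('!');
    owner
}
```

Inside a function the compiler tracks this implicitly; a hand-written `Future` struct would have to give the borrowed-from field a type, and no such type exists today.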
How much of this complexity is actually needed is another question. There's a problem that each and every one of these reference types (or, anti-types) is intended to solve. Obviously if we added all of them, we'd overcomplicate the type system. But at the same time, the current Rust type system is already known to be oversimplified, to the point where we had to hack in pinning for async. And it's already kind of ridiculous to say "Well, async is all special compiler magic" because it prevents even reasonable-sounding tweaks to the system[1].
[1] For example, async does not currently have a way to represent "ambient context" - i.e. things we want the function to be able to access but NOT hold onto across yields. That would require a new Future trait with a different poll method signature, which the current Rust compiler doesn't know how to fill or desugar to. So you have to use the core Future trait and signature which doesn't support this kind of context borrow.
I think only a small percentage of users care enough about running LLMs locally to pay for extra hardware, put up with slower and lower-quality responses, etc. It'll never be as good as non-local offerings, and it's more hassle.