I have always been in favor of changing the definition of incorporation to ensure that over time ownership transfers slowly but increasingly to the employees of the corporate entity. How that would work, though, would require detailed thought by experts more knowledgeable than I :)
It's so nice to see this echoed somewhere. This has been what I've been calling them for a while, but it doesn't seem to be the dominant view. Which is a shame, because it is a seriously accurate one.
The benefit of cloud has always been that it allows the company to trade capex for opex. From an engineering perspective, you gain scalability at the cost of complexity, but this is a secondary effect compared to the capex/opex tradeoff.
Hetzner is also a cloud. You avoid buying hardware; you rent it instead. You can rent either VMs or dedicated servers, but in both cases you own nothing.
How are you guys spinning up VMs, specifically Windows VMs, so quickly? I used to use VirtualBox back in the day, but that was a pain and required a manual Windows OS install.
I'm a few years out of the loop, and would love a quick point in the right direction : )
A lot of the world has moved on from VirtualBox to primarily QEMU+KVM and, to some extent, Xen, usually with some higher-level tool on top. Some of these are packages you can run on your existing OS and some are distributions with a built-in hypervisor for people who use VMs as part of their primary workflows. If you just want a quick-and-easy one-off Windows VM and to move on, check out quickemu.
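For what it's worth, a minimal sketch of that path wrapped in Python (assuming quickget/quickemu are installed and on PATH; the exact name of the generated .conf file may differ on your setup):

```python
import subprocess

# Sketch: fetch Windows installer media and boot a one-off VM via quickemu's CLI tools.
# Assumes quickget/quickemu are installed; the generated config filename may differ.
subprocess.run(["quickget", "windows", "11"], check=True)            # download media + write a .conf
subprocess.run(["quickemu", "--vm", "windows-11.conf"], check=True)  # boot the VM from that config
```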
Not sure about Windows, but I solved it for myself with a basic provisioning script (could also be an Ansible playbook) that installs everything on a fresh Linux VM in a few minutes. For macOS, there is Tart, a VM tool that works well on arm64 (very little overhead compared to alternatives). It could also be a rented cloud VM in a nearby location with low latency. Being a Neovim user also helped: I didn't have to worry about file sync when editing.
For coding I normally run Linux VMs, but Windows should be doable as well. If you do a fresh install every time then sure, it takes a lot of time, but if you keep the install around in VirtualBox then it's almost as fast as rebooting a computer.
Also, you can spin up an EC2/Azure/Google VM pretty easily too. I do this frequently and it only costs a few bucks. Often it's more convenient to have it in the data center anyway.
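For anyone who hasn't done it, a minimal boto3 sketch of the EC2 route (the AMI ID, key pair name, and instance type below are placeholders, not recommendations):

```python
import boto3

# Sketch: launch a throwaway EC2 VM. Substitute your own AMI, key pair, and region.
ec2 = boto3.client("ec2", region_name="us-east-1")

resp = ec2.run_instances(
    ImageId="ami-0123456789abcdef0",  # placeholder AMI (pick a Windows or Linux image)
    InstanceType="t3.medium",
    KeyName="my-keypair",             # placeholder key pair name
    MinCount=1,
    MaxCount=1,
)
instance_id = resp["Instances"][0]["InstanceId"]
print("Launched", instance_id)

# Terminate it when you're done so it really does only cost a few bucks:
# ec2.terminate_instances(InstanceIds=[instance_id])
```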
Yesterday I used ChatGPT to transform a CSV file: move around a couple of columns, add a few new ones. Very large file.
At first glance it got them all right. But when I really looked through the data, for 3 of the cells it had clearly just made up new numbers. I found the first one by accident; finding the remaining two took longer than it would have taken to modify the file from scratch myself.
Watching my coworkers blindly trust output like this is concerning.
After we fix all the simple specious reasoning of stuff like Alexander-the-Great and agree to outsource certain problems to appropriate tools, the high-dimensional analogs of stuff like Datasaurus [0] and Simpson's paradox [1] etc. are still going to be a thing. But we'll be so disconnected from the representation of the problems that we're trying to solve that we won't even be aware of the possibility of any danger, much less able to actually spot it.
My take-away re: chain-of-thought specifically is this. If the answer to "LLMs can't reason" is "use more LLMs", and then the answer to problems with that is to run the same process in parallel N times and vote/retry/etc, it just feels like a scam aimed at burning through more tokens.
Hopefully chain-of-code[2] is better in that it's at least trying to force LLMs into emulating a more deterministic abstract machine instead of rolling dice. Trying to eliminate things like code, formal representations, and explicit world-models in favor of implicit representations and inscrutable oracles might be good business, but it's bad engineering.
> it just feels like a scam aimed at burning through more tokens.
I have a growing tin-foil-hat theory that the business model of LLMs is the same as the 1-900 psychic numbers of old.
For just 25¢, 1-900-PSYCHIC will solve all your problems in just 5 minutes! Still need help?! No problem! We'll keep working with you for only 10¢ a minute until you're happy!
To me the problem is: if a piece of information is not well represented in the training data, the LLM will always tend towards bad token predictions related to that information. I think the next big thing in LLMs could be figuring out how to tell whether a token was just a "fill in" or guess versus a well-predicted token. That way you could have some sort of governor that kills a response if it is getting too guessy, or at least provides some other indication that the provided tokens are likely hallucinated.
Maybe there is some way to do it based on the geometry of how the neural net activated for a token, or some other more statistics-based approach; idk, I'm not an expert. (A crude sketch of the statistics-based flavor is below.)
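Not an expert either, but here is a very rough sketch of the thresholding idea, assuming the sampler exposes per-token log-probabilities. The cutoffs are made up for illustration, and low probability is of course not the same thing as hallucination:

```python
import math

# Rough sketch of a "guessiness governor": flag (or abort) a response when too many
# tokens were sampled with low probability. Assumes you can get per-token
# log-probabilities from the model; the thresholds are arbitrary illustrations.
LOW_PROB = 0.2          # a token chosen with < 20% probability counts as a "guess"
MAX_GUESS_FRACTION = 0.15

def too_guessy(token_logprobs: list[float]) -> bool:
    """Return True if the fraction of low-probability tokens exceeds the budget."""
    if not token_logprobs:
        return False
    guesses = sum(1 for lp in token_logprobs if math.exp(lp) < LOW_PROB)
    return guesses / len(token_logprobs) > MAX_GUESS_FRACTION

# Toy example: mostly confident tokens with a couple of long-shot picks.
logprobs = [-0.05, -0.1, -2.5, -0.2, -3.0, -0.1]
print(too_guessy(logprobs))  # True for this toy input (2 of 6 tokens are "guesses")
```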
A related topic you might want to look into here is nucleus sampling. Similar to temperature but also different... it's been surprising to me that people don't talk about it more often, and that lots of systems won't expose the knobs for it.
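For anyone unfamiliar, a minimal numpy sketch of what nucleus (top-p) sampling does: keep only the smallest set of highest-probability tokens whose cumulative probability reaches p, renormalize, and sample from that set.

```python
import numpy as np

def nucleus_sample(probs: np.ndarray, top_p: float = 0.9, rng=None) -> int:
    """Sample a token index using nucleus (top-p) sampling.

    Keeps the smallest set of highest-probability tokens whose cumulative
    probability is >= top_p, renormalizes, and samples from that set.
    """
    rng = rng or np.random.default_rng()
    order = np.argsort(probs)[::-1]                   # indices sorted by descending probability
    sorted_probs = probs[order]
    cumulative = np.cumsum(sorted_probs)
    cutoff = np.searchsorted(cumulative, top_p) + 1   # how many tokens make up the nucleus
    nucleus = sorted_probs[:cutoff] / sorted_probs[:cutoff].sum()
    return int(order[rng.choice(cutoff, p=nucleus)])

# Toy example: four tokens; with top_p=0.9 the rarest token is excluded entirely.
probs = np.array([0.5, 0.3, 0.15, 0.05])
print(nucleus_sample(probs, top_p=0.9))
```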
Yup, I always take editing suggestions and implement them manually, then re-feed the edited version back in for new suggestions if needed. Never let it edit your stuff directly: the risk of stealth random errors sneaking in is too great.
Just because every competent human we know would edit ONLY the specified parts, or move only the specified columns with a cut/paste operation (or similar deterministically reliable operation), does not mean an LLM will do the same; in fact, it seems to prefer to regenerate everything on the fly. NO, just NO.
I don't mean to be rude, but this sounds like user error. I don't understand why anyone would use an LLM for this - or at least, why you would let the LLM perform the transformation.
If I were trying to do something like this, I would ask the LLM to write a Python script and validate the output by running it against the first handful of rows (like `head -n 10 thing.csv | python transform-csv.py`); a rough sketch of such a script is below.
There are times when statistical / stochastic output is useful. There are other times when you want deterministic output. A transformation on a CSV is the latter.
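To make that concrete, here's a sketch of the kind of script you might have the LLM write; the column names ("price", "qty", "total") are made up for illustration, not from the parent's file:

```python
import csv
import sys

# Sketch of a deterministic CSV transform: reorder a couple of columns and add a
# derived one. Swap the placeholder column names for whatever the real file contains.
def transform(reader: csv.DictReader, writer: csv.DictWriter) -> None:
    writer.writeheader()
    for row in reader:
        row["total"] = str(float(row["price"]) * float(row["qty"]))  # new derived column
        writer.writerow(row)

if __name__ == "__main__":
    reader = csv.DictReader(sys.stdin)
    # Put qty/price first, keep the remaining columns in order, append the new one.
    fields = ["qty", "price"] + [f for f in reader.fieldnames if f not in ("qty", "price")] + ["total"]
    writer = csv.DictWriter(sys.stdout, fieldnames=fields)
    transform(reader, writer)
```

Preview it exactly as suggested above (`head -n 10 thing.csv | python transform-csv.py`) before letting it loose on the very large file.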
It wouldn't be a difficult situation if these guys were ethical shops from the get-go, but they aren't; they're trying to staple minimally required ethics on afterwards, and it shows.
To play devil’s advocate, what ethical safeguards are OpenAI responsible for that they have failed to implement?
This is a wild and difficult to understand technology, even for the people building it. And their safeguards are constantly evolving.
I think you’re attributing to malice what should be attributed to people commercializing a novel technology that is, frankly, being exploited by users.
This is prime HR-style lying. The response is: problem statement; claim that reality is the opposite of the problem statement, with no justification given, despite obvious evidence to the contrary; statement that if reality doesn't match their claim, the worker is at fault. End of statement.
> While some worry AI will dehumanize the hiring process, we believe the opposite.
Look at the language Coinbase uses. Only their view is a "belief." The opposing view is a "worry." Others are motivated by fear. Only holy Coinbase is motivated by love!
This is, of course, doublethink. We all know that removing humans from the hiring process is, by definition, dehumanizing.
Coinbase's article would have been more palatable if it were truthful:
> Some believe AI will dehumanize the hiring process. We agree, and we're SO excited about that! I mean, we aren't in this business to make friends. We're in it to make cold, hard cash. And the less we have to interact with boring, messy human beings along the way, the better! If you're cold, calculating and transactional like us, sign on the dotted line, and let's make some dough!
But if they were that truthful, fun, and straightforward, they'd probably be more social, and they wouldn't have this dehumanizing hiring process to begin with.
The fact that a communist dictatorship declares itself to be a benevolent people's paradise doesn't change the brutal reality one bit. And unlike living under a communist dictatorship, we don't have to accept it. I will strongly vote for those who make this shit illegal.
SWIM worked as a PM at a company that decided to redo their UI. They ran into an issue on internal rollout, where they discovered their support team had for years been doing SQL injection through a specific form in the UI in order to run reports on the company's database. They had to stop the rollout and productionize the support team's (very valid) use cases in order to remove the SQL-injectable form.
I think you've drawn the wrong conclusions from the history of the web.
The web started out idealistic, and became what it did because of underregulated market forces.
The same thing will happen to ai.
First, a cool new technology that is a bit dubious. Then a consolidation, even if or while local models proliferate. Then degraded quality as utility is replaced with monetization of responses, except with an LLM you won't have the ability to either block the ads or assess the honesty of the response.
Not the commenter, but saying the market was unregulated does not imply that a regulated market would solve it. But I also agree that unregulated market forces are the best way to describe what happened to the internet.