Hacker News | willmarquis's comments

The thread is missing the forest for the trees. The interesting bet here isn't git checkpoints—it's that someone is finally building the observability layer for agent-generated code.

Most agent frameworks (LangChain, Swarm, etc.) obsessed over orchestration. But the actual pain point isn't "how do I chain prompts"—it's "what did the agent do, why, and how do I audit/reproduce it?"

The markdown-files-in-git crowd is right that simple approaches work. But they work at small scale. Once you have multiple agents across multiple sessions generating code in production, you hit the same observability problems every other distributed system hits: tracing, attribution, debugging failures across runs.

The $60M question is whether that problem is big enough to justify a platform vs. teams bolting on their own logging. I'm skeptical—but the underlying insight (agent observability > agent orchestration) seems directionally correct.


@dang With the launch of open claw I've seen so many more LLM slop comments. I know meta comments like mine aren't usually encouraged, but I think we need to do something about this as a community. Is there anything we can do? (Either a ban or at least required full disclosure for bot comments would be nice.)

EDIT: I suspect the current "solution" is to just downvote (which I do!), but I think people who don't chat with LLMs daily might not recognize their telltale signs so I often see them highly upvoted.

Maybe that means people want LLM comments here, but it severely changes the tone and vibe of this site and I would like to at least have the community make that choice consciously rather than just slowly slide into the slop era.


Parent comment has the rhythm of an AI comment. I hadn't noticed until you mentioned it. I seem to be more attuned to LLM slop on Twitter, which is usually much worse. But on a second look it's clear, and the comment also takes no real stance and is very generic.

@dang I would welcome a small secondary button for community-flagging a comment as AI, just so we know.


The moltbook-ification of every online forum seems inevitable this year. I wish we had a counter to this.

I've been thinking about this. One solution I wonder about: put a really hard problem in the sign-up flow that humans couldn't solve; if it gets solved during signup, it's a bot. Basically a flipped captcha, though I'm not sure how tf to actually build that, and I suspect it would only work for so long.

It's the dead internet theory in action. Every time I see slop I comment on it. I've found people don't always like it when you comment on it.

Yes, I usually just bite my tongue and downvote, but with the launch of open claw I think the amount of slop has increased dramatically, and we need to deal with it sooner rather than later.

Do you really think openclaw is to blame? I shudder to think of how few protections HN has against bots like that.

Thank you for pointing this out. I didn't catch that the parent comment was AI either and upvoted it. I changed it to a downvote after seeing your comment and realizing the comment did indeed have many AI flags.

Nothing about the parent comment suggests AI, except the em dash, but that's just regular old punctuation that predates AI.

How much experience do you have interacting with LLM generated prose? The comment I replied to sets off so many red flags that I would be willing to stake a lot on it being completely LLM generated.

It's not just the em dashes; it's the cadence, tone, and structure of the whole comment.


Yeah, it's really frustrating how often I see kneejerk rebuttals assuming others are basing the call solely on the presence of em-dashes. That's usually a secondary data point. The obvious tells are more often structure/cadence, as you say, and, most importantly by far, a clear pattern of similar "AI smell" comments in their history that makes it 100% obvious.

I didn’t catch it until seeing these flag-raising comments… checking the other comments from the last 8 hours, it’s Claw for sure.

Punchy sentence. Punchy sentence. It's not A, it's B.

The actual insight isn't C, it's D.


You're absolutely right! It's not the tooling, it's the platform.

This sounds awfully like an LLM generated comment.

I suppose it was just a matter of time before this kind of slop started taking over HN.


> Once you have multiple agents across multiple sessions generating code in production, you hit the same observability problems every other distributed system hits: tracing, attribution, debugging failures across runs.

This has been the story for every trend empowering developers since year dot. Look back and you can find exactly the same said about CD, public cloud, containers, the works. The 'orchestration' (read: compliance) layers always get routed around. Always.


It's not this, it's that?

Verbatim LLM output with little substance to it. HN mods don't want us to be negative, but if this is what we have to take seriously these days, it's hard to say anything else.

I guess I could just not comment at all, but that feels like letting the platform sink into the slopocalypse?


A. B isn't C—it's D1.

E. But F, G: H1, H2...

I. J—but D2 seems K.


Yes—it is!

I thought everyone was just using OpenTelemetry traces for this? This is just a classic observability problem that isn't unique to agents. More important now, yes, but not functionally unique.
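
A minimal sketch of what I mean, using the Python OTel SDK (the span names and attributes here are made up for illustration, not any agent framework's real schema):

    from opentelemetry import trace
    from opentelemetry.sdk.trace import TracerProvider
    from opentelemetry.sdk.trace.export import SimpleSpanProcessor, ConsoleSpanExporter

    # Wire up a tracer that dumps spans to stdout; swap the exporter
    # for OTLP in anything real.
    provider = TracerProvider()
    provider.add_span_processor(SimpleSpanProcessor(ConsoleSpanExporter()))
    trace.set_tracer_provider(provider)
    tracer = trace.get_tracer("agent-run")

    def run_agent(task: str) -> None:
        # One span per agent session, one child span per tool call:
        # that alone buys you tracing, attribution, and cross-run debugging.
        with tracer.start_as_current_span("agent.session") as session:
            session.set_attribute("agent.task", task)
            with tracer.start_as_current_span("agent.tool_call") as call:
                call.set_attribute("tool.name", "git_commit")

    run_agent("refactor auth module")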

Can you explain more how otel traces solve this problem? I don't understand how it's related.

Ok, I'll grant you that if they can get agents to somehow connect to each other's reasoning in realtime, that would be useful. Right now it's me who has to play reasoning container.

This is interesting. I’m experimenting with something adjacent in an open source plugin, but focused less on orchestration and more on decision quality.

Instead of just wiring agents together, I require stake and structured review around outputs. The idea is simple: coordination without cost trends toward noise.

Curious how entire.io thinks about incentives and failure modes as systems scale.


That is a sharp observation———it is the observability that matters! The question arises: Who observes the observers? Would you like me to create MetaEntire.ai———an agentic platform that observes Entire.io?

I think you need a few more em-dashes there to be safe

I think we need an Agent EE Server Platform. :P

Wholeheartedly agree. We have been working hard at a solution towards this and welcome any feedback and skepticism: https://github.com/backbay-labs/clawdstrike

Do you know when this will be available on Basalt? They haven't said anything about it yet.


Waiting for the ranking on the lmsys chat arena! The only source of truth.


Bellmac-32 went 32-bit CMOS when everyone else was still twiddling 8-bit NMOS, then got shelved before the afterparty. IEEE giving it a milestone in 2025 is basically a lifetime achievement trophy for the domino-logic DNA inside every phone SoC today. Late, but deserved.


Flatpak’s biggest bug isn’t in the code, it’s the bus factor.

> Tons of features are stuck in merge-request limbo because there just aren’t enough reviewers, and if we don’t swap some “+1”s for actual PR reviews (or funding), we’ll be shipping apps in 2030 with a sandbox frozen in 2024 while everything else rides OCI.


This is just a reminder that memory isn’t just a constraint, it’s a resource.


Which resources are available (or not), and in what quantities, are the most basic constraints on solving a problem with a computer.


Exposing unauthenticated /heapdump endpoints in production is a rookie mistake, especially for a service handling sensitive government comms. The presence of MD5 hashes and legacy tech like JSP just adds to the picture of poor security hygiene. This breach is a textbook case of why defense-in-depth and regular audits are non-negotiable.
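
(Assuming a Spring Boot app, where /heapdump ships as an Actuator endpoint, locking it down is a couple of properties; the lines below are stock Boot 2.x/3.x property names, sketched from memory:)

    # application.properties: only expose what you actually need over HTTP...
    management.endpoints.web.exposure.include=health,info
    # ...and switch heapdump off entirely in production
    management.endpoint.heapdump.enabled=false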


Don't hate on JSP.

Java Server Pages is now Jakarta Server Pages, part of Jakarta EE (formerly Java EE), and its latest version, 11, was released just a year ago. Spring Framework 7 will be released by the end of 2025 and be based on it. Tomcat 11 is already based on it as well.

And all of this is based on the thriving Java ecosystem.

Version 12 is under development.

If they kept their stuff updated, nothing about this is legacy. It just declined in popularity.

You can build insecure trash and expose unprotected endpoints with next.js, or whatever is currently considered state of the art, as well.


Really impressive evolution of a crucial service. The architectural and UX improvements are well thought out, especially the focus on resilience and scalability. Love the transparency around the decision-making process, too; Troy's commitment to keeping HIBP fast, free, and useful is a great example of public-interest software done right. The migration to .NET 8 and the use of Cloudflare for caching shows how mature and modern the stack is becoming.


Finally, someone had the courage to disrupt the tyranny of the modulo operator. Who needs n % 2 === 0 when you can invoke a large language model and incur network latency, token limits, and API costs to answer the age-old question: is this number even? Truly, we’re living in the future.


Interesting take. The visualization of the inverse tree highlights just how sparse the “preimage space” is under Collatz iterations. The idea that this sparsity contributes to the apparent randomness is compelling. I’m curious whether modeling the process modulo powers of 2 and 3, or via 2-adic analysis, could formalize some of these heuristic observations. Also, the assumption that most numbers “fall off” rapidly aligns with empirical behavior, but it’s still not clear how to bound exceptional trajectories.
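
For anyone who wants to poke at the sparsity claim, here's a quick sketch (Python, assuming the standard step n -> n/2 for even n, 3n+1 for odd n, and skipping the trivial 1-4-2 cycle):

    def preimages(m):
        # Everything that reaches m in one Collatz step.
        preds = [2 * m]                   # 2m is even and halves to m
        if m % 3 == 1:
            k = (m - 1) // 3
            if k % 2 == 1 and k > 1:      # odd k with 3k + 1 = m; k > 1 avoids the 1-4-2 cycle
                preds.append(k)
        return preds

    # Breadth of the inverse tree rooted at 1 vs. the 2^depth upper bound:
    level = {1}
    for depth in range(1, 25):
        level = {p for m in level for p in preimages(m)}
        print(depth, len(level), 2 ** depth)

Empirically the level sizes grow far more slowly than 2^depth, which is the sparsity the visualization is getting at.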

