Despite cryptocurrency still not really showing any utility, these tech companies want so, so badly for it to catch on.
It feels like the entirety of cryptocurrency, outside of being a thing people used to buy drugs, has been an example of Chesterton's Fence, with half of Silicon Valley in denial of this fact.
And this, ladies and gentlemen, is why SaaS investors don't understand how to invest in deeptech/hardtech, despite current trends. Like this guy, they have no clue about the differences in business model, except they're not founders, so they don't go through the pain themselves and they mostly never learn.
Hats off to the author for making it through! What a start to the journey!
Except it's not, because it's constantly ambiguous in computing.
E.g. Macs measure file sizes in powers of 10 and call them KB, MB, GB. Windows measures file sizes in powers of 2 and calls them KB, MB, GB instead of KiB, MiB, GiB. Advertised hard drives come in powers of 10.
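To make the ambiguity concrete, here's a quick Python sketch using the usual "500 GB drive" example (the numbers are plain arithmetic, not taken from any particular product):

    advertised = 500 * 10**9          # a "500 GB" drive, as advertised

    decimal_gb = advertised / 10**9   # 500.0   -> what macOS reports as GB
    binary_gb = advertised / 2**30    # ~465.66 -> what Windows reports as "GB" (really GiB)

    print(f"decimal: {decimal_gb:.2f} GB, binary: {binary_gb:.2f} GiB")

Same bytes, two different numbers on screen, both labelled "GB".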
Yeah :c I feel the same way. They've made a variant with a more traditional poker-deck look but the same ranks/suits as the Everdeck, which I'm excited to try one day.
Right. The alternative is not to send materials from Earth for processing in space; that would be stupid. We send finished goods, which were manufactured on the ground. But you don't mine finished widgets from asteroids. You mine ore that needs refining and processing before it can be used to manufacture anything. That ore is orders of magnitude heavier than the finished products, never mind everything required to do anything useful with it.
RNNs have two huge issues:
- long context. Recurrence degrades the signal (vanishing gradients), for the same reason that 'deep' NN architectures don't go much past 3-4 layers before you need residual connections and the like
- (this is the big one) training performance is terrible, since you can't parallelize them across a sequence the way you can with causal masked attention in transformers (rough sketch below)
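To make the parallelism point concrete, here's a toy numpy sketch; the shapes, the tanh cell, and single-head attention straight over the raw inputs are simplifications for illustration, not anyone's actual architecture:

    import numpy as np

    T, d = 128, 64                    # sequence length, hidden size (made up)
    x = np.random.randn(T, d)
    W_h = np.random.randn(d, d) * 0.01
    W_x = np.random.randn(d, d) * 0.01

    # RNN: step t needs the hidden state from step t-1, so this loop is
    # inherently sequential -- T dependent matmuls, one after another.
    h = np.zeros(d)
    hidden_states = []
    for t in range(T):
        h = np.tanh(h @ W_h + x[t] @ W_x)
        hidden_states.append(h)

    # Causal masked attention: position t only reads *inputs* at positions
    # <= t, not a hidden state that had to be computed first, so the whole
    # sequence is one big parallel matmul.
    scores = x @ x.T / np.sqrt(d)
    scores = np.where(np.tril(np.ones((T, T), dtype=bool)), scores, -np.inf)
    attn = np.exp(scores - scores.max(axis=-1, keepdims=True))
    attn /= attn.sum(axis=-1, keepdims=True)
    out = attn @ x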
On the huge benefit side though you get:
- guaranteed state size, so perfect batch packing, perfect memory use, easy load/unload from a batch, and O(1) per-token generation, which generally means massive performance gains in inference (sketch at the end)
- unlimited context (well, no need for position embeddings or any similar positional scheme)
Taking the best of both worlds is definitely where the future is: an architecture that trains in parallel, has a fixed state size so you can load/unload and pack batches perfectly, unlimited context (with perfect recall), etc. That is the real architecture to go for.
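For what the fixed-state upside looks like, a minimal sketch (numpy again; step, W_e, W_o and the shapes are made up for illustration, not any real model's API): per-sequence state is a constant-size vector, so each generated token costs the same amount of work no matter how long the sequence already is, and a sequence can be swapped into or out of a batch by replacing one row of state.

    import numpy as np

    d, vocab = 64, 1000
    W_h = np.random.randn(d, d) * 0.01
    W_e = np.random.randn(vocab, d) * 0.01   # token embeddings
    W_o = np.random.randn(d, vocab) * 0.01   # output head

    def step(state, token_id):
        # One generation step: constant work and memory, no growing KV cache.
        state = np.tanh(state @ W_h + W_e[token_id])
        return state, int(np.argmax(state @ W_o))

    # A batch is just a (batch, d) array of states; load/unload a sequence
    # by overwriting one row -- no variable-length cache to repack.
    batch_states = np.zeros((4, d))
    batch_states[2] = np.zeros(d)            # slot 2 now hosts a fresh sequence

    state, tok = np.zeros(d), 0
    for _ in range(16):
        state, tok = step(state, tok)        # O(1) per token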