Hacker News | ctas's comments

Can you share a bit more on the small LLMs you've trained? I'm interested in the applicability of current consumer hardware for local training and finetuning.


I'm not the AI expert in the company but one of my colleagues creates image segmentation models for our specific use case. I've been able to run the PyTorch training code on my computer without any issues. These are smaller models that are destined to run on Jetson boards so they're limited compared to larger LLMs.

edit: just to be clear, I can't train anything competitive with even the smallest LLMs.


Are consumer cards also benefiting from the improvements or only datacenters?


For me personally, Blender uses ROCm and ROCm HIPRT for Cycles rendering and Cycles ray-tracing acceleration. I am using a Radeon RX 6800 and an AMD Ryzen AI 7 350, and it works on Linux. ROCm HIPRT really does speed up Cycles rendering (at the cost of more VRAM usage). Of course, using the AMD driver on Linux, you get access to the system's RAM with GTT.


I've shared this example in another thread, but it fits here too. A few weeks ago, I talked to a small business owner who found out that Google's AI is telling users his company is a scam, based on totally unrelated information where a different, similarly named brand is mentioned.

We actually win customers whose primary goal is getting AI to stop badmouthing them.


A desktop environment for Linux, visually inspired by OSX Snow Leopard with a contemporary touch. It comes with a compositor, apps like a dock, finder, and status bar, and a UI framework like AppKit. Scratching my own itch and would love to see if it can gain traction. Still in the early innings though.


What framework/tech stack are you using? I'd love to see something built on top of GNUstep, which is close to what OSX was originally based on. (I don't know how much of that is still found in Snow Leopard.)


The goal is not to use a similar tech stack such as GNUstep. Instead, I'm focusing on the outcome: a desktop environment with a similar degree of polish and functionality, without the need for third-party tools.

To stay competitive and iterate fast I'm adding a high-level JS/CSS API on top of Wayland, think AppKit + SwiftUI. If you look over my shoulder it might look like I'm making a webapp, but on a custom browser.


We (Geostar.ai) work with many brands and companies that have experienced near-death situations caused by Google's AI Overviews. The negative impact this feature has had on people's livelihoods is heartbreaking to witness.

Just today, I met with a small business owner who showed me that AIO is warning users that his business is a scam, based on bogus evidence (some unrelated brands). It's a new level of bullshit. There's not much these businesses can do other than playing the new GEO game if they want to get traffic from Google.

Who knows if Google will even present any search results other than AIO a few years from now.


A desktop environment for Linux, inspired by macOS. It comes with a compositor, apps like a dock, finder, and status bar, and a UI framework like AppKit.


Would be interested to learn more about how and why this was coded in Common Lisp, in particular what value it provides for the problem being solved compared to other languages.


Ability to write at a very high level (and macros), incredibly fast when compiled, ability to use the repl on a live server for diagnosis and patching functions, language design choices work extremely well for parallel recursive hierarchical inference.


Please take into account that Bluesky is still young and run by a small team that is moving fast. Ignoring a few reports that they consider non-critical is not a strong negative signal, especially not during a time of rapid growth and while they're in beta and invite-only. They just passed a million users and are probably navigating through a lot of chaos on a daily basis. Those of us who've been in similar situations know that this is normal.


> Slack uses PHP for most of its server-side application logic […].

Slack migrated to Hacklang in 2016 [1].

[1] https://slack.engineering/hakana-taking-hack-seriously/ "We started migrating to a different language called Hack in 2016."


Isn't Hack basically compiled PHP so that it is faster for FB's use case? I understand it's not technically PHP, but I imagine it is effectively still PHP, or is it more like what C++ is to C?

I understand PHP is fast adding language features. Are there any actual language features that Hack still has over it?


At one point it was, but since then it's become its own language with a completely new (JIT-based) implementation.


Shameless plug: I've recently started my own transactional email service (https://www.markix.com), primarily targeting small senders, after having been a very happy Postmark customer for a long time. Our service is still in closed beta but delivering live emails.

I run a couple other businesses and moved all of my transactional email sending over to Markix.

Would love to have a chat with anyone who might be starting a new project and is open to trying out a new mail service (mail in bio).

