Hacker News | minimalengineer's comments

Two years ago, I worked for a company that had its own proprietary AI system for processing PDFs. While the system handled document ingestion, its real value was in extracting and analyzing data to provide various insights. However, one key requirement was rendering documents in HTML with as close to a 1:1 likeness as possible.

At the time, I evaluated multiple SDKs for both OCR and non-OCR PDF conversions, but none matched the accuracy of Adobe Acrobat’s built-in solution. In fact, at one point (don’t laugh), the company resorted to running Adobe Acrobat on a Windows machine with automation tools to handle the conversion. Using Adobe’s cloud service for conversion was not an option due to the proprietary nature of the PDFs. Additionally, its results were inconsistent and often worse than those of the desktop version of Adobe Acrobat!

Given that experience, I see this primarily as an HTML/text conversion challenge. If Gemini 2.0 truly improves upon existing solutions, it would be interesting to see a direct comparison against popular proprietary tools in terms of accuracy.


Great device — lasted 4 years, woke me at 5 AM without disturbing my kids, and handled notifications well. Battery life was about a week, and it was swim-proof. That said, it was cheap... I hope this new version isn’t part of the “dumb” device trend where people spend $500 just to detox, thinking the price will force commitment.


At the same time, I hope it's priced high enough so that the company can thrive without taking external funding. PE and VC fuck everything up.

