Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Great read. I’ve been modeling developer activity as a time series key value system where each developer is a key and commits are values. Faced the same issues: logs grow fast, indexes get heavy, range queries slow down. How do you decide what to drop when compacting segments? Balancing freshness and retention is tricky.


I'm curious how much data you have? I have 12 years of dev data and reports are generated in seconds, if not milliseconds. What is your key patterns? It sounds like a key-design problem.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: