Skip to main content

Posts

Recent posts

New top story on Hacker News: Inside FAISS: Billion-Scale Similarity Search

Inside FAISS: Billion-Scale Similarity Search 5 by tohms | 0 comments on Hacker News. Author here. I wrote this as a visual companion to the 2017 FAISS paper (Johnson, Douze, Jégou), focused on the parts I found hardest to grok from text alone. The article covers a subset of what FAISS does, with the paper as the source of truth. NSG, FastScan, IMI are not covered here, they'll get their own articles. I'd be especially interested in feedback on: - the IVFPQ / IVFADC explanation, particularly the LUT reuse argument - whether the GPU part captures enough of the actual complexity Happy to answer questions.

New top story on Hacker News: Show HN: Cost.dev (YC W21) – making agents cost-aware and cheaper to call

Show HN: Cost.dev (YC W21) – making agents cost-aware and cheaper to call 9 by akh | 1 comments on Hacker News. We launched Infracost on HN five years ago ( https://ift.tt/0PSHGKW ) where our CLI generated cost estimates for infra-as-code, e.g. "this Terraform PR adds $400/mo". The idea was to shift cloud costs (FinOps) left, so engineers get visibility of costs before deployment and make better decisions. Earlier this year we started seeing agent traffic in our logs and it looked like coding agents were calling our CLI. But that CLI wasn't designed with coding agents in mind. We went down a philosophical rabbit hole to see if a CLI is even needed anymore given that Claude, Copilot et al. already follow best practices. Ultimately we decided to create a new CLI from the ground up with coding agents in mind for two reasons: 1. We optimized the CLI for agent callers and cut Claude's output token usage by up to 79% and API cost by up to 67% versus a bare-Claude baselin...