The best Hacker News stories from Show from the past day
Latest posts:
Show HN: Unsiloed AI – #1 on olmOCR-Bench
Most of the document parsers fail on real world challenges like complex tables, handwritten documents, historical document scans, equations, multi-column layouts, complex reading order, etc. We built Unsiloed Parser to handle exactly these cases.<p>Our latest parser v3.1 achieved #1 rank and scored 88.0 strict pass-rate on olmOCR-Bench. We ran the evaluation across 1,403 PDFs and 8,413 unit tests using the unmodified upstream Allen AI scorer (olmocr==0.4.27) and found Unsiloed beats 18 other OCR services, including GPT-5.5, Claude Opus 4.7, LlamaParse, Reducto, Azure Document Intelligence, AWS Textract, and Unstructured.<p>When we dug deeper into the failure cases, we found many errors were not OCR errors but things like \frac vs \dfrac, whitespace differences, or equivalent LaTeX renderings. We ran a secondary LLM-as-Judge evaluation to classify real misses vs semantic equivalents, which lifts the corrected score to 94.8 (explained deeply in the blog post).<p>Blog with full methodology and examples: <a href="https://www.unsiloed.ai/blog/unsiloed-ai-achieves-1-rank-on-olmocr-bench-2">https://www.unsiloed.ai/blog/unsiloed-ai-achieves-1-rank-on-...</a><p>Evaluation Code for reproducibility:
<a href="https://github.com/Unsiloed-AI/unsiloed-olmocr-benchmark" rel="nofollow">https://github.com/Unsiloed-AI/unsiloed-olmocr-benchmark</a><p>Feel free to post your messiest PDFs in the comment and we'll run it through Unsiloed parser and share the output here.
Show HN: skills-for-humanity – 171 structured reasoning skills for Claude Code
Show HN: TryPost – open-source Social Media Scheduler
Show HN: A website that tracks every stock trade Congress makes
Congressional trading data is relatively commoditized, but I couldn't find any open-source version with the features I wanted.<p>The data is lagged (median 28 days from trade to disclosure, and 19% miss this deadline), but there's still interesting patterns to explore.<p>Repo: <a href="https://github.com/kadoa-org/congress-trading-monitor" rel="nofollow">https://github.com/kadoa-org/congress-trading-monitor</a><p>Let me know if you have any suggestions for improvements!
Show HN: Fungible – A local personal finance app in the terminal
Fungible is a terminal-based personal finance app that fills the Mint-shaped void in my life. It runs using your own plaid credentials (optional) and has its own integrated chatbot (also optional and BYO key).<p>You connect banks via Plaid or import CSVs. Transactions get auto-categorized by rules you define. On top of normal categories there's a flexibility layer (fixed / flexible / discretionary) so you can see at a glance what's actually controllable spending. There are also tags (also separate from categories) for isolating things like trips or hobbies.<p>The financial health screen does savings rate, liquidity runway, and FIRE projection with adjustable assumptions. Probably overkill but I like it.<p>It also has an MCP server so Claude/ChatGPT can talk to you about your finances, create rules/tags, etc. That’s always the most annoying thing for me when trying a new personal finance tool. Hopefully this brings down the barrier to usefulness. The agent/chatbot in the app has the same tools as the Claude/ChatGPT would have via the MCP.
Show HN: Volt – front end tooling for Phoenix that runs inside the BEAM
Show HN: Volt – front end tooling for Phoenix that runs inside the BEAM
Show HN: OpenBrief – Local-first video downloader/summarizer
OpenBrief is basically a GUI for yt-dlp with some AI on top — paste a link, it downloads locally, and transcription and voice generation run with local AI on your machine. Summaries and chat over the transcript use an LLM, which is bring-your-own-key for now. It's open source and free.
Show HN: OpenBrief – Local-first video downloader/summarizer
OpenBrief is basically a GUI for yt-dlp with some AI on top — paste a link, it downloads locally, and transcription and voice generation run with local AI on your machine. Summaries and chat over the transcript use an LLM, which is bring-your-own-key for now. It's open source and free.
Show HN: Kanban CLI (A local-first, agent-first task manager for the terminal)
Hello HN,<p>Ever since agents have become increasingly common in development, I've been scratching my head as to how to control their randomness. Recently, I decided to emulate an issue-tracking and project-management tool for agent-driven workflows.<p>Kanban is a Rust-based coordination layer designed to provide a feature-rich terminal interface and enforce rigorous workflows. It aims to be versatile and extendable, made to be tailored to any preferred flow. It comes with full git integration and guardrails such that only what truly benefits a project can go through.<p>The workflow boils down to 4 steps:<p>1. The model reads the skill to contextualize the requirements<p>2. It authenticates and receives a strict, schema-validated JSON payload outlining exact files, context, and acceptance criteria<p>3. Implementation is performed within an automatically isolated Git worktree and branch. The tool tracks progress (e.g., verifying all files were edited) before the task is submitted for review<p>4. A reviewer (preferably a human) evaluates the submission and manually transitions the task to "Done," which triggers the final merge and cleans up the task-specific environment.<p>The tool significantly decreases the agent development time, while increasing the human planning phase.<p>There is more to it than I can cover here, so I'd be happy to answer any questions about the architecture, the workflow, or the insights I gained while using it. For more information, I recommend skimming the README, which acts as an index to all documentation files.<p>Repo: <a href="https://codeberg.org/hydrafog/kanban" rel="nofollow">https://codeberg.org/hydrafog/kanban</a>
Show HN: The Front Page – Newspaper-style front page for Hacker News
Show HN: Geomatic – A command-driven geometry studio enabled with autodiff
All commands have the format `output = \func inputs` or just `\function inputs`. Points and scalars are built on the fly. Eg `\line a b` to an empty canvas creates points `a` and `b`, and joins them with a line.<p>One can use broadcasting semantics similar to NumPy and PyTorch in a visual setting (imagine creating a list of circles where one dim corresponds to radius and another to the center). One can also use backpropagation, run gradient descent or visualize vector fields. Almost everything is reactive so changing a variable updates all of the downstream geometry. It also allows anyone to write and load their own visualization, which can be broadcasted and differentiated through.<p><a href="https://www.tinyvolt.com/geomatic/examples/getting-started" rel="nofollow">https://www.tinyvolt.com/geomatic/examples/getting-started</a>
Show HN: Geomatic – A command-driven geometry studio enabled with autodiff
All commands have the format `output = \func inputs` or just `\function inputs`. Points and scalars are built on the fly. Eg `\line a b` to an empty canvas creates points `a` and `b`, and joins them with a line.<p>One can use broadcasting semantics similar to NumPy and PyTorch in a visual setting (imagine creating a list of circles where one dim corresponds to radius and another to the center). One can also use backpropagation, run gradient descent or visualize vector fields. Almost everything is reactive so changing a variable updates all of the downstream geometry. It also allows anyone to write and load their own visualization, which can be broadcasted and differentiated through.<p><a href="https://www.tinyvolt.com/geomatic/examples/getting-started" rel="nofollow">https://www.tinyvolt.com/geomatic/examples/getting-started</a>
Show HN: Write your BPF programs in Go, not C
Show HN: Write your BPF programs in Go, not C
Show HN: A platform to find people to jam on side projects with
I have always found it funny how challenging it can be to find people to jam on side projects with.<p>There are literally entire sub-Reddits where people post looking for someone to work on a project with. That is super inefficient. There are also newsletters for this (also pretty inefficient).<p>Let's Jam is my attempt to solve this.<p>This is NOT a cofounder matching platform. The idea is to connect people with ideas and skills so they can jam on them together. If they end up becoming cofounders, cool, but that is up to them. This is also NOT a place for freelancers to hunt job opportunities. Again, the platform is for people who have an idea or a skill and want to work on something together.<p>How it works:<p>> You either a) find a project and request to jam on it with that person, b) post a project and wait for someone to request to jam on it with you, c) claim an idea and wait for someone to request to jam on it with you.<p>> Once someone requests to jam with you, you'll get an email, and you can vet them via LinkedIn or their past work. If you think they'll be a good fit then accept their request and they'll reach out to you.<p>> That's it. Simple.<p>Any feedback is greatly appreciated!
Show HN: Twixt – transform one word into another in four moves
I made this game while working on a different project about teaching English spelling. I was reading about homophones and got struck by how much a homophone can transform the shape of a word, so I started experimenting with little games built on that.<p>I added a few more transforms, anagrams, verb/tense changes, but the answers kept coming out too obvious. I couldn't distort the word enough to make it interesting. The breakthrough was compound pairs. Jumping from one word to another through their compound (sea → horse, via seahorse) really obscures the path and that's when it suddenly got fun and unpredictable.<p>I've been sharing it with friends. I'm in the UK so mostly UK testers, fair warning that a couple of the homophones may lean British.<p>They've been playing daily and seem hooked, so it felt worth posting here. It's one puzzle a day mainly so I actually have time to hand pick puzzles that have a satisfying path. Today's puzzle is on the easy side but they can get really tricky. The name is from 'betwixt', the whole game is about moving between two words. I did clock afterwards that there's a 60s board game with the same name, but they're pretty different things.
Show HN: Git-based front-end interface for Hugo
I built simple Git-based CMS for Hugo static site generators.
No backend, no database, no configs. Just login with Github, select your repo and start editing.
If you use Hugo for blogging, feel free to check it out at <a href="https://github.com/arashthr/hugo-flow" rel="nofollow">https://github.com/arashthr/hugo-flow</a><p>> Why did you make it?
I wanted to be able to write posts on my phone, and it's not easy to do that with Markdown. Adding images is also always a headache: copy them, minimize and compress them, remember the correct syntax for images, and type the path.<p>> How did you make it?
I used Google Stitch for the design and Antigravity for coding.<p>> What about alternatives?
Check out Front-end interfaces page in Hugo website. None of them is as simple and dumb as I wanted.<p>> But is it actually working?
It's not perfect, but it works well for me. You're welcome to open issues if you find problems.
Show HN: Anyone interested in a tool helps to explore C++ ASTs
I built this software to help users who need to do static code analysis for C/C++ codebase. This can help you to explore the ASTs interactively. It can replace the Clang ast dump completely and also you can search the code/ASTs in a structual way. For example you can search a function which matches specific name or number of paremeters.
Show HN: Anyone interested in a tool helps to explore C++ ASTs
I built this software to help users who need to do static code analysis for C/C++ codebase. This can help you to explore the ASTs interactively. It can replace the Clang ast dump completely and also you can search the code/ASTs in a structual way. For example you can search a function which matches specific name or number of paremeters.