The best Hacker News stories from Show from the past day
Latest posts:
Show HN: Demon – open-source real-time music diffusion engine, 25Hz local GPU
YO,<p>I’m Ryan, lead author. I’ve been contributing open source generative audio stuff for a while now, audio reactive Comfy nodes, extended ACEstep support in Comfy, etc.. I just opened-sourced a new audio project that I've been working on for a few months and I want to tell y'all about it.<p>WHAT IS IS
DEMON: Diffusion Engine for Musical Orchestrated Noise<p>This is StreamDiffusion but with audio instead of images, and ACEStep 1.5 instead of Stable Diffusion. It’s responsive enough that you can play it like an instrument, and remix in near real-time.<p>I also distilled the ACEStep VAE: it’s faster at the expense of some quality.<p>I also trained something like 200 lora/dora for ACEStep 1.5 and 1.5XL: I will release these in batches of 5 or 10 or something<p>WHY IT IS
Two reasons:
1) Making music is an inherently real-time activity
2) Why not bro<p>SOME RUNTIME CAPABILITIES
-Real-time remixing of songs
-Denoise, structure, timbre strength adjustment
-Reference track swapping
-Prompt blending, parameter scheduling with curves
-LoRA hotswapping, runtime strength adjustment
-Latent channel (research preview)
-Feedback
-Vocal stem cutting/pasting with melformer (s/o u/BuffMcBigHuge)
-XL support (its less stable, working out VRAM pressure issues and whatnot)
-Lyrics/vocals SOON
-Spectral quality SOON
-Other stuff<p>SOME LIMITATIONS
-ACEStep (correctly) ‘begins’ and ‘ends’ the song. This system is optimized for remixing either an entire song, or continuously remixing a loop. The loop works fine, but this is not pure, continuous music. Autogression wins here.
-Many others, for a more exhaustive list, please see the full writeup via the project page
-Please let us know if you find any, I would love to try and address them if possible<p>LINKS
My YouTube (DEMON tutorial): <a href="https://youtu.be/FBv1b5gmjcE" rel="nofollow">https://youtu.be/FBv1b5gmjcE</a>
Github: <a href="https://github.com/daydreamlive/DEMON" rel="nofollow">https://github.com/daydreamlive/DEMON</a>
Project page: <a href="https://daydreamlive.github.io/DEMON" rel="nofollow">https://daydreamlive.github.io/DEMON</a>
LoRA: <a href="https://civitai.com/models/2416425/acestep-loras" rel="nofollow">https://civitai.com/models/2416425/acestep-loras</a>
DreamVAE: <a href="https://huggingface.co/daydreamlive/DreamVAE" rel="nofollow">https://huggingface.co/daydreamlive/DreamVAE</a>
Try it w/o installing: <a href="https://music.daydream.live" rel="nofollow">https://music.daydream.live</a>
Show HN: Filemat – an open-source web-based file manager
Hello HN,<p>I would like to share Filemat, a web-based file manager that I built because I wanted something with a simple setup and file permissions that work across the filesystem (as opposed to permissions only for a folder managed by the app).<p>It's self-hosted and open-source (currently in beta).<p>I'd be happy to hear your feedback<p>Repo:
<a href="https://github.com/bingud/filemat" rel="nofollow">https://github.com/bingud/filemat</a>
Show HN: CoreTex – An Open-Source, Unix-like, biomimetic, flat-file AI Harness
Show HN: Open-source Workspace (mail,docs,spreadsheet,drive) web/iOS
Show HN: Posthorn, self-hosted mail gateway
Introducing Posthorn, a self hosted email gateway. One docker container (or Go binary) between every self hosted app on your VPS and your transactional email provider. Set up Posthorn once, point your apps to it, done.<p>I was trying to deploy Ghost on a DigitalOcean droplet and found that DO and many different VPS services have started to block the default SMTP ports to try to combat the various types of abuse they get. To actually configure my app, I had to hack together a Postfix relay.<p>In another project, I had a static site which had a contact form, but my free Formspree account was occasionally hitting usage limits and I desperately wanted some of the anti-spam features they had gated behind their paid accounts so I put together a caddy module to catch HTTP POSTs and bounce them to my provider.<p>I kept bumping into these same email issues. Many of the services I wanted to host (Gitea, Mastodon, Umami, Comentario) ran into the same limitations. This felt like a really common issue that had no good solution.<p>Posthorn is what I built to solve this. It's a small Go binary (or 10 MB docker image) that sits between your self hosted apps and your transactional email provider of choice (shipping with support for Postmark, Resend, Mailgun, Amazon SES or an outbound SMTP relay). It also accepts POSTs from HTML forms to support static site needs while adding security layers such as honeypot fields, origin checks and IP rate limiting. There's also a JSON HTTP API that supports Bearer auth for backend scripts or cron jobs that just want a /send endpoint.<p>I now use this personally in multiple scenarios and I've spent a lot of time beating this up and testing against what I can validate. I'd love to hear how this might be useful for you, what breaks and any feedback you might have. It's open source under Apache 2.0 and I'd love contributions. I'm planning to support and grow this for the long haul.<p>Code: <a href="https://github.com/craigmccaskill/posthorn" rel="nofollow">https://github.com/craigmccaskill/posthorn</a><p>Docs: <a href="https://posthorn.dev/" rel="nofollow">https://posthorn.dev/</a><p>Longer write up: <a href="https://craigmccaskill.com/introducing-posthorn/" rel="nofollow">https://craigmccaskill.com/introducing-posthorn/</a><p>Previous HN discussion on the exact issue I'm trying to solve: <a href="https://news.ycombinator.com/item?id=43620318">https://news.ycombinator.com/item?id=43620318</a>
Show HN: I made an emergency page for my family
I have a bad memory and can't memorize some important numbers, so I created this project.<p>I've always been concerned about being without my phone (getting robbed - which is common in Brazil - running out of battery, having it break, etc.), so I decided to create a page that sends SMS messages (LLM-summarized) and emails with more detailed information such as geolocation, IP address, and the full message.<p>It’s a simple page that allows sending one or more messages, with recipients being myself and other people - for example, in case I or they need help or need to communicate something important.<p>The source code is available at <a href="https://github.com/skhaz/dokku/tree/main/apps/help" rel="nofollow">https://github.com/skhaz/dokku/tree/main/apps/help</a>
Show HN: Unsiloed AI – #1 on olmOCR-Bench
Most of the document parsers fail on real world challenges like complex tables, handwritten documents, historical document scans, equations, multi-column layouts, complex reading order, etc. We built Unsiloed Parser to handle exactly these cases.<p>Our latest parser v3.1 achieved #1 rank and scored 88.0 strict pass-rate on olmOCR-Bench. We ran the evaluation across 1,403 PDFs and 8,413 unit tests using the unmodified upstream Allen AI scorer (olmocr==0.4.27) and found Unsiloed beats 18 other OCR services, including GPT-5.5, Claude Opus 4.7, LlamaParse, Reducto, Azure Document Intelligence, AWS Textract, and Unstructured.<p>When we dug deeper into the failure cases, we found many errors were not OCR errors but things like \frac vs \dfrac, whitespace differences, or equivalent LaTeX renderings. We ran a secondary LLM-as-Judge evaluation to classify real misses vs semantic equivalents, which lifts the corrected score to 94.8 (explained deeply in the blog post).<p>Blog with full methodology and examples: <a href="https://www.unsiloed.ai/blog/unsiloed-ai-achieves-1-rank-on-olmocr-bench-2">https://www.unsiloed.ai/blog/unsiloed-ai-achieves-1-rank-on-...</a><p>Evaluation Code for reproducibility:
<a href="https://github.com/Unsiloed-AI/unsiloed-olmocr-benchmark" rel="nofollow">https://github.com/Unsiloed-AI/unsiloed-olmocr-benchmark</a><p>Feel free to post your messiest PDFs in the comment and we'll run it through Unsiloed parser and share the output here.
Show HN: skills-for-humanity – 171 structured reasoning skills for Claude Code
Show HN: TryPost – open-source Social Media Scheduler
Show HN: A website that tracks every stock trade Congress makes
Congressional trading data is relatively commoditized, but I couldn't find any open-source version with the features I wanted.<p>The data is lagged (median 28 days from trade to disclosure, and 19% miss this deadline), but there's still interesting patterns to explore.<p>Repo: <a href="https://github.com/kadoa-org/congress-trading-monitor" rel="nofollow">https://github.com/kadoa-org/congress-trading-monitor</a><p>Let me know if you have any suggestions for improvements!
Show HN: Fungible – A local personal finance app in the terminal
Fungible is a terminal-based personal finance app that fills the Mint-shaped void in my life. It runs using your own plaid credentials (optional) and has its own integrated chatbot (also optional and BYO key).<p>You connect banks via Plaid or import CSVs. Transactions get auto-categorized by rules you define. On top of normal categories there's a flexibility layer (fixed / flexible / discretionary) so you can see at a glance what's actually controllable spending. There are also tags (also separate from categories) for isolating things like trips or hobbies.<p>The financial health screen does savings rate, liquidity runway, and FIRE projection with adjustable assumptions. Probably overkill but I like it.<p>It also has an MCP server so Claude/ChatGPT can talk to you about your finances, create rules/tags, etc. That’s always the most annoying thing for me when trying a new personal finance tool. Hopefully this brings down the barrier to usefulness. The agent/chatbot in the app has the same tools as the Claude/ChatGPT would have via the MCP.
Show HN: Volt – front end tooling for Phoenix that runs inside the BEAM
Show HN: Volt – front end tooling for Phoenix that runs inside the BEAM
Show HN: OpenBrief – Local-first video downloader/summarizer
OpenBrief is basically a GUI for yt-dlp with some AI on top — paste a link, it downloads locally, and transcription and voice generation run with local AI on your machine. Summaries and chat over the transcript use an LLM, which is bring-your-own-key for now. It's open source and free.
Show HN: OpenBrief – Local-first video downloader/summarizer
OpenBrief is basically a GUI for yt-dlp with some AI on top — paste a link, it downloads locally, and transcription and voice generation run with local AI on your machine. Summaries and chat over the transcript use an LLM, which is bring-your-own-key for now. It's open source and free.
Show HN: OpenBrief – Local-first video downloader/summarizer
OpenBrief is basically a GUI for yt-dlp with some AI on top — paste a link, it downloads locally, and transcription and voice generation run with local AI on your machine. Summaries and chat over the transcript use an LLM, which is bring-your-own-key for now. It's open source and free.
Show HN: Kanban CLI (A local-first, agent-first task manager for the terminal)
Hello HN,<p>Ever since agents have become increasingly common in development, I've been scratching my head as to how to control their randomness. Recently, I decided to emulate an issue-tracking and project-management tool for agent-driven workflows.<p>Kanban is a Rust-based coordination layer designed to provide a feature-rich terminal interface and enforce rigorous workflows. It aims to be versatile and extendable, made to be tailored to any preferred flow. It comes with full git integration and guardrails such that only what truly benefits a project can go through.<p>The workflow boils down to 4 steps:<p>1. The model reads the skill to contextualize the requirements<p>2. It authenticates and receives a strict, schema-validated JSON payload outlining exact files, context, and acceptance criteria<p>3. Implementation is performed within an automatically isolated Git worktree and branch. The tool tracks progress (e.g., verifying all files were edited) before the task is submitted for review<p>4. A reviewer (preferably a human) evaluates the submission and manually transitions the task to "Done," which triggers the final merge and cleans up the task-specific environment.<p>The tool significantly decreases the agent development time, while increasing the human planning phase.<p>There is more to it than I can cover here, so I'd be happy to answer any questions about the architecture, the workflow, or the insights I gained while using it. For more information, I recommend skimming the README, which acts as an index to all documentation files.<p>Repo: <a href="https://codeberg.org/hydrafog/kanban" rel="nofollow">https://codeberg.org/hydrafog/kanban</a>
Show HN: The Front Page – Newspaper-style front page for Hacker News
Show HN: Geomatic – A command-driven geometry studio enabled with autodiff
All commands have the format `output = \func inputs` or just `\function inputs`. Points and scalars are built on the fly. Eg `\line a b` to an empty canvas creates points `a` and `b`, and joins them with a line.<p>One can use broadcasting semantics similar to NumPy and PyTorch in a visual setting (imagine creating a list of circles where one dim corresponds to radius and another to the center). One can also use backpropagation, run gradient descent or visualize vector fields. Almost everything is reactive so changing a variable updates all of the downstream geometry. It also allows anyone to write and load their own visualization, which can be broadcasted and differentiated through.<p><a href="https://www.tinyvolt.com/geomatic/examples/getting-started" rel="nofollow">https://www.tinyvolt.com/geomatic/examples/getting-started</a>
Show HN: Geomatic – A command-driven geometry studio enabled with autodiff
All commands have the format `output = \func inputs` or just `\function inputs`. Points and scalars are built on the fly. Eg `\line a b` to an empty canvas creates points `a` and `b`, and joins them with a line.<p>One can use broadcasting semantics similar to NumPy and PyTorch in a visual setting (imagine creating a list of circles where one dim corresponds to radius and another to the center). One can also use backpropagation, run gradient descent or visualize vector fields. Almost everything is reactive so changing a variable updates all of the downstream geometry. It also allows anyone to write and load their own visualization, which can be broadcasted and differentiated through.<p><a href="https://www.tinyvolt.com/geomatic/examples/getting-started" rel="nofollow">https://www.tinyvolt.com/geomatic/examples/getting-started</a>