The best Hacker News stories from Show from the past day

Go back

Latest posts:

Show HN: A tool to benchmark LLM APIs (OpenAI, Claude, local/self-hosted)

I recently built a small open-source tool to benchmark different LLM API endpoints — including OpenAI, Claude, and self-hosted models (like llama.cpp).<p>It runs a configurable number of test requests and reports two key metrics: • First-token latency (ms): How long it takes for the first token to appear • Output speed (tokens/sec): Overall output fluency<p>Demo: <a href="https://llmapitest.com/" rel="nofollow">https://llmapitest.com/</a> Code: <a href="https://github.com/qjr87/llm-api-test">https://github.com/qjr87/llm-api-test</a><p>The goal is to provide a simple, visual, and reproducible way to evaluate performance across different LLM providers, including the growing number of third-party “proxy” or “cheap LLM API” services.<p>It supports: • OpenAI-compatible APIs (official + proxies) • Claude (via Anthropic) • Local endpoints (custom/self-hosted)<p>You can also self-host it with docker-compose. Config is clean, adding a new provider only requires a simple plugin-style addition.<p>Would love feedback, PRs, or even test reports from APIs you’re using. Especially interested in how some lesser-known services compare.

Show HN: A different kind of AI Video generation

Hello!<p>I'm Andrew Arrow, a developer and Final Cut Pro user. My history with FCP goes back years and years, I tried to be a video editor as a career but ended up a software developer. I purchased the full version of the software a long, long time ago. And to Apple's credit, every single time I goto the App Store on any mac I still have access to download it. I must have been through 20, 30? I don't know, but a lot of macs over the years. Just recently I downloaded it again and noticed the XML Export feature.<p>Wow. There's a lot of stuff there. I mean a lot. I started playing around with generating the XML and that has lead to this project.<p><a href="https://github.com/andrewarrow/cutlass/blob/main/README.md">https://github.com/andrewarrow/cutlass/blob/main/README.md</a><p>Read through those go structs and xml tags. Keep going. And going. It just never stops! It's a very sophisticated XML format that can describe any timeline, effect, or animation you can imagine. Apple's documentation is scattered, the XML is notoriously finicky, and one wrong attribute crashes your import.<p>And of course with Claude Code (or others, pick your favorite AI) you can have your AI write some amazing things in go using cutlass.<p>```go // Generate a timeline with precise keyframe animations<p>video := fcp.Video{<p><pre><code> Ref: assetID, Offset: "0s", Duration: fcp.ConvertSecondsToFCPDuration(10.0), AdjustTransform: &fcp.AdjustTransform{ Params: []fcp.Param{{ Name: "position", KeyframeAnimation: &fcp.KeyframeAnimation{ Keyframes: []fcp.Keyframe{ {Time: "0s", Value: "0 0"}, {Time: "240240/24000s", Value: "100 50"}, }, }, }}, },</code></pre> } ```<p>So I haven't been writing code like that with my human brain. I spend most of my time just telling Claude the video I want to create, and he uses the cutlass library to achieve it. I'm hooking up FCP directly to Claude. A different kind of AI Video generation.<p>Would love feedback, thanks for reading this far! -aa

Show HN: Octelium – FOSS Alternative to Teleport, Cloudflare, Tailscale, Ngrok

I have been working on Octelium for quite a few years now but it was open sourced only by late May 2025. Octelium, as described more in detail in the repo's README, is simply an open source, self-hosted, unified platform for zero trust resource access that is primarily meant to be a modern alternative to corporate VPNs and remote access tools. It can operate as a remote access/corporate VPN (i.e. alternative to Twingate, Tailscale, OpenVPN Access Server, etc...), a ZTNA/BeyondCorp platform (i.e. alterntive to Cloudflare Access, Teleport, Google BeyondCorp, etc...), and it can also operate as an API/AI gateway, an infrastructure for MCP and A2A architectures and meshes, an ngrok alternative, a homelab infrastructure or even as a more advanced Kubernetes ingress. It's basically designed to operate like a unified Kubernetes-like scalable architecture for zero trust secure/remote access that's suitable for different human-to-workload and workload-to-workload environments. You can read more in detail the full set of main features and links about how it works in the repo's README or directly in the docs <a href="https://octelium.com/docs" rel="nofollow">https://octelium.com/docs</a>

Show HN: A Go service that exposes a FIFO message queue in RAM

Show HN: AGL a toy language that compiles to Go

I spent the past 2 weeks making this toy programming language.<p>I basically forked the Go scanner/parser, and changed the syntax to have functions return a single value. This enable proper Result/Option type to be used as well as propagating errors with an operator.<p>I also wanted to have short "type inferred" anonymous functions to be able to use functions like Map/Reduce/Filter, without having to use 100 characters to specify the types.

Show HN: AGL a toy language that compiles to Go

I spent the past 2 weeks making this toy programming language.<p>I basically forked the Go scanner/parser, and changed the syntax to have functions return a single value. This enable proper Result/Option type to be used as well as propagating errors with an operator.<p>I also wanted to have short "type inferred" anonymous functions to be able to use functions like Map/Reduce/Filter, without having to use 100 characters to specify the types.

Show HN: Vet – A tool for safely running remote shell scripts

Show HN: Vet – A tool for safely running remote shell scripts

Show HN: Shouldiuse.dev – Software dependency health checker

As software engineers we are often confronted with the decision of whether to code something ourselves or to add an existing library that does it for us.<p>Whether we like it or not – we are adding dependencies sooner or later. And it's arguably good practice to check a new dependency beforehand: Is it maintained? By whom? How many issues does it have and how many of those are bugs? Are they being fixed? What's on the roadmap? What's the release frequency and how often do APIs break?<p>One of our favorite solutions that already exist to answer such questions is the OpenSSF Scorecard project (<a href="https://github.com/ossf/scorecard">https://github.com/ossf/scorecard</a>) – we use this ourselves and can only recommend it.<p>We built shouldiuse.dev around it to make results accessible as a website, and used the opportunity to dive deep into heavily LLM-assisted coding for the first time in a professional project.<p>Three people (devs and non-devs) each started vibe-coding an initial prototypes, one using v0, one using lovable and one using Cursor. At first blown away by how fast we were able to generate these and how great there were looking, we soon ran into issues merging different ideas as there were multiple different web frameworks and versions flying around. The most work on the frontend definitely went into getting the details and small adaptions right.<p>In parallel, on the backend we started to write a Go application that uses the ossf/scorecard library to do a lot of the checks we want. To also play around with AI on that end, we intentionally made heavy use of Copilot and tried around with different models and prompts. We also added more metrics to the dependency check that we gather via GitHub API, and finally generate textual summaries via OpenAI.<p>The Prompt to generate a final textual recommendation consists of:<p>* A header stating the role, capabilities and limitations, and the expected response format (JSON and no lists/bullet points) – We also tell it to be critical, objective and give short and concise answers. * The result of the scorecard check * Additional community-related data * The questions that are being shown in the FAQ section – The answers to those are also generated by the LLM.<p>Since such a check involves heavy use of the GitHub API, we require users to input a GitHub personal access token when requesting a check. The first time a repository is checked on shouldiuse.dev it will take a few seconds, but then the results are stored in a postgres for faster retrieval later on.<p>For now it only works for public GitHub repos, but we might add other platforms if there is interest.<p>We also added a remote MCP server with built-in authentication, so you can directly access <i>shouldiuse</i> from your IDE and automatically check new dependencies anytime a coding assistant introduces one to ensure that only safe dependencies are added to the project.<p>What started as a fun internal experiment quickly surprised us with how useful it turned out to be. We didn’t plan to release it publicly, but we think might be useful for other devs and therefore we wanted to share it here. Any feedback is welcome!

Show HN: PILF, The ultimate solution to catastrophic oblivion on AI models

Show HN: PILF, The ultimate solution to catastrophic oblivion on AI models

Show HN: A Python Language Server, Mypy-compatible

Having created Jedi in 2012, I started ZubanLS in 2020 to advance Python tooling. It understands Mypy config files and passes >95% of the relevant Mypy tests. AMA.

Show HN: Do you know RGB?

Show HN: Do you know RGB?

Show HN: Do you know RGB?

Show HN: Sink – Sync any directory with any device on your local network

i made sink. it's a simple little tool that continuously syncs folders between 2 devices. no cloud, no email, flash drives, no bs.<p>it just uses your local wifi. run it on your machines, tell them to trust each other, and you're set. and if you manage to edit the same file at once, it handles the conflict and saves both copies.<p>for anyone who just wants to get files from point a to b without the headache. hope it makes your life a bit less annoying.<p>github: <a href="https://github.com/sirbread/sink">https://github.com/sirbread/sink</a> binary: <a href="https://github.com/sirbread/sink/releases/tag/v0.1">https://github.com/sirbread/sink/releases/tag/v0.1</a>

Show HN: Sink – Sync any directory with any device on your local network

i made sink. it's a simple little tool that continuously syncs folders between 2 devices. no cloud, no email, flash drives, no bs.<p>it just uses your local wifi. run it on your machines, tell them to trust each other, and you're set. and if you manage to edit the same file at once, it handles the conflict and saves both copies.<p>for anyone who just wants to get files from point a to b without the headache. hope it makes your life a bit less annoying.<p>github: <a href="https://github.com/sirbread/sink">https://github.com/sirbread/sink</a> binary: <a href="https://github.com/sirbread/sink/releases/tag/v0.1">https://github.com/sirbread/sink/releases/tag/v0.1</a>

Show HN: Sink – Sync any directory with any device on your local network

i made sink. it's a simple little tool that continuously syncs folders between 2 devices. no cloud, no email, flash drives, no bs.<p>it just uses your local wifi. run it on your machines, tell them to trust each other, and you're set. and if you manage to edit the same file at once, it handles the conflict and saves both copies.<p>for anyone who just wants to get files from point a to b without the headache. hope it makes your life a bit less annoying.<p>github: <a href="https://github.com/sirbread/sink">https://github.com/sirbread/sink</a> binary: <a href="https://github.com/sirbread/sink/releases/tag/v0.1">https://github.com/sirbread/sink/releases/tag/v0.1</a>

Show HN: Zenta – Mindfulness for Terminal Users

Show HN: Zenta – Mindfulness for Terminal Users

1 2 3 ... 826 827 828 >