The best Hacker News stories from All from the past day
Latest posts:
Jules: An asynchronous coding agent
Making video games (without an engine) in 2025
DDoSecrets publishes 410 GB of heap dumps, hacked from TeleMessage
Finland announces migration of its rail network to international gauge
Veo 3 and Imagen 4, and a new tool for filmmaking called Flow
Veo 3 and Imagen 4, and a new tool for filmmaking called Flow
European Investment Bank to inject €70B in European tech
GitHub Copilot Coding Agent
GitHub Copilot Coding Agent
InventWood is about to mass-produce wood that's stronger than steel
Zod 4
Zod 4
Have I Been Pwned 2.0
Have I Been Pwned 2.0
Don't guess my language
The Windows Subsystem for Linux is now open source
The Windows Subsystem for Linux is now open source
$30 Homebrew Automated Blinds Opener (2024)
Mystical
Show HN: I modeled the Voynich Manuscript with SBERT to test for structure
I built this project as a way to learn more about NLP by applying it to something weird and unsolved.<p>The Voynich Manuscript is a 15th-century book written in an unknown script. No one’s been able to translate it, and many think it’s a hoax, a cipher, or a constructed language. I wasn’t trying to decode it — I just wanted to see: does it behave like a structured language?<p>I stripped a handful of common suffix-like endings (aiin, dy, etc.) to isolate what looked like root forms. I know that’s a strong assumption — I call it out directly in the repo — but it helped clarify the clustering. From there, I used SBERT embeddings and KMeans to group similar roots, inferred POS-like roles based on position and frequency, and built a Markov transition matrix to visualize cluster-to-cluster flow.<p>It’s not translation. It’s not decryption. It’s structural modeling — and it revealed some surprisingly consistent syntax across the manuscript, especially when broken out by section (Botanical, Biological, etc.).<p>GitHub repo: <a href="https://github.com/brianmg/voynich-nlp-analysis">https://github.com/brianmg/voynich-nlp-analysis</a>
Write-up: <a href="https://brig90.substack.com/p/modeling-the-voynich-manuscript-with?r=3z5dn9" rel="nofollow">https://brig90.substack.com/p/modeling-the-voynich-manuscrip...</a><p>I’m new to the NLP space, so I’m sure there are things I got wrong — but I’d love feedback from people who’ve worked with structured language modeling or weird edge cases like this.