The best Hacker News stories from All from the past day
Latest posts:
Zod 4
Have I Been Pwned 2.0
Don't guess my language
The Windows Subsystem for Linux is now open source
$30 Homebrew Automated Blinds Opener (2024)
Mystical
Show HN: I modeled the Voynich Manuscript with SBERT to test for structure
I built this project as a way to learn more about NLP by applying it to something weird and unsolved.
The Voynich Manuscript is a 15th-century book written in an unknown script. No one’s been able to translate it, and many think it’s a hoax, a cipher, or a constructed language. I wasn’t trying to decode it; I just wanted to see whether it behaves like a structured language.
I stripped a handful of common suffix-like endings (aiin, dy, etc.) to isolate what looked like root forms. I know that’s a strong assumption, and I call it out directly in the repo, but it helped clarify the clustering. From there, I used SBERT embeddings and KMeans to group similar roots, inferred POS-like roles based on position and frequency, and built a Markov transition matrix to visualize cluster-to-cluster flow.
It’s not translation. It’s not decryption. It’s structural modeling, and it revealed some surprisingly consistent syntax across the manuscript, especially when broken out by section (Botanical, Biological, etc.).
GitHub repo: https://github.com/brianmg/voynich-nlp-analysis
Write-up: https://brig90.substack.com/p/modeling-the-voynich-manuscript-with?r=3z5dn9
I’m new to the NLP space, so I’m sure there are things I got wrong, but I’d love feedback from people who’ve worked with structured language modeling or weird edge cases like this.
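For readers who want a feel for the suffix-stripping and Markov steps described in the post, here is a minimal sketch. It is not the code from the repo. The suffix list contains only the two endings the post names, the words are toy data invented for illustration, and the SBERT + KMeans clustering step is replaced by a hand-written root-to-cluster map (`cluster_of`) so the example runs with the standard library alone. The function names `strip_suffix` and `transition_matrix` are hypothetical.

```python
from collections import Counter, defaultdict

# Assumption: only the two endings named in the post; the real list is longer.
SUFFIXES = ("aiin", "dy")


def strip_suffix(word, suffixes=SUFFIXES):
    """Remove the first matching suffix, keeping at least one character of root."""
    for suffix in suffixes:
        if word.endswith(suffix) and len(word) > len(suffix):
            return word[: -len(suffix)]
    return word


def transition_matrix(sequences, labels):
    """Row-normalized counts of label-to-label transitions within each sequence."""
    counts = defaultdict(Counter)
    for seq in sequences:
        for current, following in zip(seq, seq[1:]):
            counts[current][following] += 1
    matrix = {}
    for a in labels:
        total = sum(counts[a].values())
        matrix[a] = {b: (counts[a][b] / total if total else 0.0) for b in labels}
    return matrix


# Toy lines, invented for illustration (not a real transcription excerpt).
lines = [["qokaiin", "chedy", "daiin"], ["shedy", "qokaiin", "chedy"]]
roots = [[strip_suffix(w) for w in line] for line in lines]

# Stand-in for SBERT embeddings + KMeans: a hand-written root -> cluster id map.
cluster_of = {"qok": 0, "che": 1, "d": 0, "she": 1}
cluster_lines = [[cluster_of[r] for r in line] for line in roots]

# matrix[a][b] estimates P(next cluster is b | current cluster is a).
matrix = transition_matrix(cluster_lines, labels=[0, 1])
```

In a fuller version, `cluster_of` would come from clustering embeddings of the roots, and the matrix could be computed separately per manuscript section to compare the flows the post mentions.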
AniSora: Open-source anime video generation model
If nothing is curated, how do we find things
France Endorses UN Open Source Principles
Ditching Obsidian and building my own
Spaced repetition systems have gotten better
JavaScript's New Superpower: Explicit Resource Management
MIT asks arXiv to withdraw preprint of paper on AI and scientific discovery