The best Hacker News stories from All from the past day

Go back

Latest posts:

Zod 4

Zod 4

Have I Been Pwned 2.0

Have I Been Pwned 2.0

Don't guess my language

The Windows Subsystem for Linux is now open source

The Windows Subsystem for Linux is now open source

$30 Homebrew Automated Blinds Opener (2024)

Mystical

Show HN: I modeled the Voynich Manuscript with SBERT to test for structure

I built this project as a way to learn more about NLP by applying it to something weird and unsolved.<p>The Voynich Manuscript is a 15th-century book written in an unknown script. No one’s been able to translate it, and many think it’s a hoax, a cipher, or a constructed language. I wasn’t trying to decode it — I just wanted to see: does it behave like a structured language?<p>I stripped a handful of common suffix-like endings (aiin, dy, etc.) to isolate what looked like root forms. I know that’s a strong assumption — I call it out directly in the repo — but it helped clarify the clustering. From there, I used SBERT embeddings and KMeans to group similar roots, inferred POS-like roles based on position and frequency, and built a Markov transition matrix to visualize cluster-to-cluster flow.<p>It’s not translation. It’s not decryption. It’s structural modeling — and it revealed some surprisingly consistent syntax across the manuscript, especially when broken out by section (Botanical, Biological, etc.).<p>GitHub repo: <a href="https://github.com/brianmg/voynich-nlp-analysis">https://github.com/brianmg/voynich-nlp-analysis</a> Write-up: <a href="https://brig90.substack.com/p/modeling-the-voynich-manuscript-with?r=3z5dn9" rel="nofollow">https://brig90.substack.com/p/modeling-the-voynich-manuscrip...</a><p>I’m new to the NLP space, so I’m sure there are things I got wrong — but I’d love feedback from people who’ve worked with structured language modeling or weird edge cases like this.

AniSora: Open-source anime video generation model

If nothing is curated, how do we find things

France Endorses UN Open Source Principles

France Endorses UN Open Source Principles

Ditching Obsidian and building my own

Ditching Obsidian and building my own

Spaced repetition systems have gotten better

Spaced repetition systems have gotten better

JavaScript's New Superpower: Explicit Resource Management

MIT asks arXiv to withdraw preprint of paper on AI and scientific discovery

< 1 2 3 ... 5 6 7 8 9 ... 814 815 816 >