The best Hacker News stories from Show from the past day

Latest posts:

Show HN: Qwertle, yet another daily word game

A new Wordle-like game, where guesses are ranked by how close letters are on a QWERTY keyboard. Try it out and let me know what you think!
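
The post does not say how keyboard closeness is measured. A minimal sketch of one possible scoring, assuming each key sits on a staggered three-row grid and closeness is plain Euclidean distance between key centers (the row offsets and the names `key_distance` and `guess_distance` are illustrative assumptions, not Qwertle's actual rules):

```python
import math

# Assumed layout: the three letter rows of a QWERTY keyboard, each with a
# hypothetical horizontal offset (in key widths) to model the row stagger.
ROWS = [("qwertyuiop", 0.0), ("asdfghjkl", 0.25), ("zxcvbnm", 0.75)]

# Map every letter to an (x, y) position on that grid.
KEY_POS = {
    ch: (col + offset, row)
    for row, (letters, offset) in enumerate(ROWS)
    for col, ch in enumerate(letters)
}


def key_distance(a: str, b: str) -> float:
    """Euclidean distance between two letter keys, in key widths."""
    (x1, y1), (x2, y2) = KEY_POS[a.lower()], KEY_POS[b.lower()]
    return math.hypot(x1 - x2, y1 - y2)


def guess_distance(guess: str, target: str) -> float:
    """Sum of per-position key distances; 0.0 means an exact match."""
    if len(guess) != len(target):
        raise ValueError("guess and target must have the same length")
    return sum(key_distance(g, t) for g, t in zip(guess, target))
```

Under these assumptions a guess such as "car" scores closer to "cat" than "cap" does, because R is next to T while P is five keys away.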

Show HN: Lumos, a local LLM Chrome extension

Lumos is an LLM co-pilot for browsing the web, powered by local LLMs (Ollama).

- Summarize long threads on issue tracking sites, forums, and social media sites
- Summarize news articles
- Ask questions about reviews on business and product pages
- Ask questions about long, technical documentation
- What else?

Show HN: Deep search of all ML papers

Built an automated system to run a deep search of ArXiv and carefully find all the precise papers that exist on a complex topic.

It's different from simple RAG because it searches, classifies, and adapts based on relevant papers it uncovers, and then continues until it finds every paper on a topic (trying to mimic the human research process). Benchmarked 10x higher accuracy and total retrieval compared to Google Scholar for a median search (whitepaper on website). Also knows when it is complete, and misses virtually nothing (< 3% or so, once it's converged).

Website has a free trial and a bunch of example search reports. Want feedback and suggestions.

Show HN: Bernard – a link checker for your website

Introducing Bernard, the project I have been working on solo since November 2022. After months of planning, coding and talking with a small number of potential users, I feel it is time to launch and get some real feedback.

GOAL: Bernard is an automated link checker that crawls your website daily, identifying broken links (internal and external) and missing redirects, such as after a redesign or URL change.

TARGET AUDIENCE: Anyone with a website, be it a personal blog or a company portal, looking to prevent link rot, keep old URLs reachable, and avoid the dreaded 404 Page Not Found error.

STACK: Elixir, with Phoenix LiveView for the interactive dashboard. It runs on PostgreSQL, hosted as Podman containers on a single dedicated Hetzner server.

PLAN: I am releasing this as a free, open beta to address the last few issues, and hope to introduce paid plans in February 2024. While there is a backlog of improvements and bug fixes, my primary focus now is making sure the product is aligned with what users need.

The pricing model is yet to be defined, but I'm considering a free tier for small website owners, and a usage-based plan starting from $10/mo for X,000 links checked per month. This would pair well with the upcoming REST API to provide a link-checker-as-a-service product to embed in third-party systems. (Feel free to contact me if you might be interested in using this API.)

I also wrote about my journey to get this off the ground, and the challenges I faced, at https://combo.cc/posts/bernard-devlog-3-pre-launch-reflections/

Looking forward to your criticism and suggestions.
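
For readers curious what the core of a link checker looks like, here is a rough sketch in Python using only the standard library. It is not Bernard's code (Bernard is written in Elixir, and its internals are not described in the post); the names `collect_links`, `is_internal`, and `find_broken` are hypothetical, and the HTTP lookup is passed in as a callable so the logic can be tried without network access.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin, urlparse


class LinkCollector(HTMLParser):
    """Collects absolute http(s) URLs from <a href="..."> tags."""

    def __init__(self, base_url: str):
        super().__init__()
        self.base_url = base_url
        self.links: list[str] = []

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        href = dict(attrs).get("href")
        if not href:
            return
        # Resolve relative links against the page URL.
        url = urljoin(self.base_url, href)
        if urlparse(url).scheme in ("http", "https"):
            self.links.append(url)


def collect_links(html: str, base_url: str) -> list[str]:
    parser = LinkCollector(base_url)
    parser.feed(html)
    return parser.links


def is_internal(link: str, base_url: str) -> bool:
    """A link is internal when it points at the same host as the page."""
    return urlparse(link).netloc == urlparse(base_url).netloc


def find_broken(links, fetch_status):
    """Return links whose status code is 400 or higher.

    fetch_status is any callable mapping a URL to an HTTP status code,
    for example a thin wrapper around an HTTP client.
    """
    return [url for url in links if fetch_status(url) >= 400]
```

A real crawler would also need rate limiting, redirect tracking, and retries, none of which are shown here.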

Show HN: Open-source script to get your site indexed on Google

ZX Spectrum Raytracer

Inspired by a recent article on HN about raytracing on some ancient architecture, I realized that I've implemented many a raytracer in my life, but never one for my first love, the ZX Spectrum. So I fixed the glitch.

Trigger warning: the source code includes GO TO statements. Because that's how we did it in the 80s! Enter at your own peril.

Show HN: Open-source Rule-based PDF parser for RAG

The PDF parser is a rule-based parser that uses text coordinates (bounding boxes), graphics, and font data. It works off the text layer and also offers an OCR option that automatically applies OCR to any scanned pages in your PDFs. The OCR feature is based on a modified version of Tika, which uses Tesseract underneath.

The PDF parser offers the following features:

* Sections and subsections along with their levels.
* Paragraphs - combines lines.
* Links between sections and paragraphs.
* Tables along with the section the tables are found in.
* Lists and nested lists.
* Join content spread across pages.
* Removal of repeating headers and footers.
* Watermark removal.
* OCR with bounding boxes.
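
To give a feel for what a rule in such a parser might look like, here is a small sketch of one feature from the list, removal of repeating headers and footers. It is an illustration only, not the project's implementation: it assumes pages have already been extracted as lists of text lines, and the function name `strip_repeated_edges` and the `min_fraction` threshold are made up for this example.

```python
from collections import Counter


def strip_repeated_edges(pages, min_fraction=0.6):
    """Drop first/last lines that repeat across most pages.

    pages: a list of pages, each page being a list of text lines.
    min_fraction: hypothetical threshold; a line counts as a header or
    footer if it opens or closes at least this share of the pages.
    """
    if len(pages) < 2:
        return [list(page) for page in pages]

    # Count how often each line appears as the first or last line of a page.
    first = Counter(page[0] for page in pages if page)
    last = Counter(page[-1] for page in pages if page)

    threshold = min_fraction * len(pages)
    headers = {line for line, n in first.items() if n >= threshold}
    footers = {line for line, n in last.items() if n >= threshold}

    cleaned = []
    for page in pages:
        lines = list(page)
        if lines and lines[0] in headers:
            lines = lines[1:]
        if lines and lines[-1] in footers:
            lines = lines[:-1]
        cleaned.append(lines)
    return cleaned
```

This exact-match rule would miss footers that change per page, such as page numbers; a fuller version would probably normalize digits or compare line positions as well.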

Show HN: I wished for a site with a growing list of math problems, I built it

Good math problems are hidden inside textbooks and online documents. To keep up with all the sources in the world is hard. For someone who just wants to continuously solve problems, finding and going through all the sources feels like a hassle. I wished for a website that could just dump all the math problems available in the world out there. And if I could filter the problems by topics, that would be beautiful.

teachyourselfmath is a side project that was born out of this need. At its core, it is a math PDF extraction engine. The engine has some machine learning going on behind the scenes to extract math problems in LaTeX from any image or document.

A little bit about me: I am Vivek, a software engineer based out of India with a diverse set of interests including math. This project is close to my heart for many different reasons and nothing would make me happier than finding people on the internet who would find this website to be useful.

I’d love to hear your feedback on this. Thanks!