The best Hacker News stories from Show from the past week
Latest posts:
Show HN: Open-source Rule-based PDF parser for RAG
The PDF parser is a rule based parser which uses text co-ordinates (boundary box), graphics and font data. The PDF parser works off text layer and also offers a OCR option to automatically use OCR if there are scanned pages in your PDFs. The OCR feature is based off a modified version of tika which uses tesseract underneath.<p>The PDF Parser offers the following features:<p>* Sections and subsections along with their levels.
* Paragraphs - combines lines.
* Links between sections and paragraphs.
* Tables along with the section the tables are found in.
* Lists and nested lists.
* Join content spread across pages.
* Removal of repeating headers and footers.
* Watermark removal.
* OCR with boundary boxes
Show HN: I wished for a site with a growing list of math problems, I built it
Good math problems are hidden inside textbooks and online documents. To keep up with all the sources in the world is hard. For someone who just wants to continuously solve problems, finding and going through all the sources feels like a hassle. I wished for a website that could just dump all the math problems available in the world out there. And if I could filter the problems by topics, that would be beautiful.<p>teachyourselfmath is a side project that was born out of this need. At its core, it is a math PDF extraction engine. The engine has some machine learning going on behind the scenes to extract math problems in LaTeX from any image or document.<p>A little bit about me: I am Vivek, a software engineer based out of India with a diverse set of interests including math. This project is close to my heart for many different reasons and nothing would make me happier than finding people on the internet who would find this website to be useful.<p>I’d love to hear your feedback on this. Thanks!
Show HN: I wished for a site with a growing list of math problems, I built it
Good math problems are hidden inside textbooks and online documents. To keep up with all the sources in the world is hard. For someone who just wants to continuously solve problems, finding and going through all the sources feels like a hassle. I wished for a website that could just dump all the math problems available in the world out there. And if I could filter the problems by topics, that would be beautiful.<p>teachyourselfmath is a side project that was born out of this need. At its core, it is a math PDF extraction engine. The engine has some machine learning going on behind the scenes to extract math problems in LaTeX from any image or document.<p>A little bit about me: I am Vivek, a software engineer based out of India with a diverse set of interests including math. This project is close to my heart for many different reasons and nothing would make me happier than finding people on the internet who would find this website to be useful.<p>I’d love to hear your feedback on this. Thanks!
Show HN: Startup funding simulator
Hi HN<p>We built a tool to help founders understand how modern fundraising (with safes) works, and how much dilution you can expect when raising money.<p>The project is open-source. The code is a mess right now, but it'll get better I promise. You can also help with that.<p>We didn't build this to make money. We genuinely did it because we were looking for it, and couldn't find it.<p>We're in fact in the process of fundraising for a company, and at first glance the process looks simple. Just an excel sheet will do! But then the more we dug into it and tried different simulators, the more we realized that it's more complex than it looks.<p>We even signed up to Pulley, Carta and others just to run simulations. But they're a bit confusing.<p>TL;DR: Understanding modern startup funding and knowing how much dilution you'll face is hard. We built a tool that'll hopefully help with that. You can add Post-money Safes, priced rounds and issue options to employees, and you can see how that affects your ownership at every step. You can also simulate an Exit scenario and see how much money you'll be left with.<p>---<p>Some examples of complex stuff:<p>- There are many different types of safes. They all convert at the first priced round, but in different ways. Some are through discount, some are uncapped, some have a fixed valuation cap, and some have both a discount and a valuation cap.<p>- All safes (before first priced round) convert at the same time. They don't dilute each other, which is what happens in the rest of fundraising.<p>- Investors often require you to set aside some options. This one is particularily nasty. Basically, if an investor expects you to set aside 10% as options, and expects to get 10% equity, that's what should appear in the subsequent cap table. However, calculating the options is difficult, and is often a circular calculation (even Kirsty Nathoo from YC says it's complex and avoids showing the calculation in the Safe video "Understanding SAFEs and Priced Equity Rounds")<p>- Safes and priced rounds can have pro-rata, but don't always exercise it<p>- Pro-ratas of safes are taken from the priced round money, so you'd expect the safe holder's equity to remain the same if they exercise it. BUT ... it gets diluted by the new options issued.<p>- Safes can have an MFN provision, which defers the valuation discussion/calculation until the moment the priced round is about to close. With a mix of discounts, uncapped and valuation caps, it gets tricky to know which deal is "better".<p>- ...<p>Assumptions and limitations:<p>- Only post-money safes and priced rounds.<p>- No down rounds. There's a bit more complexity around liquidation preferences and anti-dilution rights - we don't support that now. It only matters if you're simulating a "bad" situation. But come on, it's a simulator — Be optimistic.<p>- No pro-rata caps. We might add that soon, to fully support the YC standard deal. But for now, if an investor gets a pro-rata, they can exercise either all of it (keeping their original ownership) or none.<p>- Safes' pro-ratas disappear after the first priced round. (I think this is what happens normally?)<p>- Remaining available options get redistributed evenly at exit.<p>- The round is the investor. For the sake of simplicity, consider "Series A" as the combination of all series A investors into one, super-investor.<p>Let us know what you think!
Show HN: Startup funding simulator
Hi HN<p>We built a tool to help founders understand how modern fundraising (with safes) works, and how much dilution you can expect when raising money.<p>The project is open-source. The code is a mess right now, but it'll get better I promise. You can also help with that.<p>We didn't build this to make money. We genuinely did it because we were looking for it, and couldn't find it.<p>We're in fact in the process of fundraising for a company, and at first glance the process looks simple. Just an excel sheet will do! But then the more we dug into it and tried different simulators, the more we realized that it's more complex than it looks.<p>We even signed up to Pulley, Carta and others just to run simulations. But they're a bit confusing.<p>TL;DR: Understanding modern startup funding and knowing how much dilution you'll face is hard. We built a tool that'll hopefully help with that. You can add Post-money Safes, priced rounds and issue options to employees, and you can see how that affects your ownership at every step. You can also simulate an Exit scenario and see how much money you'll be left with.<p>---<p>Some examples of complex stuff:<p>- There are many different types of safes. They all convert at the first priced round, but in different ways. Some are through discount, some are uncapped, some have a fixed valuation cap, and some have both a discount and a valuation cap.<p>- All safes (before first priced round) convert at the same time. They don't dilute each other, which is what happens in the rest of fundraising.<p>- Investors often require you to set aside some options. This one is particularily nasty. Basically, if an investor expects you to set aside 10% as options, and expects to get 10% equity, that's what should appear in the subsequent cap table. However, calculating the options is difficult, and is often a circular calculation (even Kirsty Nathoo from YC says it's complex and avoids showing the calculation in the Safe video "Understanding SAFEs and Priced Equity Rounds")<p>- Safes and priced rounds can have pro-rata, but don't always exercise it<p>- Pro-ratas of safes are taken from the priced round money, so you'd expect the safe holder's equity to remain the same if they exercise it. BUT ... it gets diluted by the new options issued.<p>- Safes can have an MFN provision, which defers the valuation discussion/calculation until the moment the priced round is about to close. With a mix of discounts, uncapped and valuation caps, it gets tricky to know which deal is "better".<p>- ...<p>Assumptions and limitations:<p>- Only post-money safes and priced rounds.<p>- No down rounds. There's a bit more complexity around liquidation preferences and anti-dilution rights - we don't support that now. It only matters if you're simulating a "bad" situation. But come on, it's a simulator — Be optimistic.<p>- No pro-rata caps. We might add that soon, to fully support the YC standard deal. But for now, if an investor gets a pro-rata, they can exercise either all of it (keeping their original ownership) or none.<p>- Safes' pro-ratas disappear after the first priced round. (I think this is what happens normally?)<p>- Remaining available options get redistributed evenly at exit.<p>- The round is the investor. For the sake of simplicity, consider "Series A" as the combination of all series A investors into one, super-investor.<p>Let us know what you think!
Show HN: Retriever – Securely share secrets over the internet
Retriever (<a href="https://retriever.corgea.io/" rel="nofollow">https://retriever.corgea.io/</a>), an open-source research project to help users receive secrets and sensitive information without needing a server in the middle. It works by using Public-key cryptography to coordinate the message sharing between the two devices.<p>Read more on why we built it here: <a href="https://retriever.corgea.io/why.html" rel="nofollow">https://retriever.corgea.io/why.html</a>
Show HN: Retriever – Securely share secrets over the internet
Retriever (<a href="https://retriever.corgea.io/" rel="nofollow">https://retriever.corgea.io/</a>), an open-source research project to help users receive secrets and sensitive information without needing a server in the middle. It works by using Public-key cryptography to coordinate the message sharing between the two devices.<p>Read more on why we built it here: <a href="https://retriever.corgea.io/why.html" rel="nofollow">https://retriever.corgea.io/why.html</a>
Show HN: Built a self hosted clean status page and batteries
Status pages have been the way they have seen the mid 2010s. There are few new ones but they are paid. So I decided to build this using svelte + sveltekit. It has all the necessary features. Few are yet to be built. Do check it out
Show HN: Nutrient insights through your grocery receipts
Nutri is still in beta and the GPT-powered results are sometimes inaccurate. The nutrient information accuracy is good to get an overview, but there are still outliers at times. I'm looking to improve the accuracy through food databases. Furthermore, I'd like to add additional tips for combining / preparing food to improve its nutritional value. For example, iron absorption is improved through vitamin C, so combine chickpeas or leafy greens with lemon. Or combine beans with rice to get all amino acids.<p>On the UX side, I'd like to integrate a QR code on the desktop version to easily upload receipts through the phone. Furthermore, it would be great to have analytics over weeks on nutrient improvements over time. Nutri could also be a great accountability partner to track items high in sugar / processed foods.<p>What do you think?
Show HN: I made an app people call "Airdrop for Android"
Show HN: I made an app people call "Airdrop for Android"
Show HN: Coffeehouse, one-on-one voicechat with random HN users
I made a website to share rejection letters
Hi HN,<p>First time posting on Show HN. Spent two weeks over Christmas and new years to make this fun little full stack web app built with Next.js and Supabase PostgreSQL, hosted on Netlify<p>open to feedback and hope you enjoy it!
Show HN: Shadeup – A language that makes WebGPU easier
Show HN: Htmldocs – Typeset and generate pdfs with HTML/CSS
htmldocs is an Overleaf-style editor for typesetting documents using HTML/CSS, which provides the same benefits as LaTeX while being more accessible, customizable, and familiar.<p>I built this because I wanted to programatically generate invoices as well as automatically tailor my resume to jobs but had no good way of generating well-formatted PDFs. I ended up building a templating engine to Chromium rendering pipeline to generate PDFs, and due to the amount of engineering effort, turned it into a tool for others that might want to do the same. There's a built-in API (<a href="https://htmldocs.com/docs/documents" rel="nofollow">https://htmldocs.com/docs/documents</a>) that you can call to turn JSON into PDFs in a single call.<p>htmldocs is different from other tools like Wkhtmltopdf and Weasyprint in that it uses Chromium to generate PDFs, meaning that it supports the most modern CSS features and there's minimal drift between the rendered HTML document and PDF.<p>Will also consider open sourcing if there's enough interest in the project!
Show HN: I made a website to find best bus seat to avoid the sun while traveling
Show HN: Material Files – Open Source Material Design File Manager for Android
Features:<p>- Open source: Lightweight, clean and secure.<p>- Material Design: Follows Material Design guidelines, with attention into details.<p>- Breadcrumbs: Navigate in the filesystem with ease.<p>- Root support: View and manage files with root access.<p>- Archive support: View, extract and create common compressed files.<p>- NAS support: View and manage files on FTP, SFTP and SMB servers.<p>- Themes: Customizable UI colors, plus night mode with optional true black.<p>- Linux-aware: Knows symbolic links, file permissions and SELinux context.<p>- Robust: Uses Linux system calls under the hood, not yet another ls parser.<p>- Well-implemented: Built upon the right things, including Java NIO2 File API and LiveData.
Show HN: Material Files – Open Source Material Design File Manager for Android
Features:<p>- Open source: Lightweight, clean and secure.<p>- Material Design: Follows Material Design guidelines, with attention into details.<p>- Breadcrumbs: Navigate in the filesystem with ease.<p>- Root support: View and manage files with root access.<p>- Archive support: View, extract and create common compressed files.<p>- NAS support: View and manage files on FTP, SFTP and SMB servers.<p>- Themes: Customizable UI colors, plus night mode with optional true black.<p>- Linux-aware: Knows symbolic links, file permissions and SELinux context.<p>- Robust: Uses Linux system calls under the hood, not yet another ls parser.<p>- Well-implemented: Built upon the right things, including Java NIO2 File API and LiveData.
Show HN: Citadel – a Calibre-compatible eBook management app
Hey folks! This winter I've been building Citadel to scratch my itch of managing ebooks without using Calibre. Calibre is incredibly powerful, but it's slow and awkward to use.<p>I dreamed of writing a native app (and originally tried this in Swift), but ran into issues building the UI. Plus, whatever I built would only work on macOS. I started writing Citadel using Tauri (Svelte on the frontend + Rust on the backend) to have a cross-platform desktop app. Plus, Citadel supports running in a headless / webbrowser mode. You can self-host a Citadel server that manages your library, and connect to it from anywhere with the web.<p>This is SUPER early software. Honestly, I'm a bit embarrassed to post it here — but I wanted folks to know that I'm building a tool to replace Calibre. If you'd like to help build it, I'd love the help! If you just want to follow the journey, please do!
Show HN: Citadel – a Calibre-compatible eBook management app
Hey folks! This winter I've been building Citadel to scratch my itch of managing ebooks without using Calibre. Calibre is incredibly powerful, but it's slow and awkward to use.<p>I dreamed of writing a native app (and originally tried this in Swift), but ran into issues building the UI. Plus, whatever I built would only work on macOS. I started writing Citadel using Tauri (Svelte on the frontend + Rust on the backend) to have a cross-platform desktop app. Plus, Citadel supports running in a headless / webbrowser mode. You can self-host a Citadel server that manages your library, and connect to it from anywhere with the web.<p>This is SUPER early software. Honestly, I'm a bit embarrassed to post it here — but I wanted folks to know that I'm building a tool to replace Calibre. If you'd like to help build it, I'd love the help! If you just want to follow the journey, please do!