The best Hacker News stories from Show from the past day
Latest posts:
Show HN: Build an open-source computer vision model in seconds using text
Hello HN! I want to share something me and a few friends have been working on for a while now — Zeroshot, a web tool that builds image classifiers using text-image models and autolabeling. What does this mean in practice? You can put together an image classifier in about 30 seconds that’s faster and more accurate than CLIP, but that you can deploy yourself however you’d like. It’s open source, commercially licensed, and doesn’t require you to pay anyone per API call.<p>Here's a 2 minute video that shows it off: <a href="https://www.youtube.com/watch?v=S4R1gtmM-Lo" rel="nofollow noreferrer">https://www.youtube.com/watch?v=S4R1gtmM-Lo</a><p>How/why does it work?
We believe that with the rise of foundation vision models, computer vision will fundamentally change. These powerful models will let any devs “compile” a model ahead of time with a subset of the foundation model’s characteristics, using only text and a web-tool. The days of teams of MLEs building complex models and pipelines are ending.<p>Zeroshot works by using two powerful pre-trained models, CLIP and DINOv2 together. The web-app allows users to quickly create our training sets via text search. Using pre-cached DINOv2 features, we generate a simple linear model that can be trained and deployed without any fine-tuning. Since you can see what’s going into your training set, you can tune your prompts to get the type of performance or detail you want.<p>CLIP Small -- Size: 335 MB, Latency: 35ms<p>CLIP Large -- Size: 891 MB, Latency: 276ms<p>Zeroshot -- Size: 85 MB, Latency: 20ms<p>What’s next?
We wanna see how people use or would use the tool before deciding what to do next. On the list: clients for iOS and NodeJS, speeding up GPU inference times via TensorRT, offering larger Zeroshot models for better accuracy, easier results refining, support for bringing your own data lake, model refinement using GPT-V, we’ve got plenty of ideas.
Show HN: SwaraNotebook – a notation editor for Indian classical music
As an enthusiast of Indian classical music, I needed to write music notations in the traditional typeset format. When I didn't find any existing editors, I developed a Swara Notebook, a mobile focused web app to write North Indian Classical (<i>Hindustani</i>) music notations.<p>The notes (called <i>Sargam</i>, similar to Solfege) can be written in English, Devnagri and Bangla scripts. The transcribed song can be played back in 6 different rhythmic cycles (<i>Taal</i>) to the accompaniment of the <i>Tabla</i>(a type of drum) or a metronome. Here's an example of a transcribed song <a href="https://swaranotebook.com/view/vlB9hVgb5OdKcadbhOOYyzKwvpl2/53219a9192a5-Piyarava-ab-tum" rel="nofollow noreferrer">https://swaranotebook.com/view/vlB9hVgb5OdKcadbhOOYyzKwvpl2/...</a><p>Since North Indian classical music is oriented around <i>Ragas</i> (similar to modes in western classical music), the keyboard adapts to the notes of a <i>Raga</i>, making it contextually easier to key in notes.<p>A common question I get: can it handle microtones? I chose to not support microtones and other pitch ornamentation such as glissando (called <i>Meend</i>) since a mobile interface is not the easiest place to add such details.<p>I also find it a useful tool for ear training, by typing out notations to songs I know, and playing it back to know if I guessed the notes right.<p>It is an open source project written in Clojure/script
<a href="https://github.com/Studio-kalavati/bandish-editor">https://github.com/Studio-kalavati/bandish-editor</a>
Show HN: SwaraNotebook – a notation editor for Indian classical music
As an enthusiast of Indian classical music, I needed to write music notations in the traditional typeset format. When I didn't find any existing editors, I developed a Swara Notebook, a mobile focused web app to write North Indian Classical (<i>Hindustani</i>) music notations.<p>The notes (called <i>Sargam</i>, similar to Solfege) can be written in English, Devnagri and Bangla scripts. The transcribed song can be played back in 6 different rhythmic cycles (<i>Taal</i>) to the accompaniment of the <i>Tabla</i>(a type of drum) or a metronome. Here's an example of a transcribed song <a href="https://swaranotebook.com/view/vlB9hVgb5OdKcadbhOOYyzKwvpl2/53219a9192a5-Piyarava-ab-tum" rel="nofollow noreferrer">https://swaranotebook.com/view/vlB9hVgb5OdKcadbhOOYyzKwvpl2/...</a><p>Since North Indian classical music is oriented around <i>Ragas</i> (similar to modes in western classical music), the keyboard adapts to the notes of a <i>Raga</i>, making it contextually easier to key in notes.<p>A common question I get: can it handle microtones? I chose to not support microtones and other pitch ornamentation such as glissando (called <i>Meend</i>) since a mobile interface is not the easiest place to add such details.<p>I also find it a useful tool for ear training, by typing out notations to songs I know, and playing it back to know if I guessed the notes right.<p>It is an open source project written in Clojure/script
<a href="https://github.com/Studio-kalavati/bandish-editor">https://github.com/Studio-kalavati/bandish-editor</a>
Show HN:Draw Fast - Real-time AI image generation based on drawings in a canvas
Show HN: Gitea Cloud: A brand new platform for managed Gitea Instances
Show HN: I built a guided Build your own DNS Server challenge
Hey everyone. It's Sherub here, author of the Build your own DNS Server challenge on CodeCrafters. Currently it’s available in Rust, Go, and Python and is free while in beta.<p><a href="https://codecrafters.io/dns-server">https://codecrafters.io/dns-server</a><p>I've kept the challenge accessible but still challenging for an intermediate developer. This challenge, like others from CodeCrafters, is self-paced. You can use any tools you prefer (terminal, editor, etc.) to build the project.<p>At the end of the challenge, you will have created a DNS forwarding server. The server can create and read DNS packets and respond to DNS queries. As you go, you'll learn about the DNS protocol, its format, servers, and A records. All while getting to hone your language skills.<p>Some of the challenges and choices I had to make:<p>* To make the stages easier, I had to break them up, such that each step builds on the previous one. This was especially challenging for the 2nd stage, where we write a DNS packet's header contents. Even though I’d have liked it to be easier, breaking it up further would have been weird.<p>* Instead of implementing a recursive resolver, I've restricted to a forwarding server. We made this decision so that most developers can still use it. To add more complexity, we can use a challenge extension (noted below).<p>* Deciding how much instruction and context the stages should provide. I’ve decided to keep them as thorough as possible for most of the stages. Developers can choose to have thorough details or just skim through them.<p>I would love your feedback and questions on the challenge. You can try it out for free here: <a href="https://codecrafters.io/dns-server">https://codecrafters.io/dns-server</a> (no CC required).<p>I also have challenge extensions planned. You can find them at <a href="https://app.codecrafters.io/vote/challenge-extension-ideas?course=dns-server">https://app.codecrafters.io/vote/challenge-extension-ideas?c...</a>. I'm also keen to hear what you think about the extension ideas.
Show HN: I built a guided Build your own DNS Server challenge
Hey everyone. It's Sherub here, author of the Build your own DNS Server challenge on CodeCrafters. Currently it’s available in Rust, Go, and Python and is free while in beta.<p><a href="https://codecrafters.io/dns-server">https://codecrafters.io/dns-server</a><p>I've kept the challenge accessible but still challenging for an intermediate developer. This challenge, like others from CodeCrafters, is self-paced. You can use any tools you prefer (terminal, editor, etc.) to build the project.<p>At the end of the challenge, you will have created a DNS forwarding server. The server can create and read DNS packets and respond to DNS queries. As you go, you'll learn about the DNS protocol, its format, servers, and A records. All while getting to hone your language skills.<p>Some of the challenges and choices I had to make:<p>* To make the stages easier, I had to break them up, such that each step builds on the previous one. This was especially challenging for the 2nd stage, where we write a DNS packet's header contents. Even though I’d have liked it to be easier, breaking it up further would have been weird.<p>* Instead of implementing a recursive resolver, I've restricted to a forwarding server. We made this decision so that most developers can still use it. To add more complexity, we can use a challenge extension (noted below).<p>* Deciding how much instruction and context the stages should provide. I’ve decided to keep them as thorough as possible for most of the stages. Developers can choose to have thorough details or just skim through them.<p>I would love your feedback and questions on the challenge. You can try it out for free here: <a href="https://codecrafters.io/dns-server">https://codecrafters.io/dns-server</a> (no CC required).<p>I also have challenge extensions planned. You can find them at <a href="https://app.codecrafters.io/vote/challenge-extension-ideas?course=dns-server">https://app.codecrafters.io/vote/challenge-extension-ideas?c...</a>. I'm also keen to hear what you think about the extension ideas.
Show HN: A Dalle-3 and GPT4-Vision feedback loop
I used to enjoy Translation Party, and over the weekend I realized that we can build the same feedback loop with DALLE-3 and GPT4-Vision. Start with a text prompt, let DALLE-3 generate an image, then GPT-4 Vision turns that image back into a text prompt, DALLE-3 creates another image, and so on.<p>You need to bring your own OpenAI API key (costs about $0.10/run)<p>Some prompts are very stable, others go wild. If you bias GPT4's prompting by telling it to "make it weird" you can get crazy results.<p>Here's a few of my favorites:<p>- Gnomes: <a href="https://dalle.party/?party=k4eeMQ6I" rel="nofollow noreferrer">https://dalle.party/?party=k4eeMQ6I</a><p>- Start with a sailboat but bias GPT4V to "replace everything with cats": <a href="https://dalle.party/?party=0uKfJjQn" rel="nofollow noreferrer">https://dalle.party/?party=0uKfJjQn</a><p>- A more stable one (but everyone is always an actor): <a href="https://dalle.party/?party=oxpeZKh5" rel="nofollow noreferrer">https://dalle.party/?party=oxpeZKh5</a>
Show HN: A Dalle-3 and GPT4-Vision feedback loop
I used to enjoy Translation Party, and over the weekend I realized that we can build the same feedback loop with DALLE-3 and GPT4-Vision. Start with a text prompt, let DALLE-3 generate an image, then GPT-4 Vision turns that image back into a text prompt, DALLE-3 creates another image, and so on.<p>You need to bring your own OpenAI API key (costs about $0.10/run)<p>Some prompts are very stable, others go wild. If you bias GPT4's prompting by telling it to "make it weird" you can get crazy results.<p>Here's a few of my favorites:<p>- Gnomes: <a href="https://dalle.party/?party=k4eeMQ6I" rel="nofollow noreferrer">https://dalle.party/?party=k4eeMQ6I</a><p>- Start with a sailboat but bias GPT4V to "replace everything with cats": <a href="https://dalle.party/?party=0uKfJjQn" rel="nofollow noreferrer">https://dalle.party/?party=0uKfJjQn</a><p>- A more stable one (but everyone is always an actor): <a href="https://dalle.party/?party=oxpeZKh5" rel="nofollow noreferrer">https://dalle.party/?party=oxpeZKh5</a>
Show HN: A Dalle-3 and GPT4-Vision feedback loop
I used to enjoy Translation Party, and over the weekend I realized that we can build the same feedback loop with DALLE-3 and GPT4-Vision. Start with a text prompt, let DALLE-3 generate an image, then GPT-4 Vision turns that image back into a text prompt, DALLE-3 creates another image, and so on.<p>You need to bring your own OpenAI API key (costs about $0.10/run)<p>Some prompts are very stable, others go wild. If you bias GPT4's prompting by telling it to "make it weird" you can get crazy results.<p>Here's a few of my favorites:<p>- Gnomes: <a href="https://dalle.party/?party=k4eeMQ6I" rel="nofollow noreferrer">https://dalle.party/?party=k4eeMQ6I</a><p>- Start with a sailboat but bias GPT4V to "replace everything with cats": <a href="https://dalle.party/?party=0uKfJjQn" rel="nofollow noreferrer">https://dalle.party/?party=0uKfJjQn</a><p>- A more stable one (but everyone is always an actor): <a href="https://dalle.party/?party=oxpeZKh5" rel="nofollow noreferrer">https://dalle.party/?party=oxpeZKh5</a>
Show HN: A Dalle-3 and GPT4-Vision feedback loop
I used to enjoy Translation Party, and over the weekend I realized that we can build the same feedback loop with DALLE-3 and GPT4-Vision. Start with a text prompt, let DALLE-3 generate an image, then GPT-4 Vision turns that image back into a text prompt, DALLE-3 creates another image, and so on.<p>You need to bring your own OpenAI API key (costs about $0.10/run)<p>Some prompts are very stable, others go wild. If you bias GPT4's prompting by telling it to "make it weird" you can get crazy results.<p>Here's a few of my favorites:<p>- Gnomes: <a href="https://dalle.party/?party=k4eeMQ6I" rel="nofollow noreferrer">https://dalle.party/?party=k4eeMQ6I</a><p>- Start with a sailboat but bias GPT4V to "replace everything with cats": <a href="https://dalle.party/?party=0uKfJjQn" rel="nofollow noreferrer">https://dalle.party/?party=0uKfJjQn</a><p>- A more stable one (but everyone is always an actor): <a href="https://dalle.party/?party=oxpeZKh5" rel="nofollow noreferrer">https://dalle.party/?party=oxpeZKh5</a>
Show HN: Trains.fyi – a live map of passenger trains in the US and Canada
Hey all! My train the other day was delayed and I got curious where they all were at any given time, so I built a map and figured I'd share it.
Show HN: Trains.fyi – a live map of passenger trains in the US and Canada
Hey all! My train the other day was delayed and I got curious where they all were at any given time, so I built a map and figured I'd share it.
Show HN: Trains.fyi – a live map of passenger trains in the US and Canada
Hey all! My train the other day was delayed and I got curious where they all were at any given time, so I built a map and figured I'd share it.
Show HN: GitAware – A simple browser extension that helps you stay on top of MRs
Show HN: We made an interactive coloring experience for kids
Hi HN community!<p>Proud to have recently become a founder myself and now to submit a "Show HN".<p>We are Dibulo, a young startup which is on a mission to redefine coloring sessions for kids! We have built our first worlds, sold our first coloring books and now we want to let the world know about us :) So far we have been praised by parents, and also got pretty good feedback from institutions like daycares and elementary school teachers.<p>What sets our coloring experience apart? Not only do we bring vibrant templates on paper to life on screens, but we also introduce a captivating group activity. Picture this: kids enthusiastically coloring away, their creations showcased prominently on a big screen for all to admire.<p>For us it is crucial that the user experience is smooth and easy. Our platform works in pretty much all modern browsers and on all major devices (PC, laptop, tablet, AndroidTV / FireTV). Setting up a coloring session on Dibulo takes less time than a kid can color our templates and uploading is as easy as snapping a picture. You can even share the coloring session with your family and friends and if they like they can participate and upload their drawings too (remotely).<p>Feel free to download and print our free templates and try them with your kids / nephews / at your daycare.<p>Thank you for checking us out! Looking forward to receiving feedback!
Show HN: Minimalistic UI Library
CodeSnaps is a React and Tailwind CSS UI component library with dark mode support. It helps developers to design faster and better with copy-and-paste code snippets without building their MVPs and apps from scratch.
Show HN: An open-source notification infrastructure
Show HN: An open-source notification infrastructure
Sopwith – a classic bi-plane shoot 'em up from 1984 in the browser
src: <a href="https://github.com/midzer/sdl-sopwith">https://github.com/midzer/sdl-sopwith</a><p>via: <a href="https://fragglet.github.io/sdl-sopwith/" rel="nofollow noreferrer">https://fragglet.github.io/sdl-sopwith/</a>