The best Hacker News stories from Show from the past day

Go back

Latest posts:

Show HN: We scaled Git to support 1 TB repos

I’ve been in the MLOps space for ~10 years, and data is still the hardest unsolved open problem. Code is versioned using Git, data is stored somewhere else, and context often lives in a 3rd location like Slack or GDocs. This is why we built XetHub, a platform that enables teams to treat data like code, using Git.<p>Unlike Git LFS, we don’t just store the files. We use content-defined chunking and Merkle Trees to dedupe against everything in history. This allows small changes in large files to be stored compactly. Read more here: <a href="https://xethub.com/assets/docs/how-xet-deduplication-works" rel="nofollow">https://xethub.com/assets/docs/how-xet-deduplication-works</a><p>Today, XetHub works for 1 TB repositories, and we plan to scale to 100 TB in the next year. Our implementation is in Rust (client & cache + storage) and our web application is written in Go. XetHub includes a GitHub-like web interface that provides automatic CSV summaries and allows custom visualizations using Vega. Even at 1 TB, we know downloading an entire repository is painful, so we built git-xet mount - which, in seconds, provides a user-mode filesystem view over the repo.<p>XetHub is available today (Linux & Mac today, Windows coming soon) and we would love your feedback!<p>Read more here:<p>- <a href="https://xetdata.com/blog/2022/10/15/why-xetdata" rel="nofollow">https://xetdata.com/blog/2022/10/15/why-xetdata</a><p>- <a href="https://xetdata.com/blog/2022/12/13/introducing-xethub" rel="nofollow">https://xetdata.com/blog/2022/12/13/introducing-xethub</a>

Show HN: We scaled Git to support 1 TB repos

I’ve been in the MLOps space for ~10 years, and data is still the hardest unsolved open problem. Code is versioned using Git, data is stored somewhere else, and context often lives in a 3rd location like Slack or GDocs. This is why we built XetHub, a platform that enables teams to treat data like code, using Git.<p>Unlike Git LFS, we don’t just store the files. We use content-defined chunking and Merkle Trees to dedupe against everything in history. This allows small changes in large files to be stored compactly. Read more here: <a href="https://xethub.com/assets/docs/how-xet-deduplication-works" rel="nofollow">https://xethub.com/assets/docs/how-xet-deduplication-works</a><p>Today, XetHub works for 1 TB repositories, and we plan to scale to 100 TB in the next year. Our implementation is in Rust (client & cache + storage) and our web application is written in Go. XetHub includes a GitHub-like web interface that provides automatic CSV summaries and allows custom visualizations using Vega. Even at 1 TB, we know downloading an entire repository is painful, so we built git-xet mount - which, in seconds, provides a user-mode filesystem view over the repo.<p>XetHub is available today (Linux & Mac today, Windows coming soon) and we would love your feedback!<p>Read more here:<p>- <a href="https://xetdata.com/blog/2022/10/15/why-xetdata" rel="nofollow">https://xetdata.com/blog/2022/10/15/why-xetdata</a><p>- <a href="https://xetdata.com/blog/2022/12/13/introducing-xethub" rel="nofollow">https://xetdata.com/blog/2022/12/13/introducing-xethub</a>

Show HN: We scaled Git to support 1 TB repos

I’ve been in the MLOps space for ~10 years, and data is still the hardest unsolved open problem. Code is versioned using Git, data is stored somewhere else, and context often lives in a 3rd location like Slack or GDocs. This is why we built XetHub, a platform that enables teams to treat data like code, using Git.<p>Unlike Git LFS, we don’t just store the files. We use content-defined chunking and Merkle Trees to dedupe against everything in history. This allows small changes in large files to be stored compactly. Read more here: <a href="https://xethub.com/assets/docs/how-xet-deduplication-works" rel="nofollow">https://xethub.com/assets/docs/how-xet-deduplication-works</a><p>Today, XetHub works for 1 TB repositories, and we plan to scale to 100 TB in the next year. Our implementation is in Rust (client & cache + storage) and our web application is written in Go. XetHub includes a GitHub-like web interface that provides automatic CSV summaries and allows custom visualizations using Vega. Even at 1 TB, we know downloading an entire repository is painful, so we built git-xet mount - which, in seconds, provides a user-mode filesystem view over the repo.<p>XetHub is available today (Linux & Mac today, Windows coming soon) and we would love your feedback!<p>Read more here:<p>- <a href="https://xetdata.com/blog/2022/10/15/why-xetdata" rel="nofollow">https://xetdata.com/blog/2022/10/15/why-xetdata</a><p>- <a href="https://xetdata.com/blog/2022/12/13/introducing-xethub" rel="nofollow">https://xetdata.com/blog/2022/12/13/introducing-xethub</a>

Show HN: Why YC when there's YZ?

The Crowd Supported, Crowd Constructed, Opened sourced Acceleration program for the next generation of builders. Note: Website is just to get feedback initially.

Show HN: ChatGPT Prompts and Products

Show HN: Prompt Engineering Course

Show HN: Train your own image generator (Stable Diffusion) for free

I've had a blast playing with stable diffusion and I see all the potential it will bring to us. I released a service for training your model, just upload 20-30 images and you can have a model of someone or some object doing anything. You can train one model for free a month in a slower queue or you can train many models on a fast queue and with other features for a fee. Here is an example of using it for show product placement: <a href="https://app.88stacks.com/image/O6kReClOvrz7" rel="nofollow">https://app.88stacks.com/image/O6kReClOvrz7</a> and here is an example of using it for people: <a href="https://app.88stacks.com/image/nOpdvCwx6kb7" rel="nofollow">https://app.88stacks.com/image/nOpdvCwx6kb7</a> and an example for using it for styles: <a href="https://app.88stacks.com/image/zyjw6CmEgk2d" rel="nofollow">https://app.88stacks.com/image/zyjw6CmEgk2d</a> The UI is rough, but I would love feedback on how to improve it for you. <a href="https://88stacks.com" rel="nofollow">https://88stacks.com</a>

Show HN: Train your own image generator (Stable Diffusion) for free

I've had a blast playing with stable diffusion and I see all the potential it will bring to us. I released a service for training your model, just upload 20-30 images and you can have a model of someone or some object doing anything. You can train one model for free a month in a slower queue or you can train many models on a fast queue and with other features for a fee. Here is an example of using it for show product placement: <a href="https://app.88stacks.com/image/O6kReClOvrz7" rel="nofollow">https://app.88stacks.com/image/O6kReClOvrz7</a> and here is an example of using it for people: <a href="https://app.88stacks.com/image/nOpdvCwx6kb7" rel="nofollow">https://app.88stacks.com/image/nOpdvCwx6kb7</a> and an example for using it for styles: <a href="https://app.88stacks.com/image/zyjw6CmEgk2d" rel="nofollow">https://app.88stacks.com/image/zyjw6CmEgk2d</a> The UI is rough, but I would love feedback on how to improve it for you. <a href="https://88stacks.com" rel="nofollow">https://88stacks.com</a>

Show HN: Embed a snow effect on your website

Embed a snow effect on your website with one line of code (~1.3kB), and add some Christmas magic to your websites. A quick weekend project :)

Show HN: Embed a snow effect on your website

Embed a snow effect on your website with one line of code (~1.3kB), and add some Christmas magic to your websites. A quick weekend project :)

Show HN: Embed a snow effect on your website

Embed a snow effect on your website with one line of code (~1.3kB), and add some Christmas magic to your websites. A quick weekend project :)

Show HN: BeSanta, Real time Santa Claus AI avatars on your iPhone

I am the founder of animato.ai and just released "BeSanta", an iPhone app that you can use to create custom Santa Claus videos in real time, to bring some surprise and joy to your kids.<p>The Santa Claus characters were made with generative AI and run on the phone neural engine in 16ms or something like that.<p>Tired of the endless lines at the mall for meeting Santa Claus? Check it out :)

Show HN: Mult.dev – zero-config alternative to Google Earth studio

The powerful tool to create travel/geo infographics and animated maps. You can even add the stickers on the planet!<p>Example <a href="https://www.youtube.com/watch?v=Eq_DvPoCMmM" rel="nofollow">https://www.youtube.com/watch?v=Eq_DvPoCMmM</a>

Show HN: Mult.dev – zero-config alternative to Google Earth studio

The powerful tool to create travel/geo infographics and animated maps. You can even add the stickers on the planet!<p>Example <a href="https://www.youtube.com/watch?v=Eq_DvPoCMmM" rel="nofollow">https://www.youtube.com/watch?v=Eq_DvPoCMmM</a>

Show HN: Mult.dev – zero-config alternative to Google Earth studio

The powerful tool to create travel/geo infographics and animated maps. You can even add the stickers on the planet!<p>Example <a href="https://www.youtube.com/watch?v=Eq_DvPoCMmM" rel="nofollow">https://www.youtube.com/watch?v=Eq_DvPoCMmM</a>

Show HN: Panoptisch – A recursive dependency scanner for Python projects

Hello all,<p>Very excited to share this project with you all!<p>Panoptisch scans your Python file or module to find it's imports (aka dependencies) and recursively does so for all dependencies and sub-dependencies. It then generates a dependency tree in JSON for you to parse and enforce import policies.<p>Supply chain attacks are no joke, and this is one way to transparently analyze your dependencies to see if any malicious imports are taking place. For example, your yaml parser, nor it's sub-dependencies should import socket, or sys.<p>Panoptisch is in early stages, with known limitations (for now). I welcome feedback, testing and contributions.<p>Also, happy to answer any questions!

Show HN: Panoptisch – A recursive dependency scanner for Python projects

Hello all,<p>Very excited to share this project with you all!<p>Panoptisch scans your Python file or module to find it's imports (aka dependencies) and recursively does so for all dependencies and sub-dependencies. It then generates a dependency tree in JSON for you to parse and enforce import policies.<p>Supply chain attacks are no joke, and this is one way to transparently analyze your dependencies to see if any malicious imports are taking place. For example, your yaml parser, nor it's sub-dependencies should import socket, or sys.<p>Panoptisch is in early stages, with known limitations (for now). I welcome feedback, testing and contributions.<p>Also, happy to answer any questions!

Show HN: Interactive exercises for Linux CLI text processing commands

Last month, I started learning a Python TUI framework called Textual. After working on a 4x4 board game, I made an interactive app to test your CLI text processing skills with 40 beginner to intermediate level exercises. The app is fairly basic in terms of features — only single input file, so no stdin data, multiple files, etc.<p>Most of the exercises in this app is based on my Computing from the Command Line ebook (<a href="https://learnbyexample.github.io/cli-computing/preface.html" rel="nofollow">https://learnbyexample.github.io/cli-computing/preface.html</a>). I hope to improve the TUI app to support all the 200+ exercises sometime next year.

Show HN: Interactive exercises for Linux CLI text processing commands

Last month, I started learning a Python TUI framework called Textual. After working on a 4x4 board game, I made an interactive app to test your CLI text processing skills with 40 beginner to intermediate level exercises. The app is fairly basic in terms of features — only single input file, so no stdin data, multiple files, etc.<p>Most of the exercises in this app is based on my Computing from the Command Line ebook (<a href="https://learnbyexample.github.io/cli-computing/preface.html" rel="nofollow">https://learnbyexample.github.io/cli-computing/preface.html</a>). I hope to improve the TUI app to support all the 200+ exercises sometime next year.

Show HN: Pg_CRDT – an experimental CRDT extension for Postgres

This is an experimental extension for CRDTs, pg_crdt: GitHub repo[0]. It supports Yjs/Yrs and Automerge.<p>The linked blog post [1] describes how we're thinking about this extension in a Supabase context. I want to emphasise this part: "pg_crdt has not been released onto the Supabase platform (and it may never be). We’re considering many options for offline-sync/support and, while CRDTs will undoubtedly factor in, we’re not sure if this is the right approach."<p>[0] GitHub repo: <a href="https://github.com/supabase/pg_crdt" rel="nofollow">https://github.com/supabase/pg_crdt</a><p>[1] Blog post: <a href="https://supabase.com/blog/postgres-crdt" rel="nofollow">https://supabase.com/blog/postgres-crdt</a>

< 1 2 3 ... 591 592 593 594 595 ... 957 958 959 >