The best Hacker News stories from Show from the past day
Latest posts:
Show HN: A contact form that prunes spam
Show HN: Mp3tag for Mac – universal tag editor now on the Mac
Show HN: CompreFace is a free and open-source face recognition software
Show HN: Ad Network for Sideprojects
Show HN: Sabre – The bullshit-free (c) programming language
Show HN: A browser extension to use picture-in-picture with any website
Show HN: Vellum – An interactive list of nonfiction books reviewed by academics
Show HN: Archive as you browse, store locally and/or share with others via IPFS
Launch HN: HiGeorge (YC W21) – Real-time data visualizations for public datasets
Hi HN!

Anuj here. My co-founder Amir (Aazo11) and I are building HiGeorge (https://hi-george.com/). We make localized drag-and-drop data visualizations so that all publishers, even the small ones, can better leverage data in their storytelling. Think Tableau with all the necessary data attached.

At the onset of the pandemic Amir and I were looking for local data on the spread of the virus. We visited the sites of large national newsrooms like the NYTimes and were impressed by the quality of data visualizations and maps, but they lacked the geographic granularity for our own neighborhood.

We then turned to our local newsrooms but found they presented data in tables and lists that made it difficult to comprehend the virus’ spread and trends. We wondered why. After talking to local journalists and publishers, we found that newsrooms simply do not have the resources to make sense of large datasets.

Public datasets are hard to clean, poorly structured, and constantly updated. One publisher explained to us that she would refresh her state health department’s website 5 times a day waiting for updated COVID data, then manually download a CSV and clean it in Excel. This process could take hours, and it needed to happen every day.

This is where HiGeorge comes in. We clean and aggregate public datasets and turn them into auto-updating data visualizations that anyone can instantly use with a simple copy/paste. Our data visualizations can be drag-and-dropped into articles, allowing news publishers to offer compelling data content to their communities.

Check out a few versions of what we’re doing with customers: COVID-19 data reporting at North Carolina Health News [1], COVID-19 vaccine site mapping at SFGATE [2], real-time crime reporting in Dallas, TX [3], and police use of force at Mission Local [4].

Today, HiGeorge works with dozens of newsrooms across the country. Our visualizations have driven a 2x increase in pageviews and a 75% increase in session duration for our partner publishers. We charge a monthly subscription for access to our data visualization library – a fraction of the cost of an in-house data engineer. In the long run, we are building HiGeorge so that it becomes the single place to collaborate on and publish data content.

We’d love to hear from the HN community and we’ll be hanging out in the comments if you have any questions or feedback.

[1] https://www.northcarolinahealthnews.org/2021/02/09/coronavirus-today-feb-9-deaths-top-10000-vaccine-roll-out-focuses-on-equity-and-efficiency/
[2] https://www.sfgate.com/bayarea/article/vaccine-sites-San-Francisco-Bay-Area-appointments-15935161.php
[3] https://lakehighlands.advocatemag.com/2021/02/data-crime-trends-in-dallas-lake-highlands-in-early-february/
[4] https://missionlocal.org/crime-data/
Launch HN: MindsDB (YC W20) – Machine Learning Inside Your Database
Hi HN,

Adam and Jorge here, and today we’re very excited to share MindsDB with you (http://github.com/mindsdb/mindsdb). MindsDB AutoML Server is an open-source platform designed to accelerate machine learning workflows for people with data inside databases by introducing virtual AI tables. We allow you to create and consume machine learning models as regular database tables.

Jorge and I have been friends for many years, having first met at college. We have previously founded and failed at another startup, but we stuck together as a team to start MindsDB. Initially a passion project, MindsDB began as an idea to help those who could not afford to hire a team of data scientists, which at the time was (and still is) very expensive. It has since grown into a thriving open-source community with contributors and users all over the globe.

With the plethora of data available in databases today, predictive modeling can often be a pain, especially if you need to write complex applications for ingesting data, training encoders and embedders, writing sampling algorithms, training models, optimizing, scheduling, versioning, moving models into production environments, maintaining them and then having to explain the predictions and the degree of confidence… we knew there had to be a better way!

We aim to steer you away from constantly reinventing the wheel by abstracting most of the unnecessary complexities around building, training, and deploying machine learning models. MindsDB provides you with two techniques for this: build and train models as simply as you would write an SQL query, and seamlessly “publish” and manage machine learning models as virtual tables inside your databases (we support ClickHouse, MariaDB, MySQL, PostgreSQL, and MSSQL. MongoDB is coming soon.) We also support getting data from other sources, such as Snowflake, S3, SQLite, and any Excel, JSON, or CSV file.

When we talk to our community, we find that they are using MindsDB for anything ranging from reducing financial risk in the payments sector to predicting in-app usage statistics - one user is even trying to predict the price of Bitcoin using sentiment analysis (we wish them luck). No matter what the use case, what we hear most often is that the two most painful parts of the whole process are model generation (R&D) and/or moving the model into production.

For those who already have models (i.e. who have already done the R&D part), we are launching the ability to bring your own models from frameworks like PyTorch, TensorFlow, scikit-learn, Keras, XGBoost, CatBoost, LightGBM, etc. directly into your database. If you’d like to try this experimental feature, you can sign up here: (https://mindsdb.com/bring-your-own-ml-models)

We currently have a handful of customers who pay us for support. However, we will soon be launching a cloud version of MindsDB for those who do not want to worry about DevOps, scalability, and managing GPU clusters. Nevertheless, MindsDB will always remain free and open-source, because democratizing machine learning is at the core of every decision we make.

We’re making good progress thanks to our open-source community and are also grateful to have the backing of the founders of MySQL & MariaDB. We would love your feedback and invite you to try it out.

We’d also love to hear about your experience, so please share your feedback, thoughts, comments, and ideas below. https://docs.mindsdb.com/ or https://mindsdb.com/

Thanks in advance,
Adam & Jorge
Show HN: Split Keyboards Gallery