Visualizing 13M Bluesky Users
19 by joelg | 3 comments on Hacker News.
Tuesday, November 12, 2024
New top story on Hacker News: Large Language Models in National Security Applications
Large Language Models in National Security Applications
24 by bindidwodtj | 1 comments on Hacker News.
24 by bindidwodtj | 1 comments on Hacker News.
New top story on Hacker News: The Future of Programming: Copilots vs. Agents (Part I)
The Future of Programming: Copilots vs. Agents (Part I)
17 by thunderbong | 1 comments on Hacker News.
17 by thunderbong | 1 comments on Hacker News.
Monday, November 11, 2024
Sunday, November 10, 2024
Saturday, November 9, 2024
Friday, November 8, 2024
Thursday, November 7, 2024
Wednesday, November 6, 2024
New top story on Hacker News: Politicians are Jungian symbols, policies are facades
Politicians are Jungian symbols, policies are facades
5 by SherryFraser | 1 comments on Hacker News.
5 by SherryFraser | 1 comments on Hacker News.
New top story on Hacker News: Launch HN: Midship (YC S24) – Turn PDFs and Images into usable data
Launch HN: Midship (YC S24) – Turn PDFs and Images into usable data
10 by maxmaio | 11 comments on Hacker News.
Hey HN, we are Max, Kieran, and Aahel from Midship ( https://midship.ai ). Midship makes it easy to extract data from unstructured documents like pdfs and images. Here’s a video showing it in action: https://ift.tt/yJkgo3B?... , and a demo playground (no signup required!) to test it out: https://ift.tt/1N7pAZG We started 5 months ago initially trying to make an AI natural language workflow builder that would be a simpler alternative to Zapier or Make.com. However, most of our users seemed to be much more interested in the basic (and not very good) document extraction feature we had. Seeing how people were spending hours a day manually extracting data from pdfs inspired us to build what has become Midship! The problem is that despite all our progress in software, huge amounts of business data still lives in PDFs and images. Sure, you can OCR them, but getting clean, structured data out is still painful. Most existing tools just give you a blob of markdown - leaving you to figure out which parts matter and how they relate. We've found that combining OCR with language models lets us do something more useful: extract specific fields and tables that users actually care about. The LLMs help correct OCR mistakes and understand context (like knowing that "Inv#" and "Invoice Number" mean the same thing). We have two main kinds of users today, non-technical users that extract data via our web app and developers who use our extraction api. We were initially focused on the first one as they seemed like an underserved part of the market, but we’ve received a lot of interest from developers who face the same issues. For pricing, we currently charge a monthly Saas fee per seat for the web app and a volume based pricing for the API. We’re really excited to share what we’ve built so far and look forward to any feedback from the community!
10 by maxmaio | 11 comments on Hacker News.
Hey HN, we are Max, Kieran, and Aahel from Midship ( https://midship.ai ). Midship makes it easy to extract data from unstructured documents like pdfs and images. Here’s a video showing it in action: https://ift.tt/yJkgo3B?... , and a demo playground (no signup required!) to test it out: https://ift.tt/1N7pAZG We started 5 months ago initially trying to make an AI natural language workflow builder that would be a simpler alternative to Zapier or Make.com. However, most of our users seemed to be much more interested in the basic (and not very good) document extraction feature we had. Seeing how people were spending hours a day manually extracting data from pdfs inspired us to build what has become Midship! The problem is that despite all our progress in software, huge amounts of business data still lives in PDFs and images. Sure, you can OCR them, but getting clean, structured data out is still painful. Most existing tools just give you a blob of markdown - leaving you to figure out which parts matter and how they relate. We've found that combining OCR with language models lets us do something more useful: extract specific fields and tables that users actually care about. The LLMs help correct OCR mistakes and understand context (like knowing that "Inv#" and "Invoice Number" mean the same thing). We have two main kinds of users today, non-technical users that extract data via our web app and developers who use our extraction api. We were initially focused on the first one as they seemed like an underserved part of the market, but we’ve received a lot of interest from developers who face the same issues. For pricing, we currently charge a monthly Saas fee per seat for the web app and a volume based pricing for the API. We’re really excited to share what we’ve built so far and look forward to any feedback from the community!
Tuesday, November 5, 2024
Monday, November 4, 2024
New top story on Hacker News: Facebook Building Subsea Cable That Will Encompass the World
Facebook Building Subsea Cable That Will Encompass the World
11 by giuliomagnifico | 0 comments on Hacker News.
11 by giuliomagnifico | 0 comments on Hacker News.
Sunday, November 3, 2024
Saturday, November 2, 2024
Friday, November 1, 2024
Subscribe to:
Posts (Atom)