Show HN: Docsumo's OCR Benchmark Report – Surpassing Mistral and Landing AI https://ift.tt/dK1W90T

Show HN: Docsumo's OCR Benchmark Report – Surpassing Mistral and Landing AI

We recently conducted an in-depth benchmark comparing Docsumo's proprietary OCR technology against Mistral OCR and Landing AI's Agentic Document Extraction. Our objective was to evaluate their performance in real-world document processing tasks, especially with complex layouts and low-quality scans.

Key Findings:

- Accuracy: Docsumo's OCR demonstrated higher precision in text extraction across various document types, including invoices and bank statements.
- Layout Preservation: Our technology maintained the original structure of documents more effectively, ensuring better usability of extracted data.
- Processing Speed: Docsumo achieved faster processing times, making it more suitable for high-volume document processing tasks.

To ensure transparency and reproducibility, we've made the benchmark results publicly accessible. You can explore side-by-side outputs, accuracy scores, and layout comparisons here: https://ift.tt/TM7pAOu

For a comprehensive breakdown of our methodology and detailed findings, please refer to our full report: [Insert blog link]

We invite the community to review our findings and share insights on the readiness of generative OCR tools for production environments. Are they truly up to the task?

https://ift.tt/kcHauB1 April 3, 2025 at 03:41AM

Show HN: Nix Ninja – open-source Ninja-compatible build system for Nix https://ift.tt/tlKu9si

Show HN: Nix Ninja – open-source Ninja-compatible build system for Nix

Hello, this is Edgar! We are open sourcing nix-ninja, an incremental build system for Nix that leverages dynamic derivations.

For those unfamiliar, Nix is a powerful package manager and build system that ensures reproducible builds via strict dependency isolation. However, one of its limitations has been that it builds packages as monolithic units: if you change one source file, Nix rebuilds the entire package from scratch.

nix-ninja solves this by bringing compilation-unit-level granularity to Nix builds. It targets ninja build files, a format for describing build graphs that is output by popular build systems like CMake and meson. This means that when you modify a single source file, only the affected compilation units need to be rebuilt, significantly reducing build times.

We're excited because this gives Nix fine-grained caching with early cutoff optimisation (see the Build Systems à la Carte paper). Combined with remote build farms like Nixbuild.net, this moves Nix into the incremental cloud build systems arena, alongside Google's Bazel and Meta's Buck2.

Dynamic derivations and content-addressed derivations are still experimental features in Nix, so we're hoping to accelerate their maturation by providing a compelling use-case and implementation. Our north star is to have Hydra (nixpkgs' CI runner) support incremental compilation in nixpkgs for slow builds like LLVM.

nix-ninja can compile Nix itself today, but we're still very early in its development. Given community interest in dynamic derivations and incremental compilation in Nix, we decided to open source it in a pre-alpha state to involve the community in its design.

Please take a look. We'll be available in the comments to answer any questions: https://ift.tt/y4uTxWm

https://ift.tt/y4uTxWm April 3, 2025 at 12:47AM
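The early cutoff idea mentioned in the nix-ninja post can be illustrated with a small Python sketch. This is not nix-ninja's implementation and does not use Nix at all: the `build` function, the in-memory `cache` dict, and the toy `compile_unit` and `link` steps are all hypothetical stand-ins. It only shows the principle that a step is keyed by the content of its inputs, so a rebuilt unit whose output is unchanged does not force downstream steps to run again.

```python
import hashlib


def digest(text: str) -> str:
    """Content hash used to build cache keys."""
    return hashlib.sha256(text.encode()).hexdigest()


def build(steps, sources, cache):
    """Run steps in order; return (outputs, names of steps that actually ran).

    steps: list of (name, dependency names, function), already in dependency order.
    cache: dict mapping step name -> (input key, output), kept between builds.
    """
    outputs = dict(sources)
    ran = []
    for name, deps, fn in steps:
        inputs = [outputs[d] for d in deps]
        # The key depends only on the *content* of the inputs.
        key = digest("".join(digest(i) for i in inputs))
        hit = cache.get(name)
        if hit is not None and hit[0] == key:
            outputs[name] = hit[1]  # inputs unchanged: reuse the cached output
            continue
        outputs[name] = fn(*inputs)
        cache[name] = (key, outputs[name])
        ran.append(name)
    return outputs, ran


def compile_unit(src: str) -> str:
    """Toy 'compiler' (assumption): drops a trailing // comment."""
    return src.split("//")[0].strip()


def link(*objs: str) -> str:
    """Toy 'linker' (assumption): joins the compiled units."""
    return "\n".join(objs)


STEPS = [
    ("a.o", ["a.c"], compile_unit),
    ("b.o", ["b.c"], compile_unit),
    ("app", ["a.o", "b.o"], link),
]

cache = {}
_, first = build(STEPS, {"a.c": "int a; // v1", "b.c": "int b;"}, cache)
_, second = build(STEPS, {"a.c": "int a; // v2", "b.c": "int b;"}, cache)
_, third = build(STEPS, {"a.c": "int a2;", "b.c": "int b;"}, cache)
print(first, second, third)
```

In the second build only the comment in `a.c` changes, so `a.o` is recompiled but produces the same output and the link step is skipped. In the third build the compiled output really changes, so the link step runs again.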

Show HN: Zxc – Rust TLS proxy with tmux and Vim as UI, BurpSuite alternative https://ift.tt/izAbX48

Show HN: Zxc – Rust TLS proxy with tmux and Vim as UI, BurpSuite alternative

Ditch Burp Suite’s bloat for zxc, a Rust-built, terminal-based proxy that uses tmux and Vim to intercept HTTP/S and WebSocket traffic. It captures requests for debugging, security testing, or tweaking, and stays fast and lean.

# Key Features

- Disk Wizardry: Stashes massive datasets on disk (100k+ entries) without breaking a sweat.
- Addons: Boost your workflow with default support for ffuf and sqlmap, or craft your own addons for extra fun.
- Buffer Tweaks: Edit variables in a popup (e.g., b:host, b:scheme) in Interceptor/Repeater to twist requests.
- Config Control: TOML files for global ($HOME/.config/zxc/config.toml) or per-session tweaks.
- Content Filtering: Skip requests based on the request Content-Type header.
- Domain Filtering: Selectively include or exclude specific domains, offering granular control over which traffic is proxied or relayed, with support for wildcards like *.example.com.
- Edit Config on the Fly: Tweak session settings live from History in a popup; changes hit instantly, or refresh manually if the config was edited outside.
- Encoding Tricks: Base64 or URL encode/decode in Visual mode.
- Extended Attributes: `.req` files are automatically tagged with critical metadata (e.g., user.host, user.http), which lets external tools like scripts or analyzers work with them outside the proxy.
- Extension Filtering: Skip requests based on the extension of the requested content (`.mp3`, `.mp4`, etc.).
- History Display Filters: Filter History logs by host, URI, or status code using Vim regex.
- History Window: View and filter all traffic in real-time.
- Interception Queue: Manage pending requests and responses in real-time: view the queue with scheme and host details, then forward, drop, or tweak them as they pile up in the Interceptor window.
- Malformed Requests: Custom HTTP/1.1 parser for sending quirky, security-testing requests.
- Repeater Window: Resend and tweak HTTP or WebSocket requests with ease.
- Request Sharing: Share requests freely between windows for seamless tweaking and testing.
- Search Superpowers: Search requests or responses and add to Vim’s quickfix/location lists.
- Session Management: Create named sessions and attach to older sessions to resume work seamlessly.
- Traffic Interception: Edit requests and responses live in Vim.
- WebSocket History: A clear, organized history view of all WebSocket traffic with `.whis` files for a full overview, or dive into single-session details with `.wsess` files.
- WebSocket: Proxy and replay WebSocket traffic.

For the complete list of features and screenshots, refer to the repo: https://ift.tt/hrPcWBH

https://ift.tt/hrPcWBH April 3, 2025 at 02:41AM
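The wildcard domain filtering described in the zxc feature list could work roughly as in the Python sketch below. This is an illustration only: zxc itself is written in Rust, and the function name `should_proxy`, the `include`/`exclude` parameters, and the rule that exclusions win over inclusions are assumptions, not zxc's actual configuration or matching semantics.

```python
from fnmatch import fnmatchcase


def should_proxy(host, include=(), exclude=()):
    """Decide whether traffic for `host` should be proxied.

    Assumed rules (hypothetical, not taken from zxc):
    - an exclude pattern always wins;
    - if an include list is given, the host must match one of its patterns;
    - with no include list, everything not excluded is proxied.
    """
    host = host.lower().rstrip(".")
    if any(fnmatchcase(host, pattern.lower()) for pattern in exclude):
        return False
    if include:
        return any(fnmatchcase(host, pattern.lower()) for pattern in include)
    return True


print(should_proxy("api.example.com", include=["*.example.com"]))
print(should_proxy("ads.example.com", include=["*.example.com"], exclude=["ads.*"]))
```

Note that with shell-style matching, `*.example.com` matches `api.example.com` and `a.b.example.com` but not the bare `example.com`; a real tool may choose different behaviour for that case.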

Show HN: Open-source AI text editor that runs on your own hardware https://ift.tt/1r2vPua

Show HN: Open-source AI text editor that runs on your own hardware https://manzoni.app/ April 2, 2025 at 12:13AM

Show HN: Pressure – a simple 2D board game in JavaScript https://ift.tt/5Aq6MxK

Show HN: Pressure – a simple 2D board game in JavaScript

I'm a fan of minimalistic design and I tried to make an "easy to learn, hard to master" 2D board game. Pressure is played on a 5×5 grid with 12 tokens total. The key mechanic is a "push" feature where tokens can displace entire lines of other tokens if there's an empty space at the end of the line. When opponent tokens are pushed, they become inactive for one turn. Capture occurs when tokens are surrounded on all four sides.

After analyzing gameplay data, I found Pressure has an average branching factor of 11.5, which is higher than Othello (10) and Checkers (2.8), though lower than Chess (35). Its game tree complexity is approximately 10^57, placing it remarkably close to Othello (10^58) and much higher than Checkers (10^40), despite the smaller board size.

The state space complexity appears to be in the range of 10^20, comparable to Checkers, which is notable given Pressure uses a 5×5 grid versus Checkers' 8×8 board. Chess and Othello have higher state spaces at 10^44 and 10^28 respectively.

What's computationally interesting is how the push mechanic generates this complexity. About 27.5% of all moves involve pushing, which creates a multiplicative effect on the decision space. Each push can reposition multiple tokens simultaneously and create cascading positional consequences beyond what the compact board size would suggest.

I've found the average game length is approximately 50 moves, similar to Othello (58) but shorter than Chess (70) and Checkers (70).

Let me know what you think about the game.

https://ift.tt/pew8dHo April 2, 2025 at 01:57AM
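The push mechanic from the Pressure post can be sketched in a few lines of Python. The game itself is written in JavaScript and its exact rules are not given here, so this is one possible reading: a token moves one cell, any contiguous run of tokens directly ahead of it shifts one cell too, and the move is only legal if the cell just past the run is an empty cell on the board. The board representation, the `push` function, and the choice to report only pushed opponent tokens are all assumptions.

```python
SIZE = 5  # Pressure is played on a 5x5 grid


def push(board, row, col, d_row, d_col):
    """Move the token at (row, col) one step, shifting tokens lined up ahead of it.

    board: SIZE x SIZE list of lists holding "A", "B", or None (empty).
    Returns (new_board, pushed_opponents) or None if the move is blocked.
    pushed_opponents lists the new positions of opponent tokens that were
    pushed (in the post, these become inactive for one turn).
    """
    mover = board[row][col]
    if mover is None:
        return None
    # Collect the contiguous run of tokens starting at the mover.
    run = []
    r, c = row, col
    while 0 <= r < SIZE and 0 <= c < SIZE and board[r][c] is not None:
        run.append((r, c))
        r, c = r + d_row, c + d_col
    # (r, c) is the cell just past the run; it must be on the board (and is then empty).
    if not (0 <= r < SIZE and 0 <= c < SIZE):
        return None
    new_board = [line[:] for line in board]
    pushed_opponents = []
    # Shift from the far end backwards so no token is overwritten.
    for rr, cc in reversed(run):
        token = board[rr][cc]
        new_board[rr + d_row][cc + d_col] = token
        new_board[rr][cc] = None
        if (rr, cc) != (row, col) and token != mover:
            pushed_opponents.append((rr + d_row, cc + d_col))
    return new_board, pushed_opponents


board = [[None] * SIZE for _ in range(SIZE)]
board[2][:3] = ["A", "B", "B"]
result = push(board, 2, 0, 0, 1)  # "A" pushes two "B" tokens to the right
print(result[0][2], result[1])
```

Under these assumptions a single move can relocate several tokens at once, which is the effect the post credits for the game's complexity.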

Show HN: Share what you dreamt about and see if others had the same one https://ift.tt/8rdDcvZ

Show HN: Share what you dreamt about and see if others had the same one Share what you dreamt about and get notified when others also had the same dream. https://ift.tt/srcVS9X April 1, 2025 at 11:15PM

Show HN: Await-Tree – Visualize Async Rust Task Execution in Real-Time https://ift.tt/gkHM1yQ

Show HN: Await-Tree – Visualize Async Rust Task Execution in Real-Time https://ift.tt/vg5OULF April 1, 2025 at 10:46PM
