God, Love, News, Event, Entertainment, Amebo,..... All about Bringing out the best in you...
Show HN: Yapit – PDF and webpage reader with TTS that doesn't suck https://ift.tt/soeicjg
Yapit converts PDFs and web pages to audio, with a vision-LLM pipeline that handles math and complex layout instead of garbling them.

I built it because I read a lot of papers and content online, but drift off after two paragraphs. Listening while following along keeps me focused and lowers the bar to actually start.

Every TTS tool I tried broke on complex formatting. Papers with math, citations, figure references, page numbers in the middle of sentences. You either get garbled output or you're listening to raw LaTeX.

Yapit converts everything to markdown as a common format. For web pages, defuddle ( https://ift.tt/IL8iVhH ) handles the extraction and strips clutter from web pages, presenting the main article content in a clean, consistent format. For PDFs, a vision LLM rewrites each page into markdown with annotation tags that separate what you see from what gets read aloud. Math is rendered visually but gets spoken alt text. Citations like "[13]" or "(Schmidhuber, 1970)" are silently displayed. Page numbers and headers are removed entirely.

Both extraction and audio are cached by content hash, so the same content is never processed or synthesized twice.

Self-hosting works with any OpenAI-compatible TTS server (vLLM-Omni, ...) and any OpenAI-compatible vision model for PDF extraction:

    git clone --depth 1 https://ift.tt/VKbjp8x && cd yapit
    cp .env.selfhost.example .env.selfhost
    make self-host

Kokoro TTS also runs in the browser via WebGPU on desktop.

Try it on Attention Is All You Need (all voices cached, no account needed): https://ift.tt/xgfkyQN...

Or paste any URL: https://ift.tt/3PReqQ8 https://ift.tt/S40qcGK...

GitHub: https://ift.tt/SWTZbaz (AGPL-3)

April 6, 2026 at 02:28AM
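To make the caching idea from the post concrete, here is a minimal sketch of a cache keyed by content hash. It is not Yapit's actual code: the names `content_key`, `ContentCache`, and `fake_synthesize` are hypothetical, SHA-256 is an assumed choice of hash, and the in-memory dict stands in for whatever storage the real project uses.

```python
import hashlib
from typing import Callable, Dict


def content_key(data: bytes) -> str:
    """Return a hex SHA-256 digest that identifies the content itself."""
    return hashlib.sha256(data).hexdigest()


class ContentCache:
    """In-memory cache keyed by content hash (hypothetical, for illustration)."""

    def __init__(self) -> None:
        self._store: Dict[str, bytes] = {}
        self.misses = 0  # counts how often the expensive step actually ran

    def get_or_compute(self, data: bytes, compute: Callable[[bytes], bytes]) -> bytes:
        key = content_key(data)
        if key not in self._store:
            # First time this exact content is seen: run the expensive step once.
            self.misses += 1
            self._store[key] = compute(data)
        return self._store[key]


def fake_synthesize(text: bytes) -> bytes:
    # Stand-in for a real TTS call (assumption); just tags the input bytes.
    return b"AUDIO:" + text
```

Because the key depends only on the bytes, two users submitting the same paragraph would hit the same cache entry, which is presumably what lets the demo serve pre-cached voices without an account.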