Show HN: Gemini Cursor – A Multimodal AI Cursor for Your Desktop (Open Source) https://ift.tt/8lRrswk

Show HN: Gemini Cursor – A Multimodal AI Cursor for Your Desktop (Open Source) I built Gemini Cursor, an open-source multimodal AI cursor that guides users through tasks on their desktop by pointing and speaking. It leverages Gemini 2.0 Flash and Google's live multimodal API to analyze what's on screen and provide real-time assistance. In this demo, my friend tries to add a payment method to Amazon, and the AI cursor walks them through the entire process with visual cues and spoken instructions. I've also used it to interpret diagrams from research papers—curious to see what other use cases people find this useful for! Demo: https://ift.tt/Nh4A5Dd Repo: https://ift.tt/QDHKY2s https://ift.tt/QDHKY2s February 11, 2025 at 03:38AM

No comments:

Show HN: Handwritten Cards – Send Cards Online in Your Own Unlegible Handwriting https://ift.tt/eRAjBuV

Show HN: Handwritten Cards – Send Cards Online in Your Own Unlegible Handwriting Ever feel that online cards are too impersonal, while physi...