God, Love, News, Events, Entertainment, Amebo... All about bringing out the best in you.
Show HN: AnyCrawl v0.0.1-alpha.5 – custom user-agent and richer scraping API https://ift.tt/UKC95fW
## [0.0.1-alpha.5] - 2025-06-14

### Added

- Integrated AWS S3 storage support with new `S3` class and environment variables for seamless file uploads and retrievals.
- Introduced `FileController` for serving files from S3 or local storage with robust path validation and error handling.
- Added multiple content transformers (Screenshot, `HTMLTransformer`) improving HTML/Markdown extraction and screenshot generation.
- Extended scraping capabilities with new options: output `formats`, `timeout`, tag filtering, `wait_for`, retry strategy, viewport configuration, and custom user-agent support.
- Added Safe Search parameter to `SearchSchema` for filtered search results.
- Refactored engine architecture with a factory pattern and new core modules for configuration validation, data extraction, and job management.
- Implemented graceful shutdown handling for the API server and improved logging for uncaught exceptions / unhandled rejections.
- Added Jest configuration for API and library packages with ESM support and updated test scripts.
- Updated CI workflows to publish Docker images on version tags.
- Expanded README with detailed environment variable descriptions and API usage examples.

### Changed

- Refined error handling in `ScrapeController` and `JobManager`; failure responses now include structured error objects and HTTP status codes.
- Enhanced `BaseEngine` with explicit HTTP error checks and resilience improvements.
- Updated OpenAPI documentation to reflect new scraping parameters and error formats.
- Migrated key-value store name to environment configuration for greater flexibility.
- Enhanced per-request credit tracking in `ScrapeController` and extended the logging middleware to include credit usage.

### Fixed

- Improved job failure messages to include detailed error data, ensuring clearer debugging information.
- Minor documentation corrections and clarifications.
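To make the new scraping options a little more concrete, here is a minimal sketch of how a client might assemble a request body before sending it. Only `formats`, `timeout`, and `wait_for` are named in the changelog; every other field name (`user_agent`, `viewport`, `include_tags`, `exclude_tags`), the format values, the millisecond units, and the `buildScrapePayload` helper itself are assumptions made for illustration, not AnyCrawl's documented API. Check the project's README or OpenAPI documentation for the real parameter names.

```typescript
// Hypothetical request shape. Only `formats`, `timeout`, and `wait_for`
// appear in the changelog; all other field names are assumptions.
interface ScrapeOptions {
  url: string;
  formats?: string[];        // e.g. ["markdown", "html"] (assumed values)
  timeout?: number;          // assumed to be milliseconds
  wait_for?: number;         // assumed to be milliseconds
  user_agent?: string;       // hypothetical name for the custom user-agent option
  viewport?: { width: number; height: number }; // hypothetical shape
  include_tags?: string[];   // hypothetical name for tag filtering
  exclude_tags?: string[];   // hypothetical name for tag filtering
}

// Validate the options locally and drop unset fields, so the JSON body
// only carries what the caller actually specified.
function buildScrapePayload(options: ScrapeOptions): Record<string, unknown> {
  if (!/^https?:\/\//.test(options.url)) {
    throw new Error(`url must start with http:// or https://, got: ${options.url}`);
  }
  if (options.timeout !== undefined && options.timeout <= 0) {
    throw new Error("timeout must be a positive number");
  }
  if (options.wait_for !== undefined && options.wait_for < 0) {
    throw new Error("wait_for must not be negative");
  }

  const payload: Record<string, unknown> = {};
  for (const [key, value] of Object.entries(options)) {
    if (value !== undefined) {
      payload[key] = value;
    }
  }
  return payload;
}

// Example: request Markdown output with a custom user-agent.
const examplePayload = buildScrapePayload({
  url: "https://example.com",
  formats: ["markdown"],
  timeout: 30000,
  user_agent: "my-crawler/1.0",
});
console.log(JSON.stringify(examplePayload, null, 2));
```

Validating on the client side like this is optional. Per the changelog, failure responses now carry structured error objects and HTTP status codes, so a caller could equally send the request as-is and inspect the error that comes back.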
https://ift.tt/gZslB6b June 14, 2025 at 07:48AM