Google Cloud Vision API enables developers to integrate vision detection features into applications, including image labeling, face and landmark detection, optical character recognition (OCR), and explicit content tagging.
Tool to perform image detection and annotation for batch files in Google Cloud Vision. Supports PDF, TIFF, and GIF files. Extracts up to 5 frames (GIF) or pages (PDF/TIFF) from each file and performs detection for each image. Use when you need to analyze documents or multi-page images with features like text detection, label detection, face detection, or other Vision API capabilities.
Tool to run asynchronous image detection and annotation for a list of generic files (PDF, TIFF, GIF). Use when processing multi-page documents that may contain multiple images per page. Results are written to Google Cloud Storage and progress can be tracked via the returned operation name using VisionGetOperation.
Run image detection and annotation for a batch of images using Google Cloud Vision API. Performs various types of image analysis including face detection, landmark detection, logo detection, label detection, text detection (OCR), safe search detection, image properties, crop hints, web detection, product search, and object localization. Supports up to 16 images in a single batch request. Each image can have multiple feature types analyzed simultaneously.
Tool to run asynchronous image detection and annotation for a batch of images. Use when processing multiple images or large images that require longer processing time. Results are written to Google Cloud Storage as JSON files.
Tool to run image detection and annotation for a batch of images scoped to a specific project and location. Performs various types of image analysis including label detection, face detection, landmark detection, logo detection, OCR text detection, safe search detection, image properties, crop hints, web detection, product search, and object localization. Supports processing up to 16 images per request with regional endpoint routing (us, asia, eu). Use this when you need to analyze images with location-specific processing for content extraction, text recognition, object detection, face identification, or landmark/logo recognition.
Run image detection and annotation for a batch of images scoped to a specific project. This action performs various types of image analysis including label detection, face detection, landmark detection, logo detection, OCR text detection, safe search detection, and more. You can specify multiple detection types for each image and process up to 16 images per request. Use this when you need to analyze images for content, extract text, detect objects, or identify faces, landmarks, or logos. The action supports images from Google Cloud Storage, HTTP/HTTPS URLs, or base64-encoded content.