Google Cloud Vision MCP Integration | AI Agent Tools

Google Cloud Vision

Google Cloud Vision API enables developers to integrate vision detection features into applications, including image labeling, face and landmark detection, optical character recognition (OCR), and explicit content tagging.

Completely Secure

Composio Managed

Users

80634

Last Updated

12h ago

Tools

Annotate Files With Vision API

Tool to perform image detection and annotation for a batch of files in Google Cloud Vision. Supports PDF, TIFF, and GIF files. Extracts up to 5 frames (GIF) or pages (PDF/TIFF) from each file and performs detection on each extracted image. Use when you need to analyze documents or multi-page images with features like text detection, label detection, face detection, or other Vision API capabilities.
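As a rough illustration of the limits described above, the sketch below builds one entry for a file annotation request and rejects unsupported file types or more than 5 pages. The field names (`inputConfig`, `gcsSource`, `mimeType`, `features`, `pages`) follow the public Vision REST API; the tool's own parameter names are not shown on this page and may differ. The helper name and the bucket path are hypothetical.

```python
ALLOWED_MIME_TYPES = {"application/pdf", "image/tiff", "image/gif"}
MAX_PAGES_PER_FILE = 5  # limit stated in the tool description


def build_file_annotate_request(gcs_uri, mime_type, feature_types, pages=None):
    """Build one entry for the `requests` list of a file annotation call.

    Hypothetical helper: it only assembles a dict and sends nothing.
    """
    if mime_type not in ALLOWED_MIME_TYPES:
        raise ValueError(f"unsupported MIME type: {mime_type}")
    pages = list(pages or [])
    if len(pages) > MAX_PAGES_PER_FILE:
        raise ValueError(f"at most {MAX_PAGES_PER_FILE} pages per file")
    request = {
        "inputConfig": {"gcsSource": {"uri": gcs_uri}, "mimeType": mime_type},
        "features": [{"type": t} for t in feature_types],
    }
    if pages:
        # Only include `pages` when the caller selected specific pages.
        request["pages"] = pages
    return request


# Example: OCR on the first three pages of a PDF (hypothetical bucket).
body = {
    "requests": [
        build_file_annotate_request(
            "gs://example-bucket/report.pdf",
            "application/pdf",
            ["DOCUMENT_TEXT_DETECTION"],
            pages=[1, 2, 3],
        )
    ]
}
```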

Async Batch Annotate Files

Tool to run asynchronous image detection and annotation for a list of generic files (PDF, TIFF, GIF). Use when processing multi-page documents that may contain multiple images per page. Results are written to Google Cloud Storage, and progress can be tracked by passing the returned operation name to VisionGetOperation.
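Tracking progress by operation name usually means polling until the operation reports completion. The sketch below assumes the operation is returned as a dict shaped like a standard Google long-running operation (`done`, `error`, `response`). `get_operation` is a hypothetical callable standing in for whatever invokes VisionGetOperation in your setup.

```python
import time


def wait_for_operation(get_operation, name, interval_s=5.0, timeout_s=600.0,
                       sleep=time.sleep):
    """Poll `get_operation(name)` until the operation is done.

    Returns the operation's `response` dict, raises RuntimeError if the
    operation finished with an error, or TimeoutError if it never finishes.
    """
    deadline = time.monotonic() + timeout_s
    while True:
        op = get_operation(name)
        if op.get("done"):
            if "error" in op:
                raise RuntimeError(f"operation {name} failed: {op['error']}")
            return op.get("response", {})
        if time.monotonic() >= deadline:
            raise TimeoutError(f"operation {name} not done after {timeout_s}s")
        sleep(interval_s)
```

Passing `sleep` as a parameter keeps the helper easy to exercise with a fake operation source, without waiting in real time.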

Annotate Images

Run image detection and annotation for a batch of images using Google Cloud Vision API. Performs various types of image analysis including face detection, landmark detection, logo detection, label detection, text detection (OCR), safe search detection, image properties, crop hints, web detection, product search, and object localization. Supports up to 16 images in a single batch request. Each image can have multiple feature types analyzed simultaneously.
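Because a single request is capped at 16 images, a larger set has to be split into several requests. The following is a minimal sketch of that chunking, assuming the request body shape of the public Vision REST API (`requests`, `image.source.imageUri`, `features`); the function name and URIs are made up for illustration.

```python
MAX_IMAGES_PER_REQUEST = 16  # batch limit stated in the tool description


def build_batches(image_uris, feature_types, max_results=10):
    """Split image URIs into request bodies of at most 16 images each.

    Every image gets the same list of feature types, so several analyses
    run on each image within a single request.
    """
    requests = [
        {
            "image": {"source": {"imageUri": uri}},
            "features": [
                {"type": t, "maxResults": max_results} for t in feature_types
            ],
        }
        for uri in image_uris
    ]
    return [
        {"requests": requests[i:i + MAX_IMAGES_PER_REQUEST]}
        for i in range(0, len(requests), MAX_IMAGES_PER_REQUEST)
    ]


# Example: 40 hypothetical images, labels and OCR on each.
uris = [f"gs://example-bucket/img-{n}.jpg" for n in range(40)]
batches = build_batches(uris, ["LABEL_DETECTION", "TEXT_DETECTION"])
print([len(b["requests"]) for b in batches])  # prints [16, 16, 8]
```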

Annotate Images Async Batch

Tool to run asynchronous image detection and annotation for a batch of images. Use when processing multiple images or large images that require longer processing time. Results are written to Google Cloud Storage as JSON files.
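The asynchronous variant needs a Cloud Storage destination for its JSON output in addition to the image requests. The sketch below assembles such a body, assuming the `outputConfig.gcsDestination.uri` and `outputConfig.batchSize` fields of the public Vision REST API, where `batchSize` is understood to accept values from 1 to 100. The helper name and bucket paths are hypothetical.

```python
def build_async_images_body(image_uris, feature_types, output_prefix,
                            batch_size=20):
    """Assemble a request body for asynchronous batch image annotation.

    `output_prefix` is the Cloud Storage location where JSON results are
    written; `batch_size` is the number of responses per output file.
    """
    if not output_prefix.startswith("gs://"):
        raise ValueError("output_prefix must be a gs:// URI")
    if not 1 <= batch_size <= 100:
        raise ValueError("batch_size must be between 1 and 100")
    return {
        "requests": [
            {
                "image": {"source": {"imageUri": uri}},
                "features": [{"type": t} for t in feature_types],
            }
            for uri in image_uris
        ],
        "outputConfig": {
            "gcsDestination": {"uri": output_prefix},
            "batchSize": batch_size,
        },
    }


# Example with hypothetical buckets.
async_body = build_async_images_body(
    ["gs://example-bucket/scan-1.png", "gs://example-bucket/scan-2.png"],
    ["OBJECT_LOCALIZATION"],
    "gs://example-results/vision/",
    batch_size=2,
)
```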

Annotate Location Images

Tool to run image detection and annotation for a batch of images scoped to a specific project and location. Performs various types of image analysis including label detection, face detection, landmark detection, logo detection, OCR text detection, safe search detection, image properties, crop hints, web detection, product search, and object localization. Supports processing up to 16 images per request with regional endpoint routing (us, asia, eu). Use this when you need to analyze images with location-specific processing for content extraction, text recognition, object detection, face identification, or landmark/logo recognition.
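Scoping a call to a project and location comes down to putting both into the resource path. Here is a small sketch that validates the location against the three regions listed above and formats the path in the style of the public Vision REST API. The function name and project ID are hypothetical, and the regional hostname itself is left out because it is not stated on this page.

```python
SUPPORTED_LOCATIONS = ("us", "asia", "eu")  # regions listed in the description


def location_annotate_path(project_id, location):
    """Return the REST path for a project- and location-scoped annotate call."""
    if location not in SUPPORTED_LOCATIONS:
        raise ValueError(
            f"location must be one of {SUPPORTED_LOCATIONS}, got {location!r}"
        )
    parent = f"projects/{project_id}/locations/{location}"
    return f"v1/{parent}/images:annotate"


print(location_annotate_path("example-project", "eu"))
# prints v1/projects/example-project/locations/eu/images:annotate
```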

Annotate Project Images

Run image detection and annotation for a batch of images scoped to a specific project. This action performs various types of image analysis including label detection, face detection, landmark detection, logo detection, OCR text detection, safe search detection, and more. You can specify multiple detection types for each image and process up to 16 images per request. Use this when you need to analyze images for content, extract text, detect objects, or identify faces, landmarks, or logos. The action supports images from Google Cloud Storage, HTTP/HTTPS URLs, or base64-encoded content.
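The three supported image sources map onto different fields of the image object. The sketch below picks the field from the kind of input, assuming the `content`, `source.gcsImageUri`, and `source.imageUri` fields of the public Vision REST API. `to_image` is a hypothetical helper, not part of the tool.

```python
import base64


def to_image(source):
    """Convert raw bytes, a gs:// URI, or an HTTP(S) URL into an image object."""
    if isinstance(source, (bytes, bytearray)):
        # Inline image data is sent as base64-encoded text.
        return {"content": base64.b64encode(bytes(source)).decode("ascii")}
    if source.startswith("gs://"):
        return {"source": {"gcsImageUri": source}}
    if source.startswith(("http://", "https://")):
        return {"source": {"imageUri": source}}
    raise ValueError(f"unsupported image source: {source!r}")


# Example: one request mixing all three source types (hypothetical values).
project_body = {
    "requests": [
        {"image": to_image(src), "features": [{"type": "LABEL_DETECTION"}]}
        for src in (
            "gs://example-bucket/photo.jpg",
            "https://example.com/photo.jpg",
            b"abc",  # stand-in for real image bytes
        )
    ]
}
```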
