Multimodal model and media tool capabilities
Multimodal Model Tools
Multimodal capabilities are provided by a combination of built-in backend tools and external tool services.
Common providers and modules live in monkey-tools-third-party-api. These include the image, video, visual-generation, and media modules, as well as provider-specific adapters for Fal, Volcengine Visual, Runway, Tripo, Google Gemini, Vertex AI, and OpenAI.
Usage Pattern
- Deploy the tool service.
- Import its manifest.json into Studio.
- Add the corresponding tool node to a workflow or agent.
- Store generated files in the configured object storage or asset system.
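
Before importing a manifest into Studio, it can help to confirm that the deployed tool service actually returns one. The following is a minimal sketch, not part of the product: it assumes the service exposes its manifest at `/manifest.json` under its base URL, and the helper names (`manifest_url`, `parse_manifest`, `fetch_manifest`) are hypothetical. It only checks that the response is a JSON object and makes no claims about the manifest's fields.

```python
import json
import urllib.request
from urllib.parse import urljoin


def manifest_url(base_url: str) -> str:
    """Build the manifest URL for a tool service.

    Assumption: the manifest is served at <base_url>/manifest.json.
    """
    return urljoin(base_url.rstrip("/") + "/", "manifest.json")


def parse_manifest(raw: str) -> dict:
    """Parse manifest text and confirm it is a JSON object."""
    data = json.loads(raw)
    if not isinstance(data, dict):
        raise ValueError("manifest.json should contain a JSON object")
    return data


def fetch_manifest(base_url: str, timeout: float = 10.0) -> dict:
    """Download and parse the manifest from a running tool service."""
    with urllib.request.urlopen(manifest_url(base_url), timeout=timeout) as resp:
        return parse_manifest(resp.read().decode("utf-8"))
```

For a service running locally, `fetch_manifest("http://localhost:3000")` would request `http://localhost:3000/manifest.json`; the host and port here are placeholders for wherever the tool service is deployed.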