Multimodal AI AI Agents

AI agents that integrate and process multiple types of data, such as text, images, audio, and video, to enable richer and more accurate interactions. These agents can perform tasks like image captioning, video analysis, and cross-modal search, offering versatile solutions for complex, real-world applications.

56 Agents
AgentFi

AgentFi

AI-powered blockchain agents for decentralized finance.

Neos

Neos Labs Inc.

Decentralized platform combining AI and blockchain to advance global research.

Hermes 3

NousResearch

Open-source AI model with advanced long-term context and multi-turn conversation capabilities.

A TypeScript toolkit to simplify AI-driven app development with multiple model integrations.

Advanced AI for enhanced reasoning and multi-modal inputs with text-to-image generation capabilities.

Abacus AI

Abacus AI

Enterprise-grade AI platform for scalable, end-to-end machine learning and AI agent development.

OpenAI’s Operator automates software development, travel bookings, and other tasks.

Octonet AI

Octonet.ai

A decentralized platform providing scalable and cost-effective AI and machine learning solutions.

Sora

OpenAI

Sora generates photorealistic, cinematic videos from text or image prompts, revolutionizing creative video.

Together AI

Together AI

Comprehensive AI platform for training and deploying private models for enterprises.

Reface

Reface

AI tool for deepfake creation and real-time face swapping in videos.

NVIDIA’s AI tool creates real-time, voice-synced 3D facial animations.