Multimodal AI Agents

AI agents that integrate and process multiple types of data, such as text, images, audio, and video, to enable richer and more accurate interactions. These agents can perform tasks like image captioning, video analysis, and cross-modal search, offering versatile solutions for complex, real-world applications.

52 Agents
Resume Burger

Ivan Golovach

From résumé to interview – in no time.

AGiXT

JoshXT

AI platform advancing AGI via adaptive memory, smart features, & plugins for seamless AI task execution.

Open-source AI chat app with multimodal interactions, RAG, plugins, and multi-provider support.

Langroid

langroid

Open-source framework for building LLM-powered apps with multi-agent collaboration and NLP tasks.

OfficeIQ

Webuters Technology

OfficeIQ is an AI tool that automates tasks and boosts business efficiency.

Lumivar

Lumivar

Transform customer interactions with AI agents for the automotive industry.

Atomic Agents

Kenny Vaneetvelde

5.0 (1)

Atomic Agents is designed to be an extremely lightweight, modular, and maintainable agentic framework.

Thrax

Thrax AI

AI that understands your PDFs

EmbedAI

EmbedAI

Platform for creating custom AI chatbots powered by ChatGPT, trained on your data, and embeddable on your website.

Dify Ai

LangGenius, Inc.

Open-source platform for building and managing generative AI applications with ease.

Reka AI

Reka AI

Reka AI develops multimodal language models capable of processing text, images, video, and audio inputs.

A generalist multi-agent system for solving complex tasks with autonomous agents.