All Categories
Layers

Multimodal Agents

AI agents that process text, images, audio, and video· 35 agents

15
Top Score
12
Avg Score
0 of 35
Verified
21
Free / Freemium
🏆 ATH #2
🥇
LL
Llama 4

by Meta AI

15
score

Meta's open-source multimodal models. Llama 4 Scout (17B active, 16 experts, 10M context) and Maverick (17B active, 128 experts, 1M context). Released April ...

LayersMultimodal Agents52.05
🏆 ATH #2
🥈
GE
Gemini

by Google DeepMind

15
score

Google DeepMind's multimodal AI assistant. Gemini 2.5 Pro with native thinking, 1M token context, and tight integration across Google Workspace, Android, and Search.

LayersMultimodal AgentsFreemium56.05
🏆 ATH #2
🥉
HU
HuggingChat

by Hugging Face

15
score

Hugging Face's open-source chat UI for any model. Access Llama 4, DeepSeek, Mistral, Gemma, and 100+ open-weight models. Free, no API key required.

LayersMultimodal AgentsFree52.05
🏆 ATH #2
#4
QW
Qwen 3.5

by Alibaba Cloud / Tongyi

15
score

Alibaba's flagship open-source LLM. 235B MoE (22B active). Multilingual, strong on coding and math. Qwen3-Coder variant matches Claude Code on HumanEval.

LayersMultimodal Agents52.05
#5
AU
Autoresearch

by uditgoenka

14
score

Claude Autoresearch Skill — Autonomous goal-directed iteration for Claude Code. Inspired by Karpathy's autoresearch. Modify → Verify → Keep/Discard → Repeat ...

LayersMultimodal AgentsFree51.05
🏆 ATH #24
#6
AW
Awesome Openclaw Usecases Zh

by AlexAnys

14
score

🇨🇳 OpenClaw中文用例与案例大全 | 46个真实场景 | 国内特色 + 海外案例的国内适配 | 自动化办公·内容创作·运维·AI助理·知识管理 | 新手友好 | Chinese guide for OpenClaw AI agent use cases

LayersMultimodal AgentsFree51.1
🏆 ATH #24
#7
CO
Context7

by upstash

14
score

Context7 Platform -- Up-to-date code documentation for LLMs and AI code editors

LayersMultimodal AgentsFree52
🏆 ATH #24
#8
CO
CopilotKit

by CopilotKit

14
score

The Frontend Stack for Agents & Generative UI. React + Angular. Makers of the AG-UI Protocol

LayersMultimodal AgentsFree51.05
🏆 ATH #24
#9
ZE
Zeroshot

by covibes

14
score

Your autonomous engineering team in a CLI. Point Zeroshot at an issue, walk away, and return to production-grade code. Supports Claude Code, OpenAI Codex, Op...

LayersMultimodal AgentsFree50.1
🏆 ATH #24
#10
CL
Clawpanel

by qingchencloud

14
score

🦞 OpenClaw 可视化管理面板 — 内置 AI 助手(工具调用 + 图片识别 + 多模态),一键安装 | Visual management panel with built-in AI assistant (tool calling + vision + multimodal + i18n(11))

LayersMultimodal AgentsFree51.1
🏆 ATH #24
#11
ZC
Zcf

by UfoMiao

14
score

Zero-Config Code Flow for Claude code & Codex

LayersMultimodal AgentsFree50.1
🏆 ATH #24
#12
AI
AionUi

by iOfficeAI

14
score

Free, local, open-source 24/7 Cowork app and OpenClaw for Gemini CLI, Claude Code, Codex, OpenCode, Qwen Code, Goose CLI, Auggie, and more | 🌟 Star if you l...

LayersMultimodal AgentsFree51.05
🏆 ATH #24
#13
NE
Nexu

by nexu-io

14
score

The simplest desktop client for OpenClaw 🦞 — bridge your Agent to WeChat, Feishu, Slack & Discord in one click. Works with Claude Code, Codex & any LLM. BYO...

LayersMultimodal AgentsFree51.1
🏆 ATH #24
#14
N8
N8n

by n8n-io

14
score

Fair-code workflow automation platform with native AI capabilities. Combine visual building with custom code, self-host or cloud, 400+ integrations.

LayersMultimodal AgentsFree51.05
🏆 ATH #25
#15
AU
Autogen

by microsoft

14
score

A programming framework for agentic AI

LayersMultimodal AgentsFree50.15
🏆 ATH #24
#16
CL
Claude Code Book

by lintsinghua

14
score

《御舆:解码 Agent Harness》42万字拆解 AI Agent 的Harness骨架与神经 —— Claude Code 架构深度剖析,15 章从对话循环到构建你自己的 Agent Harness。在线阅读网站:

LayersMultimodal AgentsFreemium51.1
🏆 ATH #8
#17
SO
Sora 2

by OpenAI

12
score

OpenAI's second-generation video model. Cinema-quality 1080p video up to 60 seconds from text, image, or video. Physics simulation, precise camera control.

LayersMultimodal AgentsPaid51.05
#18
JU
Julius AI

by Julius AI

12
score

AI data analyst. Upload CSV, Excel, SQL databases — get visualizations, insights, and statistical analysis in plain English. No coding required.

LayersMultimodal AgentsFreemium49
#19
OB
Obviously AI

by Obviously AI

12
score

No-code predictive AI. Connect any data source, train an ML model in 2 minutes, and get predictions on churn, sales, fraud, and pricing without any code.

LayersMultimodal AgentsPaid51
#20
OP
OpenAI: GPT 5.4

by openai

10
score

GPT-5.4 is OpenAI’s latest frontier model, unifying the Codex and GPT lines into a single system. It features a 1M+ token context window (922K input, 128K ou...

LayersMultimodal AgentsUsage58.05
#21
OH
Oh My Pi

by can1357

10
score

⌥ AI Coding agent for the terminal — hash-anchored edits, optimized tool harness, LSP, Python, browser, subagents, and more

LayersMultimodal AgentsFree49.05
#22
QW
Qwen2.5 Coder 7B Instruct
10
score

Qwen2.5 Coder 7B Instruct — a text-generation model by undefined on HuggingFace. 2,522,695 downloads, 678 likes.

LayersMultimodal AgentsFree64.05
#23
OP
OpenAI: GPT 5.3 Codex

by openai

10
score

GPT-5.3-Codex is OpenAI’s most advanced agentic coding model, combining the frontier software engineering performance of GPT-5.2-Codex with the broader reaso...

LayersMultimodal AgentsUsage65.05
#24
QW
Qwen: Qwen3.5 122B A10B

by qwen

10
score

The Qwen3.5 122B-A10B native vision-language model is built on a hybrid architecture that integrates a linear attention mechanism with a sparse mixture-of-ex...

LayersMultimodal AgentsUsage58.05
#25
QW
Qwen: Qwen3.5 35B A3B

by qwen

10
score

The Qwen3.5 Series 35B-A3B is a native vision-language model designed with a hybrid architecture that integrates linear attention mechanisms and a sparse mix...

LayersMultimodal AgentsUsage55.05
#26
QW
Qwen: Qwen3.5 9B

by qwen

10
score

Qwen3.5-9B is a multimodal foundation model from the Qwen3.5 family, designed to deliver strong reasoning, coding, and visual understanding in an efficient 9...

LayersMultimodal AgentsUsage49
#27
AG
AgentGuide

by adongwanai

10
score

https://adongwanai.github.io/AgentGuide | AI Agent开发指南 | LangGraph实战 | 高级RAG | 转行大模型 | 大模型面试 | 算法工程师 | 面试题库 | 强化学习|数据合成

LayersMultimodal AgentsFreemium43.05
#28
AN
Anthropic: Claude Sonnet 4.6

by anthropic

10
score

Sonnet 4.6 is Anthropic's most capable Sonnet-class model yet, with frontier performance across coding, agents, and professional work. It excels at iterative...

LayersMultimodal AgentsUsage51
#29
QW
Qwen: Qwen3.5 Flash

by qwen

10
score

The Qwen3.5 native vision-language Flash models are built on a hybrid architecture that integrates a linear attention mechanism with a sparse mixture-of-expe...

LayersMultimodal AgentsUsage51
#30
QW
Qwen: Qwen3.5 Plus 2026 02 15

by qwen

10
score

The Qwen3.5 native vision-language series Plus models are built on a hybrid architecture that integrates linear attention mechanisms with sparse mixture-of-e...

LayersMultimodal AgentsUsage51
#31
GP
Gpt Engineer

by AntonOsika

10
score

CLI platform to experiment with codegen. Precursor to: https://lovable.dev

LayersMultimodal AgentsFree67.05
Hot
#32
NB
NBLM2PPTX

by laihenyi

10
score

Convert NotebookLM PDFs to PPTX with separated background images and editable text layers using Gemini AI

LayersMultimodal AgentsFree77.05
#33
QW
Qwen: Qwen3.5 27B

by qwen

10
score

The Qwen3.5 27B native vision-language Dense model incorporates a linear attention mechanism, delivering fast response times while balancing inference speed ...

LayersMultimodal AgentsUsage51
#34
KW
Kwaipilot: KAT Coder Pro V2

by kwaipilot

10
score

KAT-Coder-Pro V2 is the latest high-performance model in KwaiKAT’s KAT-Coder series, designed for complex enterprise-grade software engineering and SaaS inte...

LayersMultimodal AgentsUsage51
#35
AU
Auto Claude Code Research In Sleep

by wanshuiyin

10
score

ARIS ⚔️ (Auto-Research-In-Sleep) — Lightweight Markdown-only skills for autonomous ML research: cross-model review loops, idea discovery, and experiment auto...

LayersMultimodal AgentsFree51.05

Have a Multimodal Agents agent?

Submit it to appear alongside 35 others in this category.

Submit in Multimodal Agents