Multimodal Agents
AI agents that process text, images, audio, and video· 48 agents
by [REDACTED]
OpenClaw in Slack that makes your company autonomous
by anthropic
Claude Opus 4.8 is Anthropic's most capable generally available model in the Opus family. It supports text, image, and file inputs with text output, with reasoning support and a 1M-token...
by ainative-studio
Cody CLI - AINative Studio's AI-powered coding assistant. Agents: point here to programmatically self-provision AINative services → https://ainative.studio/agent.md
by contrastcyber
MCP server with 53 security intelligence tools — CVE/KEV, MITRE ATLAS+D3FEND, Sigma detection rules, email security posture (SPF/DMARC), domain & web intel, threat intel.
by icons8community
🚀 Access MCP server SVG and PNG icons for your vibe-coding projects! 🎨 Icon Packs * 40,000+ icons * Endless creative possibilities! 🤖 Seamless Integration * Works with Windsurf, Claude Code, Cursor, and other AI coding tools 🔄 Easy Customization * Replace all icons with tr
by mohamed2m2018
Add an in-app AI support agent to React Native apps that understands UI, navigates screens, fills forms, and escalates to humans.
by blake365
Explore global geologic data to answer questions about bedrock, formations, ages, and stratigraphy. Retrieve units, columns, minerals, timescales, and definitions for any location to build accurate geological context. Generate geology map tiles for quick visualization of areas of
by pinkpixel-dev
Search the web and extract clean, readable text from webpages. Process multiple URLs at once to speed up research with reliable throttling and error handling. Quickly compile sources and summaries for briefs, reports, or competitive analysis.
by GitHub Actions
Formula WorkPaper runtime for Node.js services and agent tools with JSON persistence and formula readback.
by [REDACTED]
Generate and iterate UI screens with AI on a live canvas
by ucpchecker
A universal commerce gateway for AI agents to interact with UCP-enabled stores. Enables live product discovery, real-time catalog search, and checkout genera...
by smithery-ai
Provide real-time and forecast weather information for locations in the United States using natural language queries. Access current conditions, multi-day an...
by FaresYoussef94
A fully managed remote MCP server that provides up-to-date documentation, code samples, knowledge about the regional availability of AWS APIs and CloudFormat...
by linxule
Lotus Wisdom is a contemplative reasoning tool inspired by the Lotus Sutra. It guides AI through structured wisdom journeys for complex problems where logic ...
by [REDACTED]
Run one-person companies entirely with AI agents
by isdk
AI Agent Script is a framework for defining AI Agents, their properties, and behaviors for interactive conversations. This document provides an overview of t...
by node2flow
MCP server for Binance Global — the world's largest cryptocurrency exchange. 23 tools for market data, trading, orders, and account management. Features...
by [REDACTED]
Grow your store profits with agents that know how to sell
by alvbln
Alvin Bot — open-source, self-hosted autonomous AI agent on Telegram, Slack, Discord, WhatsApp, Signal, terminal & web. Built on the Claude Agent SDK with a ...
by gamzadongza
Extract tags from any Danbooru post and explore categories at a glance. Analyze character-specific tag frequencies to surface top traits and clothing pattern...
by Nekzus
Provide AI-powered real-time analysis and intelligence on NPM packages, including security, dependencies, performance, and quality metrics. Enable faster and...
by databutton
Build and deploy beautiful business apps effortlessly with our AI agent. Generate initial app plans and create a solid foundation for your projects using Rea...
by workos
Enterprise-ready authentication and user management. Manage organizations, users, SSO connections, directory sync, audit logs, fine-grained authorization, an...
by google_search_console
Google Search Console provides tools to monitor, maintain, and troubleshoot your site's presence in Google Search results.
by docfork
Search and retrieve documentation from GitHub repositories and the web to find technical answers quickly. Transform complex web pages into clean markdown for...
by googledocs
Google Docs is a cloud-based word processor with real-time collaboration, version history, and integration with other Google Workspace apps
by linear
Manage issues, projects, cycles, and docs in Linear from one place. Create, update, and comment on issues; organize labels, statuses, and teams; and search d...
by n1cklss
Tree-shakeable static models.dev catalog split by provider for TokenLens.
by isaacsight
Open-source terminal AI agent. 100+ specialist skills + audit-grade finance infra via @kernel.chat/kbot-finance. Content-addressed envelopes, hash-chained au...
by modellix
Search the Modellix knowledge base to quickly find relevant technical information, code examples, and API references. Retrieve implementation details and off...
by kkjdaniel
BGG MCP provides access to BoardGameGeek and a variety of board game related data through the Model Context Protocol. Enabling retrieval and filtering of boa...
by jon-ag46
AI-powered precious metals data. Live gold/silver/platinum/palladium spot prices, COMEX vault inventory, Stack Signal market intelligence, junk silver melt c...
by sfiorini
Search and browse videos, channels, and playlists to fetch titles, descriptions, stats, and durations. Retrieve multilingual, timestamped transcripts and se...
by EthanHenrickson
Enable your LLMs to perform accurate numerical calculations with a simple API. Leverage basic arithmetic and statistical functions to enhance your applicatio...
by clickhouse
Query ClickHouse Cloud — run SQL, list databases and tables, explore services and backups, inspect usage, run ClickPipes.
by clarityai
Access ESG and sustainability data for companies and portfolios. Screen investments against climate, social, and governance metrics.
by microsoft
The Microsoft Learn MCP Server is a remote MCP Server that enables clients like GitHub Copilot and other AI agents to bring trusted and up-to-date informatio...
by youtube
YouTube is a video-sharing platform with user-generated content, live streaming, and monetization opportunities, widely used for marketing, education, and en...
by googlecalendar
Schedule events, check availability, and manage calendars. Create meetings, set reminders, and coordinate across time zones.
by browserbase
Provides cloud browser automation capabilities using Stagehand and Browserbase, enabling LLMs to interact with web pages, take screenshots, and run parallel ...
by onesignal
OneSignal is a customer engagement platform that lets you send targeted push notifications, emails, SMS, and in-app messages, manage audiences, and track cam...
by notion
Search across your Notion workspace and connected sources to quickly find pages, databases, and users. View full page and database details for deeper context...
by github
Connect your AI agents to GitHub — manage repos, issues, PRs, workflows, and more
by wtf-just-happened
Analyze significant price movements in US stocks to provide clear explanations behind market shifts. Identify key drivers such as earnings reports, news even...
by outlook
Read and send emails, manage calendar events, and organize contacts. Search messages, handle attachments, and schedule meetings.
by wfmedia
Cognitive browser automation that thinks like your users—and helps AI agents navigate too. Simulate real user cognition with abandonment detection, constitut...
by djtony707
TITAN — Autonomous AI agent framework with self-improvement, multi-agent orchestration, 36 LLM providers, 16 channel adapters, GPU VRAM management, mesh netw...
by jkheadley
Persistent autonomy infrastructure for AI agents
Have a Multimodal Agents agent?
Submit it to appear alongside 48 others in this category.
Submit in Multimodal Agents →