What It Is
Polymage is a Python library I’m building to orchestrate multimodal AI agents across different platforms and models (Gemma, moondream, Flux, HiDream, and others). The goal: write an agent once, run it anywhere.
Supported Platforms
Local inference:
- Ollama – macOS, Windows, Linux
- LM Studio – macOS, Windows, Linux
- Draw Things – macOS only
Cloud inference:
- Groq – high-speed LLM inference
- Cloudflare Workers AI – edge-deployed models
- Together AI – open-source model hosting
- Hugging Face – model hub and inference API
Core Features
- Define agents with prompts plus multimodal inputs (image, audio, video, text)
- Run the same agent on multiple platforms and models to compare results directly
- Workflow orchestration via plain Python scripts or Apache Airflow
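To make the first two features concrete, here is a minimal sketch of the "define once, run on several platforms" idea as a plain Python script. It is not Polymage's actual API: `Agent`, `Backend`, `make_echo_backend`, and `run_everywhere` are hypothetical names, the model strings are placeholders, and the backends are stubs that make no network calls.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Tuple

# NOTE: every name below is hypothetical and only illustrates the idea;
# none of it is taken from Polymage itself.


@dataclass(frozen=True)
class Agent:
    """A prompt plus optional multimodal inputs, independent of any platform."""
    name: str
    prompt: str
    images: Tuple[str, ...] = ()  # file paths; audio or video could be added the same way


# A backend is any callable that takes an agent and a model name and returns text.
Backend = Callable[[Agent, str], str]


def make_echo_backend(platform: str) -> Backend:
    """Build a stand-in backend. A real one would call the platform's inference API."""
    def run(agent: Agent, model: str) -> str:
        return f"[{platform}/{model}] {agent.prompt} ({len(agent.images)} image(s))"
    return run


def run_everywhere(
    agent: Agent,
    targets: List[Tuple[str, str]],
    backends: Dict[str, Backend],
) -> Dict[Tuple[str, str], str]:
    """Run the same agent on each (platform, model) pair and collect the outputs."""
    results = {}
    for platform, model in targets:
        results[(platform, model)] = backends[platform](agent, model)
    return results


# Assumed platform keys and placeholder model names, for illustration only.
backends = {
    "ollama": make_echo_backend("ollama"),
    "groq": make_echo_backend("groq"),
}
agent = Agent(name="captioner", prompt="Describe this image.", images=("cat.png",))
targets = [("ollama", "gemma"), ("ollama", "moondream"), ("groq", "gemma")]

results = run_everywhere(agent, targets, backends)
for (platform, model), output in results.items():
    print(f"{platform:8} {model:10} {output}")
```

Because the agent holds no platform-specific state, comparing results comes down to iterating over (platform, model) pairs. Swapping a stub for a real client would not change the agent definition.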
Technical Stack
- Python core library
- Docker for reproducible environments
- Hermes Agent and Strix Agent frameworks for orchestration
- OpenCode for AI-assisted development
Status
Active development. I’m using it for personal projects and refining it through the EPFL Applied Data Science program. Two lessons so far: local LLM inference is trickier than it looks, because resource management matters more than the benchmarks suggest, and coordinating multiple agents is genuinely harder than running one.