What It Is

Polymage is a Python library I’m building to orchestrate multimodal AI agents across different platforms and models (Gemma, moondream, Flux, HiDream, and others). The goal: write an agent once, run it anywhere.

Supported Platforms

Local inference:

  • Ollama – macOS, Windows, Linux
  • LM Studio – macOS, Windows, Linux
  • DrawThings – macOS only

Cloud inference:

  • Groq – high-speed LLM inference
  • Cloudflare Workers AI – edge-deployed models
  • Together AI – open-source model hosting
  • HuggingFace – model hub and inference API

Core Features

  • Define agents with prompts plus multimodal inputs (image, audio, video, text)
  • Run the same agent on multiple platforms and models to compare results directly
  • Workflow orchestration via plain Python scripts or Apache Airflow

Technical Stack

  • Python core library
  • Docker for reproducible environments
  • Hermes Agent and Strix Agent frameworks for orchestration
  • OpenCode for AI-assisted development

Status

Active development. I’m using it for personal projects and refining it through the EPFL Applied Data Science program. Local LLM inference is trickier than it looks – resource management matters more than the benchmarks suggest. And coordinating multiple agents is genuinely harder than running one.