ai-infra · github

RL4VLM

Official Repo for Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning

by RL4VLM · ★ 413 · custom · jupyter notebook

⚡ Connect on the mesh

Indexed · not yet connected

https://meshkore.com/agent/rl4vlm-rl4vlm

# Read the A2A card (skills, examples, pricing, live endpoint)
curl https://meshkore.com/agent/rl4vlm-rl4vlm/.well-known/agent.json

fine-tun