Stop Coding Alone! Build an 'Opus + M2.7' AI Expert Team

May 11, 2026

🚨 Get 12% off any Token Plan tier: https://platform.minimax.io/subscribe/coding-plan?code=9PuYDYrfpN&source=link

Resources:
- M2.7 model card: https://hf.co/MiniMaxAI/MiniMax-M2.7
- M2.7 blog post: https://www.minimax.io/news/minimax-m27-en
- MiniMax Token Plan: https://platform.minimax.io/subscribe/token-plan
- Cache Commander (the repo I demo): https://github.com/juliensimon/cache-commander

In this video, I run MiniMax M2.7, an open-weight ~228B-parameter MoE model, as my coding model inside Claude Code. I show the Anthropic-Messages-compatible setup, then demo MiniMax's recommended Opus → M2.7 hybrid workflow on a real Rust project (my cache-commander project): running a full code and test review, and sending a PR.

⭐️⭐️⭐️ More content on Substack at https://www.airealist.ai ⭐️⭐️⭐️

Transcript

Hi everybody, Julien here. Today I'm trying out MiniMax M2.7, an open-weight model from China, as my coding model inside Claude Code. I'll show you the setup, I'll run a full code review on one of my open-source projects, and I guess we'll send a pull request in the end. Let's get started.

If you want to know more about MiniMax M2.7, I recommend reading the blog post from MiniMax, link in the video description. But because the model is open weight, we can also find a lot of good information on the Hugging Face page. Let's take a look. M2.7 is open weight, and it's a mixture-of-experts model with 256 experts and 8 active experts per token. That's about 228 billion parameters in total and about 10 billion active parameters per token. Weights are natively FP8. So we see the model is definitely up there with the best closed models. And well, we'll see how it performs on my code base.

In this demo, I'm using the Plus High Speed plan. Plus High Speed is a Token Plan tier that costs $40 a month or $400 a year. So here's what you actually get on Plus High Speed. Requests are routed through M2.7 high speed, which is a faster variant of the model than the standard tier. And on top of that, you get 9,000 characters a day for speech and 100 images a day for image generation. So if you want to play with the multimodal stack in MiniMax, you don't have to pay extra for it. It's included in the plan. So the good news is, because you're watching this video, you can get 12% off any Token Plan tier with the link in the video description.

And then we have the Anthropic Messages compatible endpoint, which is a drop-in replacement for the Anthropic endpoint. So we basically just point Claude Code to a different URL, with a different key. When you sign up for your plan, you can generate an API key, and all you have to do is edit your Claude Code settings, the settings.json file, and use those exact lines here, and of course just replace the key with your own key. And that's all there is to it. I have to say, during my session, it worked flawlessly.
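The exact settings.json lines are shown on screen and are not captured in this transcript, so here is only a minimal sketch of what such an override could look like. It assumes Claude Code reads an `env` block from settings.json and honors the `ANTHROPIC_BASE_URL`, `ANTHROPIC_AUTH_TOKEN`, and `ANTHROPIC_MODEL` variables. The base URL, key, and model identifier below are placeholders, not values from the video; take the real ones from the MiniMax documentation.

```python
import json
from pathlib import Path


def build_settings(base_url: str, api_key: str, model: str) -> dict:
    """Return a settings.json payload that overrides endpoint, key, and model.

    Assumption: Claude Code applies the variables listed under "env".
    """
    return {
        "env": {
            "ANTHROPIC_BASE_URL": base_url,
            "ANTHROPIC_AUTH_TOKEN": api_key,
            "ANTHROPIC_MODEL": model,
        }
    }


def write_settings(config_dir: Path, settings: dict) -> Path:
    """Write the payload to <config_dir>/settings.json and return the path."""
    config_dir.mkdir(parents=True, exist_ok=True)
    path = config_dir / "settings.json"
    path.write_text(json.dumps(settings, indent=2) + "\n", encoding="utf-8")
    return path


if __name__ == "__main__":
    settings = build_settings(
        base_url="https://example.invalid/anthropic",  # placeholder URL
        api_key="YOUR_MINIMAX_API_KEY",                # placeholder key
        model="MiniMax-M2.7",                          # assumed model identifier
    )
    # Print instead of writing, so nothing on disk is touched by accident.
    print(json.dumps(settings, indent=2))
```

Keeping the override in its own configuration directory, rather than in the default one, leaves the regular Anthropic setup untouched.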
I didn't see a single error using the MiniMax API with Claude Code and my plugins, et cetera, et cetera. So yeah, it's as simple as that. In this demo, I'm actually going to run everything in a Docker sandbox using the sbx command, and that lets me give the agent full permissions without having to worry about my local machine. All right, let's get some work done.

The repo I'm running this on is an open-source project that I built called Cache Commander. It's a Rust terminal UI that scans developer cache directories (pip, npm, Homebrew, Hugging Face caches), and it looks for known CVEs in your packages. And of course it helps you find and easily delete all those unneeded packages and reclaim gigabytes or tens of gigabytes of disk space. I used Claude to build this, Opus 4.6.

On the left, I'm going to run Claude Code set up with Opus, and on the right, I'm going to run Claude Code set up with MiniMax. That's why I'm passing a different Claude directory here, because I have those settings.json lines that you saw just a second ago. Okay, so we'll just make sure here we're using Opus 4.7. And what about this one? OK, so on this one we see MiniMax is selected.

So let's use the two models here. Let's plan the code review with Opus, and let's execute it with MiniMax, which is actually one of the recommended ways to use this model. OK, so I'm going to go to plan mode and say: run a full code and test review on this repository. So Opus is going to do its thing. It's going to plan the code review, maybe launch some agents, and then we'll get a plan in the end, and we'll use the plan with MiniMax.

All right, so Opus is done. It did a fairly extensive review. I have 22 cache providers in the tool, so there's quite a lot to look at. I asked for findings only, no remediation techniques. We'll let MiniMax figure it out.
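The transcript does not show how the second configuration directory is passed, so the following is only one possible sketch of the side-by-side setup. It assumes Claude Code honors the `CLAUDE_CONFIG_DIR` environment variable to locate its settings; the directory names are hypothetical, and the sbx sandbox used in the video is left out entirely.

```python
import os
from pathlib import Path
from typing import Mapping, Optional


def claude_env(config_dir: Path, base_env: Optional[Mapping[str, str]] = None) -> dict:
    """Return a copy of the environment that points Claude Code at config_dir.

    Assumption: Claude Code reads CLAUDE_CONFIG_DIR to find settings.json.
    """
    env = dict(os.environ if base_env is None else base_env)
    env["CLAUDE_CONFIG_DIR"] = str(config_dir)
    return env


if __name__ == "__main__":
    # Hypothetical directories: one per model setup.
    opus_dir = Path.home() / ".claude"
    minimax_dir = Path.home() / ".claude-minimax"

    for label, directory in (("Opus", opus_dir), ("MiniMax", minimax_dir)):
        env = claude_env(directory)
        # Print the equivalent shell invocation rather than launching anything.
        print(f"{label}: CLAUDE_CONFIG_DIR={env['CLAUDE_CONFIG_DIR']} claude")
```

Each terminal then gets its own settings.json, so the planner and the executor can run at the same time without overwriting each other's configuration.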
Okay, so we have the path to that plan, and now we're just going to pass it to MiniMax and let it actually fix the issues. So we take the plan that Opus wrote and say: based on the findings, fix issues in the repo, and dispatch agents to parallelize work. So it's going to read the plan. It's going to go through all the findings, and I'm pretty sure it's going to launch agents to fix all the issues that Opus found. And yes, it's actually launching a lot of different agents to fix those different problems. O(n^2) to O(n^3) performance, so that's bad. Fix URL encoding, etc. So now we're doing all those things in parallel, and my code base is getting better.

All right, we're done. 30 minutes later, we have fixed all the issues in the plan. We've run all the tests. Everything is clean. So why don't we create a pull request and send it to the repo. The pull request is ready, number 33. Let's take a quick look here. Yep, I see it here. And it's got all the right details: the bug fixes, the tests. Now we have GitHub CI running. Yeah, looks good. And well, I guess I'll merge it and maybe release a new version based on this.

And maybe to close things here, let's check the usage. So I started at zero, and I consumed 298 queries out of the 4,500 allowed per five-hour window. So I should be perfectly okay. It's unlikely that you will ever hit usage limits during this kind of work. And remember, that's only $40 a month. So this looks like a very good deal.

So there you go, MiniMax M2.7 in Claude Code. I really like the fact that the compatibility looks excellent. It was very easy to configure Claude Code to use MiniMax. I didn't see any tool error or any kind of error.
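As a quick check on the usage and pricing figures quoted in this session, the arithmetic can be sketched as below. The numbers come from the transcript (298 queries used, a 4,500-query allowance per five-hour window, $40 a month or $400 a year); the variable names are only illustrative.

```python
def percent(part: float, whole: float) -> float:
    """Return part as a percentage of whole."""
    return 100.0 * part / whole


# Figures quoted in the transcript.
used_queries = 298
window_quota = 4500          # allowance per five-hour window
monthly_price = 40           # USD
yearly_price = 400           # USD

usage_pct = percent(used_queries, window_quota)
remaining = window_quota - used_queries
annual_saving = monthly_price * 12 - yearly_price

if __name__ == "__main__":
    print(f"Used {usage_pct:.1f}% of the window, {remaining} queries left")
    print(f"Paying yearly saves ${annual_saving} over twelve monthly payments")
```

So the whole review-and-fix session used under 7% of a single window, which matches the remark that this kind of work is unlikely to hit the limit.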

Tags

AI, Machine Learning, Technology
