Reallocating $100/Month Claude Code Spend to Zed and OpenRouter

(braw.dev)

53 points | by kisamoto 3 hours ago

15 comments

wiether 1 hour ago
People may feel differently about the fee that OpenRouter takes, but I think the service they provide is worth the extra cost.
Having access to dozens of models through a single API key, tracking cost of each request, being able to run the same request on different models and comparing their results next to each other, separating usages through different API keys, adding your own presets, setting your routing rules...
And once you start using an account with multiple users, it's even more useful to have all those features!
Not relying on a subscription and having the right to do exactly what you want with your API key (using it with any tool/harness...) is also a big plus to me.
[-]
- pixel_popping 56 minutes ago
  Expect you don't have the right to do what you want with the API Key (see waves of ban lately, many SaaS services have closed because of it).
  [-]
  - embedding-shape 53 minutes ago
    Unless you provide some more details, at least outline what "do what you want" was in your case, this seems like just straight up FUD.
    [-]
    - himata4113 0 minutes ago
      openrouter accepts crypto so might have been some money laundering involved for reselling dirty crypto for llm api.
      if that wasn't the reason, hey that's actually a great way to launder money (not financial advice).
ElFitz 1 hour ago
Has anyone (other than OpenClaw) used pi? (https://shittycodingagent.ai/, https://pi.dev/)
Any insights / suggestions / best practices?
[-]
- simgt 1 hour ago
  Yes, it's super cool. Check Mario's latest talk: https://www.youtube.com/watch?v=Dli5slNaJu0 Armin also has some videos covering it on his channel: https://www.youtube.com/@ArminRonacher/ Pi's Discord is still nice, even though it was a bit flooded after the openclaw thing.
- nocobot 37 minutes ago
  i really have been enjoying pi a lot
  at first i thought i was goring to build lots of extra plugins and commands but what ended up working for me is:
  - i have a simpel command that pulls context from a linear issue
  - simple review command
  - project specific skills for common tasks
delduca 53 minutes ago
I also dropped Claude Code Max.
I switched to OpenCode Zen + GitHub Copilot. For some reason, Claude Code burns through my quota really quickly.
https://opencode.ai/zen
[-]
- woutr_be 48 minutes ago
  How does Claude Code compare to OpenCode Zen? I’m on the $20/month Claude plan, and was considering OpenCode Zen as well.
  Due to the quota changes, I actually find myself using Claude less and less
  [-]
  - criley2 44 minutes ago
    I haven't tried $20 claude code recently, but I've used OpenCode Zen primarily so I can play with opensource/chinese models which are very inexpensive. I'd spend $0.50-$1.00 on a single claude opus 4.6 plan mode run, then have a chinese model execute the plan for like $0.10-$0.15 total. I'd keep context short, constantly start new threads, and get laser focused markdown plans and knowledgebase to be token efficient.
    If I just let opencode zen run claude opus to plan and execute, I'd spend $20 in like 5 minutes lol
supernes 1 hour ago
On the topic of Zed itself as a VSCode replacement - my experience is mixed. I loved it at first, but with time the papercuts add up. The responsiveness difference isn't that big on my system, but Zed's memory usage (with the TS language server in particular) is scandalous. As far as DX goes it's probably at 85% of the level VSCode provides, but in this space QoL features matter a lot. Oh, and it still can't render emojis in buffers on Linux...
cbg0 1 hour ago
I don't think there's currently better value than Github's $40 plan which gives you access to GPT5 & Claude variants. It's pay per request so not ideal for back-and-forth but great for building complex features on the cheap compared to paying per token.
Because GH is accessing the API behind the scenes, you should face less degradation when using Sonnet/Opus models compared to a Claude subscription.
Keep a ChatGPT $20 subscription alongside for back-and-forth conversations and you'll get great bang for buck.
[-]
- rafaelmn 1 hour ago
  I'm still paying the 10$ GH copilot but I don't use it because :
```
  - context is aggressively trimmed compared to CC obviously for cost saving reasons, so the performance is worse
  - the request pricing model forces me to adjust how I work
```
  Just these alone are not worth saving the 60$/month for me.
  I like the VSCode integration and the MCP/LSP usage surprised me sometimes over the dumb grep from CC. Ironically VSCode is becoming my terminal emulator of choice for all the CLI agents - SSH/container access and the automatic port mapping, etc. - it's more convenient than tmux sessions for me. So Copilot would be ideal for me but yeah it's just tweaked for being budget/broad scope tool rather than a tool for professionals that would pay to get work done.
  [-]
  - lbreakjai 36 minutes ago
    You can use your GH subscription with a different harness. I'm using opencode with it, it turns GH into a pure token provider. The orchestration (compacting, etc.) is left to the harness.
    It turns it into a very good value for money, as far as I'm concerned.
    [-]
    - rafaelmn 20 minutes ago
      But you still get charged per turn right ? I don't like that because it impacts my workflow. When I was last using it I would easily burn through the 10$ plan in two days just by iterating on plans interactively.
  - briHass 44 minutes ago
    Disagree entirely.
    GHCP at least is transparent about the pricing: hit enter on a prompt= one request. CC/Codex use some opaque quota scheme, where you never really know if a request will be 1,2,10% of your hourly max, let alone weekly max.
    I've never seen much difference with context ostensibly being shorter in GHCP, all of the models (in any provider) lose the thread well before their window is full, and it seems that aggressive autocompaction is a pretty standard way to help with that, and CC/Codex do it frequently.
    [-]
    - rafaelmn 23 minutes ago
      >I've never seen much difference with context ostensibly being shorter in GHCP, all of the models (in any provider) lose the thread well before their window is full, and it seems that aggressive autocompaction is a pretty standard way to help with that, and CC/Codex do it frequently.
      Then we've had wildly different results. Running CC and GH copilot with Opus 4.6 on same task and the results out of CC were just better, likewise for Codex and GPT 5.4. I have to assume it's the aggressive context compaction/limited context loading because tracking what copilot does it seems to read way less context and then misses out on stuff other agents pick up automatically.
- neya 53 minutes ago
  Is your source code worth only $40 for them to train their models on?
  https://www.techradar.com/pro/bad-news-skeptics-github-says-...
  [-]
  - cbg0 51 minutes ago
    Considering how much data they already have from everything that's on GitHub, I doubt you would make a dent boycotting their AI product.
- walthamstow 29 minutes ago
  Google $20/mo plan has great usage for Claude Opus. Last time I used it, around Feb, it felt basically unlimited.
bashtoni 1 hour ago
After hitting Claude limits today I spent the afternoon using OpenCode + GLM 5.1 via OpenRouter and I was very impressed.
OpenCode picked up my CLAUDE.md files and skills straight away, and I got similar performance to Opus 4.6.
_pdp_ 1 hour ago
Our bank (a major retail bank in UK) is refusing doing business with OpenRouter and OpenRouter issued a refund which we did not request. So something is up. There is that.
I might be paranoid but I feel that access to models will become more constraint in the future as the industry gets more regulated.
[-]
- chid 42 minutes ago
  I don't quite understand what you mean by something is up. Was the reason around security/telemetry or similar?
  [-]
  - _pdp_ 34 minutes ago
    Bank refused to provide reasons - even after a formal complaint was raised with them.
    We are not the only one. I found other people online experiencing the same issue. It is hard to tell how wide-spread this is but it is strange to say the least.
pixel_popping 59 minutes ago
It should be noted about Openrouter that you aren't allowed to expose the access to end users, it has to be for internal usage only, which can be fatal as they have made waves of account banning lately (without warnings).
[-]
- numlocked 37 minutes ago
  You are absolutely allowed to expose access to end users, as long as you continue to abide by terms of service. We have hundreds, if not thousands, of apps built on openrouter that in turn have end users of their own. We showcase many of them on our /apps ranking page!
urnfjrkrkn 1 hour ago
I would suggest to explore paid plans on different providers. Much better value than plans bundled with editors or API based usage in openrouter. And Chinese companies have versions hosted in Singapore or US.
Also ditching Claude Code is mistake. It is quite capable model, and still great value. I would keep it, even if it's just for code reviews and planning. Anthropic allows pro plans use in Zed.
philipp-gayret 1 hour ago
I like and do use Zed but be aware functionality like Hooks is not supported for their integration with Claude Code, as a heavy user of Hooks I would stick with the terminal.
[-]
- kisamoto 1 hour ago
  I'm always interested in how people use tools. I like to have a full editor to review code as a complement to the CLI and as I don't often use hooks the integration is also good enough for me.
  1. What do you use the hooks for?
  2. Do you use an editor alongside the CLI to review code or only examine the diffs?
Computer0 1 hour ago
I have had credits on open router that haven’t been deleted since near the projects launch, I believe 365 days is not a rule but rather a right reserved.
[-]
- numlocked 1 hour ago
  COO of OpenRouter here. Thats right — we haven’t done it to date but we can’t have unlimited liabilities stacking up forever. At some point we will start expiring credits from accounts that have seen zero activity in over a year.
  [-]
  - blitzar 18 minutes ago
    Maybe a bad suggestion, but can you do an inactivity "fee" - 25% / year (min $5) or something similar. I like the pre-pay system everyone in Ai seems to have settled on, its better than the AWS bills that we all know and love.
  - kisamoto 1 hour ago
    Thank you for taking the time to explain that - makes sense. I lifted what was present in your terms of service as I'd like to understand the minimum time I have.
hhthrowaway1230 1 hour ago
note: doesn't openrouter charge 5.5% fee?
[-]
- kisamoto 1 hour ago
  You are absolutely correct, I was not aware of this. I will update the article accordingly and perhaps it's more worthwhile to stay solely on Cursor with the limited models.
  Sadly Zed seems to add 10% so it's still more worthwhile to use OpenRouter.
  [-]
  - cedws 1 hour ago
    I feel like a bit of an idiot because I didn’t know this either. I just assumed OR was another startup burning money to provide models at cost.
    OpenRouter is a valuable service but I’ll probably try to run my own router going forward.
  - Kelteseth 1 hour ago
    Come on at least write the Hackernews replies yourself.
    [-]
    - kisamoto 1 hour ago
      I did. Perhaps too much consumption of AI responses but articles and engagement are written by me - a human.
      [-]
      - cbg0 1 hour ago
        That's exactly what a clanker would say. ^/s
    - glitchcrab 1 hour ago
      Only the opening sentence has an AI smell; the rest is definitely written by a fleshy meatbag
Computer0 1 hour ago
When I use the tool ccusage it says I use $600 of usage a month for my $100. I don’t know that this is a good value proposition for me if I want to stay with the same model, half the reason I use Claude code, personally.
[-]
- blitzar 1 hour ago
  > Reallocating $100/Month Claude Code Spend
  The new gimped claude code limits means my claude code spend the last month is $131. It cost me $20. I did an additional spend $5 on extra usage which cost me $5.
  While VC's are setting fire to money I am going to warm my hands.
  [-]
  - 542458 35 minutes ago
    I think it is worth noting that “what they charge for api access” != “marginal cost of inference”. So I don’t think getting i.e. $40 of api usage for $20 would be insane. $131 for $20 does probably mean somebody is losing money though.
- vanillameow 50 minutes ago
  I ran this just now and for a small web-app I built I used over $50 in a single day. This was using superpowers plugin and almost exclusively coordinating through Opus. Could I get by with 100$ a month without the subscription? Maybe, but I pay for the convenience of just being able to throw Opus with lavish plugins at it (with 5h limits that are, in my opinion, pretty reasonable). I don't really WANT to have to think about when Haiku or Sonnet are enough.
  If anything I would consider switching to OpenAI subscription (if I didn't despise them even more than Anthropic as a company), but converting to API use seems completely infeasible to me. I'd have to severely cut back on my use for not much benefit, other than having maybe an agent thats a little less jank than CC.
  [-]
  - blitzar 27 minutes ago
    Depending on your workflow, in the spirit of reallocating $100/Month subscription, it may be worth dropping to the $20/Month plan (or equivalent at other providers) and then pay as you go on the (rare) occasions you "build a small web-app I built and used over $50 in a single day".
    But at that point we are just min/maxing the details, and all I can say is if you are on a $100/$200 a month subscription to any of these services and not using them regularly then you shouldn't be on a $200 subscription any more than you should be on a $700 a month gym membership when you go every 3 months for 15 minutes.
i_love_retros 1 hour ago
I can't believe people are spending $100 a month on this! You're all mad!
[-]
- kisamoto 22 minutes ago
  I had a similar opinion a couple of years ago, content with more of an autocomplete.
  Now I'm happy with agents as the models and harnesses have improved significantly but the token usage comes at a cost.
- gozzoo 1 hour ago
  some are spending 100/day or even 1000/day. they must really be mad :)
  [-]
  - i_love_retros 1 hour ago
    Drunk on perceived power
    [-]
    - nubg 1 hour ago
      Your ignorance is our opportunity :)
Serberus 1 hour ago
[dead]