Reddit AI Coding - 2026-05-04

1. What People Are Talking About

1.1 Claude Model Quality Collapse Reaches Crisis Point (🡕)

Opus 4.7 quality complaints that emerged on May 3 exploded into a full-blown community revolt today, with multiple high-engagement threads documenting severe behavioral regressions across both 4.7 and 4.6.

u/lordfortunas posted "What is going on????" with 250 comments and 212 upvotes, documenting Claude making basic errors and ignoring instructions (post). u/shutupandshave described Claude as "an imbecile at the moment" — asked to validate a spec file's backend, Claude responded with color palette suggestions and admitted it hadn't read the document (post). u/nontitman [score 12] noted: "Bro Claude used to fetch the api documentation before using it, and now he doesn't even read files before editing them."

u/Obscurrium reported Opus 4.7 "hallucinates more, gets out of line of what it is supposed to do" and wasted 3 hours before hitting limits (post). u/eyes-wide-open-1 [score 18] confirmed: "I have added opus4.7 mitigations into my skills, but it still violates the rules in every single session." u/Alex_MCR asked "Is it just me, or does Opus 4.7 feel dumber today?" with 48 comments echoing the concern (post). u/3sides2everyStory [score 16] reported: "Even opus 4.6 is very very bad today."

u/superSmitty9999 asked "Even Opus 4.6 sucks now?" and theorized Anthropic may be quantizing older models without disclosure (post). u/RemoteToHome-io [score 26] responded: "Anthropic nerfed everything. They have vastly exceeded their capacity for compute."

Discussion insight: Users are splitting into two camps: those pinning to Opus 4.6 via claude --model claude-opus-4-6 as documented by u/goship-tech [score 10] (post), and those migrating to Codex or local models entirely. u/Quick_Ask_9004 offered a counterpoint as a Max 20x user: "I dunno what everyone has been moaning about, I still love Claude" (post).

Comparison to prior day: On May 3, Opus 4.7 regression was documented analytically (712 "pre-existing" uses across 139 sessions). Today the conversation shifted from quantified analysis to raw frustration, with users reporting both 4.7 and 4.6 degraded simultaneously — suggesting infrastructure-level changes, not just model-specific issues.


1.2 Anthropic Pricing and Throttling Backlash Intensifies (🡕)

The pricing and usage limit frustrations from prior days hardened into accusations of fraud and scamming, with users documenting mid-month throttling and opaque limits.

u/manavb84 titled their post "Anthropic is straight-up scamming Max 20x customers with sneaky mid-month throttling + endless bot runaround" (312 upvotes, 123 comments) (post). u/PragmaticSalesman highlighted the information void: "why is there ZERO info about weekly limits on the 20x claude code max plan available on the internet, even as anecdotes or via user tests?" (post). u/Bitter-Law3957 [score 13] pointed to ccusage.com as a reverse-engineering tool.

u/Redditor_in_Space reported "The Claude Code limit expires immediately" with 33 comments documenting the issue (post). u/BudgetAudiophile reported being banned less than 24 hours after subscribing to Pro with no response to appeal (post).

u/Rich_Lab_8434 shared a creative workaround: using Claude Code Routines to pre-start the 5-hour rolling window so it aligns with working hours, effectively letting users control when their limit budget is consumed (post).
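
The timing behind this workaround can be sketched as a small calculation. This is a minimal illustration, assuming the window opens at the first request and lasts exactly five hours; the function names and example times are hypothetical, and the actual reset mechanics are not documented in the thread.

```python
from datetime import datetime, timedelta

# Assumed fixed window length, taken from the "5-hour rolling window" in the post.
WINDOW = timedelta(hours=5)


def warmup_time(desired_reset: datetime) -> datetime:
    """When to send a throwaway request so the window resets at desired_reset."""
    return desired_reset - WINDOW


def window_resets(first_request: datetime, count: int = 3) -> list:
    """Next `count` reset times, assuming each new window is opened
    immediately when the previous one ends."""
    return [first_request + WINDOW * (i + 1) for i in range(count)]


if __name__ == "__main__":
    # Hypothetical goal: a fresh window exactly at a 09:00 start of work.
    start_of_work = datetime(2026, 5, 4, 9, 0)
    trigger = warmup_time(start_of_work)
    print("schedule warm-up at:", trigger)
    print("subsequent resets:", window_resets(trigger, 2))
```

A scheduled routine would fire at the `warmup_time` result, so the first reset lands at the start of the working day instead of partway through it.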

Discussion insight: The opacity of weekly vs session limits is creating more frustration than the limits themselves. Users paying $200/month expect clear metering. u/nndscrptuser [score 4] claimed their CTO runs 3-5 concurrent Opus sessions all day on the 20x plan without hitting weekly limits, suggesting widely varying experiences.

Comparison to prior day: May 3 quantified the cost gap (10-16x versus alternatives). Today the narrative shifted from cost comparison to accusations of deceptive practices — throttling, bans, and deliberate opacity — marking an erosion of trust beyond simple pricing complaints.


1.3 Copilot Platform Exodus Accelerates (🡕)

GitHub Copilot's identity crisis deepened with model deprecations, restrictive limits, and the Copilot team going silent on their own subreddit.

u/Healthy-Rate6691 posted "RIP Vibe Coding 2024-2026" (398 upvotes) documenting the end of affordable AI coding via Copilot (post). u/lolitscharli posted a screenshot showing "60% is my maximum monthly usage of my maximum monthly usage?" — calling it fraud (post). u/kingmike2001a on the $10 Pro plan found basic Sonnet locked behind the $39 tier: "I am paying 10$ for gpt-4o?!" (post).

u/bierundboeller observed the Copilot team has stopped replying on the subreddit (post). u/HebelBrudi [score 104] explained: "none of the people responsible for the GitHub copilot project made the decision to drastically alter pricing." u/Jack99Skellington [score 67]: "They would have to wear asbestos suits to reply in this sub."

u/AmblemYagami reported the upcoming deprecation of GPT-5.2 and GPT-5.2-Codex effective June 1 (post). u/programmingstarter asked "Who will even use copilot after June?" with u/OwnNet5253 [score 44] responding bluntly: "Enterprises" (post).

Discussion insight: u/FranTimo [score 45] pushed back: "I get the impression that all these posts are from non-developers. As a professional developer who uses GitHub Copilot daily, I've never encountered any of the issues or limits" (post). The split between enterprise satisfaction and individual frustration is growing.

Comparison to prior day: May 3 introduced the "what is Copilot even for?" question. Today it solidified into concrete departures: users actively searching for alternatives, team communication shutdown, and pricing structures that make the $10 tier nearly useless.


1.4 The Rise of Budget Alternatives and Local Models (🡕)

Cost pressure is driving rapid adoption of cheaper model alternatives, with DeepSeek, Qwen, and OpenCode Go gaining significant traction.

u/jimmytoan shared DeepClaude, which runs the full Claude Code agent loop on DeepSeek V4 Pro at a claimed cost roughly 95% below Anthropic's: $0.44/M input vs $3/M (about 85% lower), $0.87/M output vs $15/M (about 94% lower) (post). u/Sairefer [score 46] pointed out DeepSeek already has a native /anthropic endpoint, making it simpler: just set ANTHROPIC_BASE_URL to DeepSeek's API.
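
The quoted prices can be checked with a few lines of arithmetic. The sketch below uses only the four per-million-token prices from the post; the 1M-input / 1M-output mix in the blended example is a hypothetical workload, not a figure from the thread.

```python
# Prices per million tokens, as quoted in the post.
DEEPSEEK = {"input": 0.44, "output": 0.87}
ANTHROPIC = {"input": 3.00, "output": 15.00}


def savings_pct(cheap: float, baseline: float) -> float:
    """Percentage saved by paying `cheap` instead of `baseline`."""
    return (1 - cheap / baseline) * 100


def blended_savings_pct(input_m: float, output_m: float) -> float:
    """Savings for a workload of input_m / output_m million tokens."""
    cheap = DEEPSEEK["input"] * input_m + DEEPSEEK["output"] * output_m
    baseline = ANTHROPIC["input"] * input_m + ANTHROPIC["output"] * output_m
    return savings_pct(cheap, baseline)


if __name__ == "__main__":
    print(round(savings_pct(DEEPSEEK["input"], ANTHROPIC["input"]), 1))    # 85.3
    print(round(savings_pct(DEEPSEEK["output"], ANTHROPIC["output"]), 1))  # 94.2
    print(round(blended_savings_pct(1, 1), 1))                             # 92.7
```

On these numbers, the "roughly 95%" figure matches the output-token price; input tokens are closer to 85% cheaper, so the realized saving depends on the input/output mix of a given session.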

u/Charming-Author4877 hit their Pro+ weekly limit in 6 prompts and switched to a custom Qwen 27B running locally on a 5090 — it solved bugs that Opus 4.6 had failed on 4 times (post). The user ran PHP, C++, and complex CSS/JS debugging tasks successfully, concluding: "Local agents are not just a fallback."

u/LaxederBR tested Codex, Cursor, Windsurf, and OpenCode Go side-by-side. Conclusion: "OpenCode Go + DeepSeek V4 Flash at $0.15 per task delivered quality comparable to Copilot" (post). u/Mayanktaker [score 44]: "Opencode go. Using it. So good." u/FragmentedHeap [score 13] described a full local setup: "Opencode cli with deepseek API coupled with custom agents that delegate to local inference on a 4090. It's a cheat code."

u/reddefcode built Zerikai Memory, a local MCP server that exploits DeepSeek KV caching for 50x token cost reduction by maintaining persistent project context (post).

Discussion insight: The local model movement is no longer theoretical. Users with high-end GPUs (5090, 4090) are running competitive coding agents locally, citing privacy and cost as dual motivations. The quality gap is closing rapidly for routine coding tasks.

Comparison to prior day: May 3 showed cost comparison data driving migration interest. Today users are sharing working setups with specific configurations, benchmarks, and bug-fix evidence — the migration is now operational, not speculative.


1.5 CLAUDE.md Effectiveness and Instruction Following Crisis (🡒)

A recurring theme across multiple threads: CLAUDE.md files are being ignored, undermining the primary user-facing configuration mechanism for Claude Code.

u/CautiousAd3917 asked "Whats even the point of Claude.md" — workflows stored in CLAUDE.md get constantly ignored, Claude agrees it should follow them, then ignores them again next task (post). u/stbenjam42 [score 21] suggested .claude/rules works better. u/l_m_b [score 21] delivered the core truth: "Any instructions to an LLM are only probabilistically followed, with p < 1 and q >> 0."

u/Ok-Distribution8310 continued the May 3 "pre-existing" analysis: despite CLAUDE.md explicitly saying "Every error is yours to fix — not label, not defer," Claude used "pre-existing" 712 times in 30 days (post). u/Low-Yesterday241 [score 5] reported Opus 4.7 "was treating its skills and guardrails as suggestions" and downgraded to 4.6 (post).

Discussion insight: The community is converging on a workaround hierarchy: .claude/rules > skills > hooks/CI > CLAUDE.md for reliable instruction following. u/etherwhisper [score 13] summarized the fix: "Skills, linters, hooks, CI" — move enforcement from soft prompts to hard tooling.
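
Moving enforcement from soft prompts to hard tooling can be as simple as a script that a hook or CI job runs and that fails on a rule violation. The sketch below is one hypothetical way to do it: it scans text for the "pre-existing" label discussed in the thread. The rule set, function names, and the idea of scanning agent output files are assumptions for illustration, not a documented Claude Code feature.

```python
import re

# Hypothetical rule set: the phrase comes from the thread, the rule name is illustrative.
BANNED_PATTERNS = {
    "deferral label": re.compile(r"\bpre-existing\b", re.IGNORECASE),
}


def find_violations(text: str) -> list:
    """Return (line_number, rule_name, line) for each line matching a banned pattern."""
    hits = []
    for lineno, line in enumerate(text.splitlines(), start=1):
        for name, pattern in BANNED_PATTERNS.items():
            if pattern.search(line):
                hits.append((lineno, name, line.strip()))
    return hits


def check_files(paths: list) -> int:
    """Return 1 if any file contains a violation, else 0.

    A hook or CI step could block on this value, for example with
    sys.exit(check_files(sys.argv[1:])).
    """
    failed = False
    for path in paths:
        with open(path, encoding="utf-8") as fh:
            for lineno, name, line in find_violations(fh.read()):
                print(f"{path}:{lineno}: {name}: {line}")
                failed = True
    return 1 if failed else 0
```

Unlike an instruction in CLAUDE.md, this check either passes or fails; the model cannot agree with it and then ignore it.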

Comparison to prior day: May 3 introduced sanctioned exits ("every deferred bug gets a one-liner in BUGS.md") as a mitigation. Today the discussion broadened to question the entire CLAUDE.md mechanism, with users discovering that .claude/rules and skills have better compliance rates.


1.6 Vibe Coding Culture: Identity, Monetization, and Addiction (🡒)

The vibe coding community continued its self-reflective arc, with threads spanning impostor syndrome, monetization success stories, and compulsive usage patterns.

u/RelevantTurnip3482 returned with "I feel like a fraud" (161 comments) — built a working, tested project without writing any code (post). u/heavedistant [score 98]: "You're not a fraud, people without programming experience build and launch apps everyday." u/chuckycastle [score 18] countered simply: "You are. It's not secure."

u/No_Shopping_8099 shared "My vibe coded app makes me about $200/mo" (110 comments) (post), while u/hacegop posted the satirical counterpoint: "My vibe coded app makes me about -$123.20/mo" (post).

u/retrorays asked "anyone having an issue going to sleep now with vibecoding?" (44 comments) describing the compulsive loop of prompt-fix-test cycles (post). u/parboman [score 9]: "Downgraded from 20x to 5x for my health and to help with boundaries." u/Nice-Permission-4339 [score 7] compared it to Civilization: "that 'just one more prompt' loop feels exactly like a game of Civilization where you look up and it is 4am."

u/Fra_Lia polled monthly AI tool spending (155 comments): responses ranged from $0 (Antigravity users) to $350+/month across multiple subscriptions (post). The median appears to be $100-200/month for active builders.

Discussion insight: The vibe coding identity is bifurcating: one camp embraces it as a legitimate new skill ("understanding architecture > understanding syntax"), the other insists there are accountability gaps that non-coders cannot bridge. The addiction thread suggests vibe coding is creating compulsive patterns similar to those seen in gaming.

Comparison to prior day: May 3 focused on production failure modes. Today the conversation was more cultural — identity, mental health, and the economics of being a vibe coder.


2. What Frustrates People

Claude Model Degradation Across Versions — Severity: Critical

Both Opus 4.7 and 4.6 exhibited simultaneous quality drops on May 4, with users reporting hallucinations, ignored instructions, files not read before editing, and unprompted tangents. u/shutupandshave: "Claude is an imbecile at the moment... Coworker today: 'I dont know what's going on, Claude is just making me so angry'" (post). u/BroccoliFunny977 [score 13]: "I asked Claude what a command did, he launched it and generated 130 images with the Gemini API."

Usage Limit Opacity and Surprise Throttling — Severity: High

$200/month Max 20x subscribers cannot determine their actual weekly limits. No official documentation exists. Mid-month throttling occurs without warning. u/manavb84: "Anthropic is straight-up scamming Max 20x customers with sneaky mid-month throttling" (post). u/TheKubesStore [score 1]: "On opus 4.7, I burn through an entire session limit in less than 3 hours" (post).

CLAUDE.md Ignored By Models — Severity: High

Configuration files meant to guide Claude behavior are probabilistically ignored, with no deterministic enforcement mechanism. u/CautiousAd3917: "I have some workflows stored in Claude.md... they get constantly ignored. When asked, Claude agrees it should have sticked to the workflows... and then happily ignores them again" (post).

Copilot Model Stripping and Value Collapse — Severity: Medium

Pro plan ($10) users locked out of Sonnet, limited to GPT-4o. Pro+ users hitting weekly limits in 6 prompts. GPT-5.2 deprecation further reduces model choice. u/kingmike2001a: "I am paying 10$ for gpt-4o?! might as well move to claude/codex" (post).

Account Bans Without Recourse — Severity: Medium

Users report suspension from paid accounts with no explanation and automated appeal responses. u/BudgetAudiophile: "Banned less than 24 hours after subscribing to Pro and no response to appeal" (post).


3. What People Wish Existed

Transparent, Real-Time Usage Metering — Opportunity: Critical

Users paying $100-200/month want to see exactly how many tokens they have consumed, what remains in session and weekly budgets, and when resets occur. Currently "ZERO info about weekly limits... available on the internet" — u/PragmaticSalesman (post). Third-party tools like ccusage.com exist but require reverse engineering.
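
One way such a meter could work is a sliding-window counter over recorded token usage. This is a minimal sketch: the class name and the budget value are hypothetical (the real limits are exactly what users say is undisclosed), and it assumes usage events are recorded in chronological order.

```python
from collections import deque
from datetime import datetime, timedelta


class RollingMeter:
    """Track token usage inside a sliding time window."""

    def __init__(self, window: timedelta, budget: int) -> None:
        self.window = window
        self.budget = budget      # hypothetical limit, not a published figure
        self._events = deque()    # (timestamp, tokens), oldest first

    def record(self, when: datetime, tokens: int) -> None:
        """Log one request; callers must record events in time order."""
        self._events.append((when, tokens))

    def used(self, now: datetime) -> int:
        """Tokens consumed within the window ending at `now`."""
        cutoff = now - self.window
        # Drop events that have aged out of the window.
        while self._events and self._events[0][0] <= cutoff:
            self._events.popleft()
        return sum(tokens for _, tokens in self._events)

    def remaining(self, now: datetime) -> int:
        """Budget left in the current window, never below zero."""
        return max(self.budget - self.used(now), 0)
```

Two instances, one with a 5-hour window and one with a 7-day window, would cover the session and weekly budgets that users want to see side by side.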

Deterministic Instruction Enforcement for AI Agents — Opportunity: High

CLAUDE.md and similar config files are probabilistically followed. Users want hard guardrails — hooks, CI checks, or runtime enforcement — that guarantee compliance with user-defined workflows. u/l_m_b: "If you need compliance, you need it scripted deterministically" (post). Current workaround: .claude/rules + skills + linters.

Model-Agnostic Agent Routing With Native Support — Opportunity: High

Multiple posts documented manual setups for routing Claude Code through DeepSeek or other backends. Users want this built-in: route complex reasoning to Opus, routine coding to DeepSeek/Qwen, debugging to Codex — automatically. u/rair41 showed manual proxy routing (post). u/jimmytoan built DeepClaude for this purpose (post).
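
The routing users describe can be sketched as a lookup table plus a classifier. The split (Opus for complex reasoning, DeepSeek for routine coding, Codex for debugging) comes from the posts; the keyword heuristic, the fallback choice, and the model labels are hypothetical placeholders rather than real API identifiers.

```python
# Task category -> backend label, following the split described in the posts.
ROUTES = {
    "architecture": "opus",
    "implementation": "deepseek",
    "debugging": "codex",
}
DEFAULT_MODEL = "deepseek"  # assumption: fall back to the cheapest backend

# Hypothetical keyword heuristic; a real router would need something sturdier.
KEYWORDS = {
    "architecture": ("design", "architecture", "trade-off"),
    "debugging": ("bug", "traceback", "crash", "failing test"),
    "implementation": ("implement", "write", "endpoint"),
}


def classify(prompt: str) -> str:
    """Return the first category whose keywords appear in the prompt."""
    text = prompt.lower()
    for category, words in KEYWORDS.items():
        if any(word in text for word in words):
            return category
    return "other"


def route(prompt: str) -> str:
    """Pick a backend label for the prompt."""
    return ROUTES.get(classify(prompt), DEFAULT_MODEL)


if __name__ == "__main__":
    for p in ("Design the plugin architecture",
              "Fix the crash in the login endpoint",
              "Implement a pagination endpoint"):
        print(p, "->", route(p))
```

Category order in `KEYWORDS` doubles as priority, so a prompt mentioning both a crash and an endpoint is treated as debugging.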

Vibe Coding Security Scanner — Opportunity: Medium

Non-technical builders consistently ship apps without understanding security implications. u/RelevantTurnip3482 asked "How do you secure a vibecoded project?" (47 comments) (post). u/Big_Humor2074 raised cybersecurity concerns (post). An automated scanner targeting common vibe-coding vulnerabilities has clear demand.

Cross-Session Persistent Agent Memory — Opportunity: Medium

Claude loses project context between sessions despite CLAUDE.md files. u/reddefcode built Zerikai Memory as a workaround using local ChromaDB (post). u/naxmax2019 shared Claude Bootstrap v3.6 for cross-agent intelligence (post).


4. Tools and Methods in Use

| Tool | Category | Sentiment | Strengths | Limitations |
|---|---|---|---|---|
| Claude Code (Opus 4.7) | AI Coding Agent | Negative | Agentic tooling, dispatch, file editing | Severe quality regression, ignores instructions, burns limits fast |
| Claude Code (Opus 4.6) | AI Coding Agent | Mixed | More reliable than 4.7, trusted baseline | Also degraded on May 4, deprecation feared |
| OpenAI Codex CLI | AI Coding Agent | Positive | Cheaper, strong debugging, growing ecosystem | Less mature tooling than Claude Code |
| DeepSeek V4 Pro | AI Coding Model | Positive | 95% cheaper than Anthropic, native Claude Code compatibility | No image input, no prompt caching, no MCP |
| DeepSeek V4 Flash | AI Coding Model | Positive | $0.02-0.15 per task, equivalent quality for routine work | Limited agentic capabilities |
| Qwen 27B (local) | Local Model | Positive | Free after hardware, solved bugs Opus couldn't, privacy | Requires 5090/4090, slower on smaller GPUs |
| OpenCode Go | CLI Tool | Positive | Open source, model-agnostic, DeepSeek integration | Requires self-setup and API keys |
| GitHub Copilot | IDE Integration | Negative | Enterprise integration, VS Code native | Model stripping, team silent, value collapsed for individuals |
| Cursor | IDE | Declining | Fast, good completions | $200/month burns in 3 days, model switching without warning |
| Kimi 2.6 | AI Coding Model | Positive | Cheap, usable via Claude Code proxy | Underperforms slightly in Claude harness |
| claude-code-proxy | Routing Tool | Positive | Routes Kimi/OpenAI through Claude Code interface | ToS risk, potential account bans |
| ccusage.com | Usage Tracking | Positive | Reverse-engineers Claude usage limits | Third-party, not officially supported |

The dominant dynamic continues to be disaggregation: users want Claude Code's agentic harness but refuse to pay Anthropic's inference prices. The DeepSeek native /anthropic endpoint has simplified routing dramatically — a single environment variable change replaces complex proxy setups. OpenCode Go is solidifying as the community's recommended open-source CLI for budget-conscious users. Cursor saw notable departures, with u/Borat_2020 posting "FAREWELL CURSOR" and citing Codex CLI economics (post).


5. What People Are Building

| Project | Who built it | What it does | Problem it solves | Stack | Stage | Links |
|---|---|---|---|---|---|---|
| Till Then | u/daviden | Simple countdown/time tracker app | Low-friction time tracking without accounts | SwiftUI, SwiftData, OpenAI Codex, Claude | #1 Productivity in Swedish App Store | daviden.se/tillthen |
| Starfox-inspired game | u/Terrible-Software165 | Starfox-style 3D game | Retro gaming recreation | ThreeJS | Playable demo | post |
| DeepClaude | u/jimmytoan | Routes Claude Code agent loop through DeepSeek V4 Pro | 95% cost reduction for Claude Code users | Python proxy, localhost:3200 | Working, open source | post |
| claude-code-proxy | u/rair41 | Routes Kimi/OpenAI through Claude Code | Use cheaper models in Claude Code interface | Proxy layer | Open source | github.com/raine/claude-code-proxy |
| Zerikai Memory | u/reddefcode | Local MCP memory server with DeepSeek KV caching | 50x token cost reduction via persistent project context | Python, ChromaDB, DeepSeek API | Open source | github.com/KikeVen/zerikai_memory |
| Auto-Restart for Claude Code | u/emnoleg | Automatic restart mechanism for Claude Code | Session interruptions and limit-related stops | Claude Code hooks | Documented | post |
| TermLoop | u/feritzcan | Terminal app for managing parallel coding agents | Running 10 coding agents simultaneously | Terminal UI | Open source | termloop.ai |
| Token-saving tools | u/EfficientAnimal6273 | Curated list of cost-reduction tools | Reducing AI coding token spend | Caveman, auto-memory, SymDex | Community curation | post |
| VibeRig | u/Interesting-Town-433 | Vibe coding project | Undisclosed | Undisclosed | Shared | post |

Notable pattern: the most impactful projects being built are meta-tools for the AI coding ecosystem — cost reducers, model routers, usage trackers, and agent orchestrators. u/daviden's Till Then stands out as a rare end-user product success story: built with a local-first SwiftUI approach using AI for iteration, it reached #1 in the Swedish App Store. The builder noted: "Most of the work wasn't adding features, it was cutting them until only the core idea remained" (post). u/Effective-Mammoth523 [score 19]: "People don't want to manage another platform; they just want the result."


6. New and Notable

Local Models Matching Frontier Performance on Targeted Tasks

u/Charming-Author4877 documented Qwen 27B running locally on a 5090 solving a bug that Opus 4.6 failed on across 4 attempts: a nested modal scroll state issue in complex JavaScript/CSS (post). The model also successfully refactored a C++ custom scripting language to PHP and debugged a cmake CUDA toolkit detection issue. This is the first concrete report in this community of a local quantized model outperforming a frontier cloud model on real-world coding tasks, though it rests on one user's tasks rather than a benchmark.

DeepSeek's Native Anthropic Endpoint Simplifies Migration

DeepSeek now offers an /anthropic endpoint that allows Claude Code users to switch backends with a single environment variable change: ANTHROPIC_BASE_URL="https://api.deepseek.com/anthropic". u/Hodler-mane [score 25] demonstrated the complete three-line setup (post). This eliminates the need for proxy servers and dramatically lowers the migration barrier.
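
The backend switch can be sketched as building an environment for the Claude Code process. Only `ANTHROPIC_BASE_URL` and its value come from the post; passing the key through `ANTHROPIC_AUTH_TOKEN` is an assumption, since the other two lines of the setup are not reproduced in this digest, and the helper name is hypothetical.

```python
import os
from typing import Optional

# Endpoint value as given in the post.
DEEPSEEK_ANTHROPIC_URL = "https://api.deepseek.com/anthropic"


def deepseek_env(api_key: str, base: Optional[dict] = None) -> dict:
    """Return a copy of the environment pointing Claude Code at DeepSeek."""
    env = dict(os.environ if base is None else base)
    env["ANTHROPIC_BASE_URL"] = DEEPSEEK_ANTHROPIC_URL
    # Assumption: the DeepSeek key is supplied via ANTHROPIC_AUTH_TOKEN.
    env["ANTHROPIC_AUTH_TOKEN"] = api_key
    return env


# Usage sketch (not executed here):
#   import subprocess
#   subprocess.run(["claude"], env=deepseek_env(os.environ["DEEPSEEK_API_KEY"]))
```

Building a copy instead of mutating `os.environ` keeps the switch scoped to one launched process, so other sessions keep using the default backend.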

Google I/O Leaks: Gemini 3.2/3.5 and "Omni" Push

u/Much_Ask3471 shared leaked details about Google's upcoming I/O announcements: Gemini 3.2 and 3.5 models, plus an "Omni" multimodal push (post). If confirmed, this adds another competitive pressure point on Claude and Copilot pricing.

GSD Framework Backlash and Lightweight Alternatives

The GSD (Get-Shit-Done) framework, popular months ago, is now viewed as bloated. u/gaua314159 reported it "burns through the max subscription in an hour and can't even do a whole phase" (post). The community is shifting toward lighter frameworks: Matt Pocock's skills, compound-engineering, and vanilla Claude Code with focused CLAUDE.md files. u/Ok-Attention2882 [score 18]: "GSD is a vibe coded pile of shit."

Spec-Framework Redundancy Question Resolved

u/3abwahab's question about whether Agent OS, BMAD, Superpowers, and SpecKit are still worth using drew 57 comments (post). Consensus: lean CLAUDE.md + skills for single sessions, but frameworks still needed for multi-session projects needing persistent task state. u/return_of_valensky described the emerging pattern: "1,000 small sessions that do exactly 1 thing" with headless agent passes (INVESTIGATE -> IMPLEMENT -> REVIEW) in git worktrees.
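
The "many small sessions" pattern can be sketched as a plan builder that creates one git worktree per task and queues one headless pass per phase. The phase names come from the comment; the prompt wording, directory layout, branch naming, and the use of `claude -p` for a non-interactive pass are assumptions for illustration. The function only builds commands and does not run them.

```python
PHASES = ("INVESTIGATE", "IMPLEMENT", "REVIEW")


def plan_task(task_id: str, goal: str, root: str = "../worktrees") -> list:
    """Build (cwd, argv) pairs for one small task.

    The first command creates an isolated worktree from the repository root;
    the rest are one headless agent pass per phase, run inside that worktree.
    """
    worktree = f"{root}/{task_id}"
    plan = [(".", ["git", "worktree", "add", "-b", f"task/{task_id}", worktree])]
    for phase in PHASES:
        # Hypothetical prompt format: phase name first, then the single goal.
        prompt = f"{phase}: {goal}. Do only this phase, then stop."
        plan.append((worktree, ["claude", "-p", prompt]))
    return plan


if __name__ == "__main__":
    for cwd, argv in plan_task("scroll-fix", "fix nested modal scroll state"):
        print(cwd, argv)
```

Each pair could be handed to `subprocess.run(argv, cwd=cwd)`; keeping every pass in its own process is what makes each session do exactly one thing.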


7. Where the Opportunities Are

[+++] Transparent usage metering dashboard for Claude Code. The gap between what users pay ($100-200/month) and what they can measure (nothing) is generating the strongest frustration signal. A tool that shows real-time token consumption, weekly budget projections, and session cost breakdowns would have immediate adoption. Evidence: u/PragmaticSalesman (post), u/manavb84 (post), ccusage.com as early proof of demand.

[+++] Smart model routing for AI coding agents — turnkey, not DIY. Users are building one-off proxies, setting environment variables, and manually switching models. A polished tool that auto-routes tasks by complexity to the cheapest adequate model (Opus for architecture, DeepSeek for implementation, Codex for debugging) would capture the migration wave. Evidence: u/jimmytoan (post), u/rair41 (post), u/centminmod multi-model verification workflow.

[++] Deterministic instruction enforcement layer for AI coding agents. A middleware that converts CLAUDE.md-style soft instructions into hard pre/post-commit hooks, linter rules, and runtime checks. Would solve the probabilistic instruction-following problem across models. Evidence: u/CautiousAd3917 (post), u/etherwhisper pointing to skills/hooks/CI as the fix.

[++] Local model setup kit for AI coding. Users with high-end GPUs want to run Qwen/DeepSeek locally but struggle with configuration. A one-click setup that integrates local models into existing coding workflows (VS Code, Claude Code harness, OpenCode) would lower the barrier. Evidence: u/Charming-Author4877 (post), multiple commenters asking for setup instructions.

[+] Vibe coding security audit as a service. Building on May 3's production failure mode documentation. Non-technical builders consistently ship insecure apps and need expert review before launch. A fixed-price security audit targeting the known vulnerability patterns (RLS, webhooks, rate limits) could be productized. Evidence: u/RelevantTurnip3482 (post), u/chuckycastle [score 18]: "You are [a fraud]. It's not secure" (post).


8. Takeaways

  1. Claude's quality regression hit both 4.7 and 4.6 simultaneously on May 4, suggesting infrastructure-level degradation rather than model-specific issues. Multiple independent reports described hallucinations, ignored instructions, and files not read before editing on both versions. (u/shutupandshave, u/Alex_MCR, u/3sides2everyStory)

  2. The Anthropic trust crisis escalated from pricing complaints to fraud accusations. Max 20x subscribers ($200/month) documented mid-month throttling, instant limit expiration, account bans without recourse, and zero documentation of weekly limits. (u/manavb84, u/PragmaticSalesman, u/BudgetAudiophile)

  3. DeepSeek's native Anthropic endpoint reduced Claude Code migration from a proxy-building exercise to a three-line config change. Combined with DeepClaude and similar tools, the 95% cost reduction is now operationally trivial to achieve. (u/Sairefer, u/Hodler-mane)

  4. A local Qwen 27B on a 5090 solved a bug that Opus 4.6 failed on four times, the first concrete report in this community that local models can outperform frontier cloud models on targeted coding tasks. (u/Charming-Author4877)

  5. CLAUDE.md files are probabilistically ignored, and the community is migrating enforcement from soft prompts to hard tooling. In users' reports, .claude/rules, skills, linters, and hooks provide better compliance than CLAUDE.md alone. (u/CautiousAd3917, u/stbenjam42)

  6. GitHub Copilot's team has gone silent on their own subreddit, and individual users are actively leaving. Users report that the Pro plan ($10) is now limited to GPT-4o, GPT-5.2 is being deprecated, and commenters expect enterprises to be the main remaining audience. (u/bierundboeller, u/programmingstarter)

  7. Vibe coding monthly costs range from $0 to $350+, with the median active builder spending $100-200/month. The community is beginning to recognize compulsive usage patterns, with one user downgrading their plan specifically for health reasons. (u/Fra_Lia, u/parboman)

  8. The meta-tool ecosystem is the fastest-growing segment of AI coding. Cost reducers, model routers, usage trackers, and agent orchestrators now dominate builder activity, reflecting that the tooling around AI coding is less mature than the models themselves. (u/reddefcode, u/EfficientAnimal6273, u/rair41)