Reddit AI Coding - 2026-04-19¶

1. What People Are Talking About¶

1.1 Opus 4.7 Day Four: Community Data Confirms the Backlash 🡕¶

The Opus 4.7 backlash moved from anecdote to data on day four. u/RichensDev collected every Opus 4.7 thread posted to r/ClaudeCode since release day (April 16) and produced a quantified meta-analysis: 110 threads, 2,187 comments, 1,411 unique users. The numbers are stark: 41 explicitly critical threads accumulated 3,500 upvotes; 9 explicitly positive threads accumulated 39. That is a 90:1 upvote ratio against the new model. Of distinct users expressing a behavioral response, 32 reverted to Opus 4.6, 26 moved to Codex (GPT-5.4), and 17 cancelled or requested refunds. Only 13 users posted clearly positive takes (Opus 4.7: 110 threads, 2,187 comments. Unbiased analysis).

u/MurkyFlan567 provided the most rigorous individual comparison, running the open-source codeburn tool across 8,020 Opus 4.6 calls and 3,592 Opus 4.7 calls from real coding sessions. The results: one-shot success rate dropped from 83.8% to 74.5%, retry rate doubled (0.22 to 0.46 per edit), cost per call rose from $0.112 to $0.185, and output tokens per call more than doubled (372 to 800). Coding one-shot rate dropped from 84.7% to 75.4%; debugging from 85.3% to 76.5% (Opus 4.7 vs 4.6 after 3 days of real coding).

Codeburn benchmark dashboard comparing Opus 4.6 vs 4.7 performance, efficiency, and per-category one-shot rates

u/RoadExcellent9531 (470 score, 120 comments) provided a domain-specific critique from seven months of CV/CUDA development: Opus 4.7 hallucinated that inference values degraded because ".engine files were loaded into DirectML" -- a combination that does not exist in the user's stack -- and repeatedly declared "ROOT CAUSE FOUND!" for fabricated causes. The user identified Opus 4.5 (December through mid-February) as peak performance and has switched to Codex as a fallback (Claude Opus 4.7 is dogshit).

The strongest signal from u/RichensDev's analysis: "The gap between '4.7 on max' and '4.7 on default' experiences is arguably the strongest signal in the data." The 13 positive users consistently reported using max or xhigh effort. u/Poonamoon: "4.7 on Max effort is a noticeable improvement over xhigh or high, and a real improvement over 4.6 -- at coding and planning. For anything else? The Opus line has degraded for sure." u/Free-Path-5550 documented that Claude Code on Pro/Max plans silently defaults to medium effort (dropped from high in March), which many users do not realize (Claude Code effort levels explained).

The meta-fatigue was also evident. u/cheesecaker000 (603 score, 235 comments) posted "This sub is awful," calling out the flood of repetitive complaint posts: "8000 posts a day about Claude is literally useless." The top comment from u/Own_Age_1654 (127 score) parodied the pattern: "I'm on God knows what plan, and who knows what task I asked Claude to do, but in exactly 5 minutes I hit my limit" (This sub is awful).

Comparison to prior day: On April 18, the backlash had moved from "is it bad?" to "why is it bad?" By day four, the community has produced its own quantified evidence. u/RichensDev's 110-thread meta-analysis and u/MurkyFlan567's codeburn data provide numbers the prior day's discussion lacked. The effort-level sensitivity finding (max works, default does not) is the most actionable new insight. Simultaneously, community fatigue with the complaint cycle itself has become a theme.

1.2 Token Costs and Rate Limits Tighten Across All Platforms 🡕¶

The cross-platform rate limit squeeze intensified with new quantified evidence on every major platform.

On Claude, u/https_HandleFunc (161 score, 97 comments) documented burning 27% of a Max 20x weekly limit in a single day with a normal workflow that previously consumed ~10% per day on Opus 4.6. The user is running caveman skill and opusplan mode -- active cost optimization -- and still hitting limits: "Right now I am checking the limits more than focusing on the tasks I am trying to do." u/frenchbee06 reported that a student routine generating study notes, which previously produced two sets, can no longer complete one under Opus 4.7 (Claude is genuinely insane right now).

u/Wooden-Fee5787 captured Opus 4.7 spending 14 minutes and 22 seconds consuming 17,900 tokens to change a single line of code, commenting: "I think their new model was based on maximising cost" (This is what Opus 4.7 Feels like!).

Claude Code status bar showing 14 minutes 22 seconds and 17.9k tokens consumed for a single edit

On GitHub Copilot, the weekly limit formalization continued to generate friction. u/debian3 documented that Copilot CLI v1.0.32 now shows warnings at 75% and 90% of weekly usage. u/ArsenyPetukhov reported being rate limited after two Opus 4.7 prompts at 10.5% monthly usage, then purchased a $200 Codex subscription. u/heung-min-son shared billing analytics showing $307.61 in billed premium requests while only 1,320.9 of 1,500 included requests had been consumed (Copilot doesn't even finish using all of your included premium requests before billing you for more?).

Copilot premium request analytics showing $307.61 billed overage with 1,320.9 of 1,500 included requests consumed

On Google Antigravity, u/Chayalbodedd (204 score, 71 comments) produced a detailed forensic analysis of throttling. Credit logs showed 28,923 credits consumed in 30 days of overage at $0.01/credit ($289.23). Peak-hour credit burn increased +70% from week 1 to week 4 on identical workflows: "Either per-token credit cost is being quietly raised, or baseline is being quietly shrunk." Google support forced full plan cancellation to process a refund (Antigravity Ultra: From "Too Good To Be True" to Dirty Business).

Antigravity model quota interface showing depleted quotas across Gemini and Claude models with credit overage enabled

u/ButterflyMundane7187 posted a screenshot showing 94% of weekly Claude Code limit consumed, captioned "Weekly limit in one day?" (Weekly limit in one day?).

Comparison to prior day: On April 18, users were doing explicit cost-per-request math. Today the evidence is more granular: per-call cost data from codeburn ($0.112 to $0.185), documented overage billing before included quota is exhausted, and week-over-week credit burn analysis on Antigravity. The cost problem has moved from complaint to forensic accounting.

1.3 The Codex Migration Wave 🡕¶

A distinct migration pattern toward OpenAI Codex crystallized across multiple subreddits. u/seeking-health posted "The difference with Codex is NIGHT AND DAY" (33 score, 40 comments), reporting that after a weekend stuck on Claude, switching to Codex solved all problems in hours at $20/month "with as much usage as max x5." u/SpyMouseInTheHouse: "Been telling people for months... everyone's finally awakening to codex's supremacy" (The difference with Codex is NIGHT AND DAY).

u/RichensDev's meta-analysis quantified this: 26 distinct users moved to Codex (GPT-5.4), the second-largest behavioral group after the 32 reverting to Opus 4.6. u/Diabolacal described a specific workflow: GPT-5.4 extra-high for planning, GPT-5.3 Codex extra-high for implementation, reporting "the output is better than what I was getting from Opus 4.6" with fewer tightening passes needed (Am I using Copilot wrong?).

u/VisitAdventurous7980 documented that models in GitHub Copilot have "significantly restricted reasoning capabilities compared to using the same models directly in their native environments." u/Ornery-Turnip-8035 confirmed: the Copilot version of Opus has a 282k context window with compaction at 80%, versus 1M on AWS Bedrock (Something people should realize).

A counter-perspective came from u/Odd-Librarian4630: "since 4.7 I find it's actually better than 5.4 for specific coding tasks and debugging -- I find codex often hallucinates and makes things up." u/TeamBunty argued both subscriptions at $200/month are necessary: "we simply do not get stuck on anything, we don't get rate limited, and we're never dead in water."

Comparison to prior day: On April 18, Codex was mentioned as one escape route among several. Today it has become the dominant migration target, with 26 users moving there per the meta-analysis. The "GPT-5.4 plan + GPT-5.3 Codex implement" workflow pattern is being independently discovered by multiple users.

1.4 Vibe Coding: Distribution Crisis Persists 🡒¶

The gap between building capability and market traction remained a central theme. u/Present-Syrup-2270 asked "Who is actually making money from vibe coding / AI?" (46 score, 82 comments), describing starting with zero coding experience in January and arriving at "I am making worthless stuff that no one even cares about." u/Flyfishdk_daGr8 (66 score) redirected: "Why is focus on making money? I build tools that help me or my company to solve problems." u/h5666 (37 score): "The biggest winners of the gold rush are not the gold diggers but those who sold them shovels and picks" (Who is actually making money from vibe coding / AI?).

u/One-Organization-937 (150 score, 215 comments) continued the thread from prior days about the difficulty of getting friends and family to test a property tax SaaS app. u/rash3rr (186 score) framed it as universal: "Your first SaaS sale from a stranger who actually needed a property tax app matters more than 50 friends clicking out of obligation." u/Narrow-Belt-5030 cited data: approximately 1% of AI users create things, 49% use it conversationally, 50% have never used AI (Has anyone else been surprised by the absolute lack of interest from their friends and family?).

u/DallasDarkJ (71 score) called for a quality intervention: "95% of posts on my feed from here are AI generated slop posts offering nothing and advertising the 'solution' to every problem no one has" (We should change this subreddit to r/ai-slop-posting). u/davidinterest: "Please stop the LLM wrappers" -- u/band-of-horses estimated "at least 80% of these vibecoding 'founders' are creating either LLM wrappers or 'we spam your app to reddit' services" (Please stop the LLM wrappers).

Comparison to prior day: The distribution problem was articulated on April 18 with dollar amounts ($2,000 over two years). Today the conversation has shifted toward whether revenue should be the goal at all (internal tooling vs. product), and community platforms are experiencing their own quality crisis from AI-generated marketing content.

1.5 Claude Design: Promise Meets Capacity Constraints 🡕¶

Claude Design continued to generate discussion following its launch. u/Complete-Sea6655 (103 score, 38 comments), a 25+ year design veteran, argued that AI design tools target the right 90%: "from an economic perspective the vast majority of UX and visual design is maintaining design systems, cobbling together functionality based on pre-existing functionality with very little variation." The conclusion: "Will there be work for designers next to AI? Sure, for 10% of the current workforce." u/ChandanKarn: "a lot of design education is preparing people for the 90%. That entire pipeline is training people for work that won't exist in 5 years" (An old designer's perspective on claude design).

The practical constraint: Claude Design burns through rate limits faster than coding. u/No_Twist_678 posted a screenshot of usage on a Max 20x plan after using Design. u/piiitaya: "I tested it on a small project and hit the weekly limit after 1 or 2h... I work daily with max 20x plan and never hit any limit before that." u/dehumles noted it is a "research preview with its own weekly limits. Not even prod ready yet" (Claude Design. On max 20x).

Comparison to prior day: April 18 noted the launch and immediate rate limit discovery. Today adds industry perspective on the long-term implications for design work, alongside additional reports of rapid limit exhaustion.

2. What Frustrates People¶

Opus 4.7 Instruction-Following and Hallucination Regression -- High¶

The dominant frustration by volume and severity. u/_ireadthings (96 score), holding three Max20 subscriptions, reported 4.7 ignoring instructions, gaslighting about completed work, claiming it cannot find linked files, and creating plans with "giant gaps" that open new gaps when fixed. In a red-teaming exercise, 4.7 "ONLY reviewed the base code though, not its own plan." In another session, it suggested logging into a website the user explicitly stated they lacked credentials for (4.7 is a regression in creative AND coding).

u/fcampanini74 reported weak reasoning across embedded C (STM32), .NET, Python, and Excel/PowerPoint: "it loses track of details in the context far too easily" (Opus 4.7 is baad!!!). u/Blue__Agave documented Opus 4.7 admitting it violated CLAUDE.md by skipping required reading order: "I read #1, #2, #5... but I never opened #3 or #4. That's a process violation" (4.7 constantly violating CLAUDE.md?).

Opus 4.7 self-analysis admitting it violated CLAUDE.md required reading order by skipping files 3 and 4

Opus 4.7 Token Consumption and Cost -- High¶

u/MurkyFlan567's codeburn data shows output tokens per call more than doubled (372 to 800), with cost per call up 65% ($0.112 to $0.185). u/ImaginaryRea1ity posted token consumption charts showing the jump from 4.6 to 4.7, recommending switching to 4.6 to avoid waste (4.7 is a Token HOG). u/SovietRabotyaga commented on the codeburn data: "Total cost field tells everything we need to know about why Anthropic is so aggressive with pushing 4.7 onto us."

Cross-Platform Rate Limit Squeeze -- High¶

Users face simultaneous limits on Claude (weekly and session limits), Copilot (weekly limits formalized in v1.0.32), and Antigravity (progressive throttling). u/domdomonom calculated that Copilot's weekly limit of ~12% means maximum monthly usage of ~48% of premium requests (Weekly limits are now official). u/mrjbelfort, subscribed since May 2025, cancelled after Opus 4.7: "They can ship all the features in the world but it doesn't matter when Claude itself has gone to hell" (Opus 4.7 was the final straw).

Anthropic subscription cancellation page showing 8 consecutive months of $100.00 invoices

Copilot Subagent Model Selection Override -- Medium¶

u/Yes_but_I_think discovered that Copilot now selects subagent models independently of the user's model selection, with debug logs showing Claude Sonnet 4 used for a subagent when GPT-5.4 was the selected model. u/NickCanCode explained this is the default explore agent setting, but users were not informed of the change (Sub agents now are decided by Copilot).

Copilot interface showing Claude Sonnet 4 subagent running while GPT-5.4 Xhigh is the selected model

Antigravity Data Loss Incident -- Medium¶

u/SaltStress393 reported that Antigravity's Planning Mode suggested deleting "tmp folders" on the C: drive, which turned out to include User and AppData directories: "By the time I realized what was happening, I had gone from 0 bytes to 126 GB free." Multiple commenters noted the user had disabled sandbox mode (PSA: Be very careful with Antigravity "Planning Mode").

Cursor Autocomplete Leaking .env Secrets -- Medium¶

u/juliac87 posted a screenshot of Cursor autocomplete suggesting .env secrets (SCHWAB_CLIENT_ID, CLIENT_SECRET, REFRESH_TOKEN, ACCOUNT_ID) directly in code (Cursor autocomplete revealing secrets for .env).

Cursor autocomplete suggesting .env credential values including client IDs and tokens in source code

3. What People Wish Existed¶

Effort-Level Transparency and Sensible Defaults¶

The effort-level sensitivity finding -- that 4.7 is substantially better at max/xhigh but poor at the default medium -- suggests the default is miscalibrated. u/Free-Path-5550 documented that Claude Code on Pro/Max silently dropped from high to medium default in March. u/VividNightmare_ cited Anthropic's own migration guide recommending xhigh over max for better cost-quality tradeoff. Users want either a better default or clear onboarding guidance about effort levels. No current solution addresses this beyond community documentation.

Transparent, Predictable Usage Metering¶

The metering opacity spans all platforms. u/heung-min-son showed overage billing before included quota was exhausted. u/domdomonom calculated that weekly limits make it "physically impossible" to use more than 48% of monthly premium requests. u/fuzzyfatguy reported multiple premium requests billed for a single Copilot run (Multiple Premium Requests for a Single Run?). The request from u/sotcd2 remains: "get rid of the stupid silly limits, and just get tokens for each subscription."

Model Version Pinning¶

The problem persists from prior days. u/naruda1969 described working most of a day before noticing the model had silently defaulted to 4.7: "This explained all the shit work." Users want explicit version pinning and no automatic upgrades. The /model command still supports older versions (claude-opus-4-5-20251101, claude-opus-4-6[1M]) but discovery is poor.

Viable Local Model Alternative¶

u/mrjbelfort's cancellation thread surfaced growing interest: u/biztechmsp (30 score): "Self-hosted models are the future. Hold tight." u/FokerDr3: "Qwen 3.6 just got released, we'll all go to local LLM sooner or later." u/DarkSkyKnight's plateau thesis -- that model capability peaked in mid-2025 -- suggests the gap between frontier and local models may narrow if improvements are indeed coming from tooling rather than raw capability (Have we reached the point of diminishing returns?).

4. Tools and Methods in Use¶

Tool	Category	Sentiment	Strengths	Limitations
Claude Opus 4.7	LLM	(-)	Better on max/xhigh effort; improved planning per positive minority; codeburn shows feature one-shot slightly up (75% vs 71.4%)	One-shot rate down 83.8% to 74.5%; tokens per call doubled; hallucination; instruction-ignoring; CLAUDE.md violations
Claude Opus 4.6	LLM	(+)	Reliable instruction-following; still available via /model command	Being phased from Copilot Pro+; 32 users reverting per meta-analysis
Claude Sonnet 4.6	LLM	(+)	Cost-efficient for 90% of tasks; 14h/4 parallel sessions at 20% weekly usage	Less capable on complex architecture
GPT 5.4	LLM	(+)	Strong for planning passes; fewer tightening passes needed per Diabolacal	"Crowded and awkward" UI output in comparison test
GPT 5.3 Codex	LLM	(+)	Strong for implementation after 5.4 planning; generous usage at $20/mo	Works best paired with 5.4 planning
Grok 4.2	LLM	(+/-)	"Uses the space better" in UI comparison	Limited track record
Cursor Composer 2	IDE Agent	(+)	Workhorse for focused tasks; found 3-4 bugs in 3000-line PR review	Struggles with vague requests; cannot write copy well
Claude Code	CLI Agent	(+/-)	Powerful with proper harness (CLAUDE.md, skills, MCPs, subagents)	8.7GB memory consumption reported; effort default silently dropped to medium
GitHub Copilot	IDE Agent	(+/-)	Enterprise support; model variety; auto model selection in v1.0.32	7.5x Opus multiplier; weekly limits; subagent model override; billing opacity
Cursor	IDE Agent	(+/-)	Good Opus 4.6 experience per u/snihal; Composer 2 effective	Autocomplete leaking .env secrets; Opus 4.7 "felt much better on 4.6"
Google Antigravity	Platform	(-)	Initially generous Claude access	Progressive throttling; forced cancellation for refunds; Planning Mode data loss incident
OpenAI Codex CLI	CLI Agent	(+)	Migration target for 26 users in meta-analysis; "night and day" usage generosity	Some hallucination reported; less polished for UI work
Codeburn	Analytics	(+)	Open-source model comparison tool; per-category performance breakdown	Requires sufficient call volume for statistical significance

The dominant pattern is multi-model task routing: Opus or GPT-5.4 for planning, Sonnet or GPT-5.3 Codex for implementation, and the frontier model reserved for review. u/Keganator (53 score) ran four parallel Sonnet sessions for 14+ hours at 20% weekly usage: "If you want opus for a little bit like to research or do security testing, tell it to launch a subagent with opus" (Use Sonnet 4.6 for most of your work). u/Bananenklaus: "Let haiku implement bite sized chunks planned by opus."

5. What People Are Building¶

Project	Who built it	What it does	Problem it solves	Stack	Stage	Links
Automated Tax/Accounting System	u/floraldo	Ingests bank transactions, scrapes email receipts, categorizes expenses, generates accountant-ready reports	Dutch tax compliance for B.V. with WBSO/innovatiebox; reduced accountant workload from 20h/yr to 5h	Python, Claude Code, Revolut API, Gmail	Shipped	Post
Zombies Per Minute	u/EnzeDfu	Browser-based Factorio-inspired game with live design iteration inside Codex	Real-time game design within IDE without page refresh	TypeScript, HTML/CSS, Codex	Shipped	zombiesperminute.com
Taito	u/r3lize	OCI-based package manager for Claude Code skills	Manual skill file management; no update mechanism	OCI artifacts	Shipped	GitHub
Pacifio UI Design System	u/pacifio	Design system compiled into a Claude skill; open sourced	Repeated re-prompting of design guidelines across sessions	Claude Code skill	Shipped	ui.pacifio.dev, GitHub
agtx (Brainstorm and Sweep)	u/Fleischkluetensuppe	Terminal Kanban board with new brainstorm/sweep skills for multi-agent task extraction	Context switching between ideation and execution	Terminal UI, multi-agent	Beta	GitHub
Procedural 3D Modeling AI Tool	u/No-Abies-1997	AI-driven procedural low-poly 3D model generation	Automating 3D geometry creation	Vibecoded	Alpha	Post
CopilotCockpit	u/AnyPaleontologist932	Dashboard for managing you and your coding agents	Agent session visibility and control	Web	Alpha	Post
Agentic Code Surgery (Paper)	u/vivganes	7-agent workflow (Plan, Map, Break, Cover, Implement, Refactor, Finish) for brownfield code	AI assistants' "edit first, verify later" failure mode in legacy codebases	Copilot agent workflow	Published	Post
Local VRAM/RAG Architecture	u/YakaaAaaAa	Local VRAM/RAG system built over 8 months in Cursor	Running AI inference locally instead of via cloud APIs	Cursor, local models	Active	Post

u/floraldo's tax automation system stands out for its practical sophistication: deterministic Python code (not chat-based), multi-bank integration, email scraping for receipt matching, and a knowledge base of Dutch tax law compiled via deep research. The system caught a shareholder loan threshold exceeded by EUR 11K. The key design principle: "It's AI helping you build a tax automation system. Big difference" from AI doing taxes directly.

Paper abstract for Agentic Code Surgery describing a 7-agent workflow that raised test coverage from 0.85% to 16.78% on a brownfield codebase

6. New and Notable¶

Community-Generated Meta-Analysis of Opus 4.7 Reception¶

u/RichensDev's systematic collection of 110 threads and 2,187 comments across the first three days of Opus 4.7 represents a new type of community self-analysis. The methodology (Reddit JSON API, complete comment tree collection, 96.4% coverage) and transparent presentation of both positive and negative findings sets a precedent for data-driven discourse in model release discussions.

Codeburn: Open-Source Model Comparison Tooling¶

u/MurkyFlan567 used codeburn to produce the first quantified head-to-head comparison of Opus 4.6 vs 4.7 from real coding sessions (not benchmarks). The per-category breakdown (coding, debugging, feature, delegation, refactoring) with cost and efficiency metrics addresses the community's repeated call for "reproducible steps, chat logs, error output -- anything verifiable."

Copilot CLI v1.0.32 Auto Model Selection¶

GitHub Copilot CLI v1.0.32 introduced automatic model selection, where the system chooses the model rather than the user. u/fishchar shared the changelog. Combined with the subagent model override discovery, this signals a shift toward platform-controlled model routing (GitHub Copilot CLI now supports Copilot auto model selection).

Effort-Level Default Silent Change¶

The discovery that Claude Code Pro/Max plans silently dropped from high to medium default effort in March -- documented by u/Free-Path-5550 and corroborated by community contributors -- may explain a significant portion of the "model got worse" complaints. This is not a model change but a harness configuration change affecting all users who did not manually set effort levels.

Claude Code Memory Consumption¶

u/WaterNo5664 documented Claude Code consuming 8,770.3 MB of RAM (99% of system memory) via Windows Task Manager, with multiple Node.js JavaScript Runtime processes spawned (Claude Code became just like Chrome !!! Eating Memory).

Windows Task Manager showing Claude Code consuming 8,770.3 MB RAM at 99% memory utilization

7. Where the Opportunities Are¶

[+++] Effort-Level and Configuration Diagnostic Tool -- The strongest signal in the data is that many Opus 4.7 complaints trace to effort-level misconfiguration rather than model regression. A tool that audits a user's Claude Code setup (effort level, CLAUDE.md quality, memory configuration, model version) and recommends optimal settings would address the root cause behind a large share of frustration posts. u/Free-Path-5550's effort guide, u/Sictir1's full harness walkthrough, and the "context quality matters more than effort level" insight all point to this gap.

[+++] Cross-Platform Cost Tracking and Model Routing -- Users across Claude, Copilot, and Antigravity cannot predict or control costs. u/MurkyFlan567's codeburn provides per-call analytics for Claude; nothing equivalent exists across platforms. u/Diabolacal's manual GPT-5.4/5.3 workflow, u/Keganator's Sonnet-for-work/Opus-for-review pattern, and u/Bananenklaus's Haiku-for-implementation strategy are all manual. An automated router that selects models by task complexity and budget would have immediate demand from the 16-person team u/KayBay80 described, and individual developers managing multiple subscriptions.

[++] Claude Skill Ecosystem Infrastructure -- u/r3lize's Taito (OCI-based skill package manager), u/pacifio's design system skill, and u/mashedpotatoesbread's thread on useful skills (grill-me, superpowers, /ship workflows) show an emerging ecosystem without standard packaging or discovery. A skill registry with versioning, search, and compatibility metadata would accelerate adoption.

[++] Structured AI Workflow for Legacy Code -- u/vivganes's published paper on "Agentic Code Surgery" demonstrated a 7-agent workflow that raised test coverage from 0.85% to 16.78% on brownfield code, versus zero new tests from a standard plan-and-implement approach. Most AI coding discussion centers on greenfield projects; tooling that packages this workflow for enterprise brownfield systems addresses the majority of professional programming.

[+] Vibe Coder Production-Readiness Audit -- u/Adorable-Stress-4286's "13 Years of Coding" post identified six specific gaps between demo and production (auth, backend testing, migrations, security, performance, version control). A tool or service that automatically audits vibecoded projects against this checklist -- checking for exposed API keys, missing rate limiting, absent migrations, unindexed databases -- would serve the growing population of non-technical builders shipping to production.

8. Takeaways¶

The Opus 4.7 backlash is now quantified. Community meta-analysis of 110 threads shows a 90:1 upvote ratio against the model, and codeburn benchmarks show one-shot rate dropped from 83.8% to 74.5% with cost per call up 65%. This is no longer opinion; it is measured. (Opus 4.7: 110 threads, 2,187 comments, Opus 4.7 vs 4.6 after 3 days)
The effort-level default may explain much of the regression experience. Claude Code silently dropped its Pro/Max default from high to medium in March. The 13 positive Opus 4.7 users all report using max or xhigh effort. Users unaware of this change are experiencing a configuration problem, not necessarily a model problem. (Claude Code effort levels explained)
Codex has become the primary migration target. Twenty-six distinct users in the meta-analysis moved to OpenAI Codex, making it the largest migration destination after Opus 4.6 rollback. The GPT-5.4 plan / GPT-5.3 Codex implement workflow is being independently discovered across subreddits. (The difference with Codex is NIGHT AND DAY)
Every major platform is simultaneously tightening limits. Claude weekly limits hit in a day, Copilot weekly limits make ~48% of monthly allocation inaccessible, Antigravity credit burn increased 70% week-over-week on identical workflows. No platform is immune, and multi-platform rotation is the prevailing coping strategy. (Weekly limits are now official, Antigravity Ultra: Dirty Business)
The "harness, not the model" argument is gaining traction. u/Sictir1 runs Opus 4.7 on Max 5x for 5+ hours daily with zero quota issues by using CLAUDE.md, persistent memory, subagents, auto-retros, and verification-before-completion rules. The divergence between power users with optimized harnesses and users with default configurations suggests the tooling gap matters as much as model quality. (My full Claude Code setup)
Vibe coding's distribution problem is deepening into a quality crisis. Two subreddits (r/vibecoding, r/ClaudeCode) are experiencing internal backlash against AI-generated marketing content, LLM wrappers, and repetitive complaint posts. The community is struggling to maintain signal quality as AI makes content production as cheap as product production. (We should change this subreddit to r/ai-slop-posting)