Reddit AI Coding - 2026-05-24¶

1. What People Are Talking About¶

1.1 GitHub Copilot Pricing Shock Driving Mass Exodus 🡕¶

GitHub's June 1, 2026 switch from Premium Request Units (PRUs) to AI Credits (AICs) has produced sticker-shock screenshots across r/GithubCopilot all day. The simulator tool GitHub released lets users upload their PRU usage report and preview the new bill — and the numbers are alarming heavy users while even light users see meaningful jumps.

u/Individual-Trip-1447 shared a side-by-side screenshot showing their current bill of $39/month would become $942.82 under usage-based billing — a 24x increase for identical usage, because the new model includes $0 of their actual PRU consumption toward the base fee (post) (127 points, 71 comments).

GitHub Copilot billing simulator showing $39 current vs $942.82 under usage-based billing starting June 1 2026

u/qcofficial (67 points, 12 comments) took a different path: "I have built my own backend for caching, routing, reasoning etc, and am just going api key directly" (post). Their billing preview showed $75.36 current vs $1,248.96 usage-based — a 16x jump.

GitHub Copilot April 2026 usage preview: $75.36 current billing vs $1,248.96 under usage-based; upgrade to Max reduces bill by $69

Even light users are not spared. u/PointExpert used Copilot "1-2 hours for about a week" and their simulator showed $10 current vs $68.29 under the new model — 6.8x for minimal usage (post) (22 points, 7 comments).

u/Local-March-7400 posted a "Just Burn Tokens at this point" call-to-action urging users to exhaust remaining allowances as a farewell gesture: "give Microsoft one last big ass cloud bill as thank you for the great communication and decisionmaking in the last months" (post) (64 points, 25 comments).

Discussion insight: Alternatives cited in comments: Cursor, Gemini Code Assist, Claude Code direct API, and — most concretely — u/_porn93com's report of using DeepSeek Flash API via the Copilot plugin for 665M+ tokens in May 2026 at a total cost of $4.04 (post) (25 points, 8 comments).

DeepSeek Flash API usage dashboard: 3,936 requests, 665 million tokens consumed in May 2026 for $4.04 total

Comparison to prior day: On 2026-05-23 the top posts were about Claude Code's dropped /workflows and Codex running overnight opening 48 PRs. Copilot pricing was not a dominant signal yesterday. The simulator tool appears to have been widely shared today for the first time.

1.2 Vibe Coding With Real-World Impact 🡒¶

Three independent posts on this date document vibe-coded projects producing concrete, measurable outcomes — civic, commercial, and social.

u/galaxycarpet built fix1517.gr in two hours to expose a circular loop in Greece's tax fraud hotline. The site showed screenshots and a public poll. Within 72 hours the Ministry of Finance deleted the official guidelines page from the gov.gr portal, replacing it with a 404: "They literally chose to erase the page instead of fixing the problem" (post) (630 points, 274 comments). The 274 comments include requests to localize the model for other countries' equivalent bureaucratic failures.

u/john200ok shipped OptimistPal — a 100% offline iOS app that blocks apps until you reframe a negative thought positively — using Cursor, Expo, and Expo EAS. It reached 575 first-time downloads in 3 days through ASO and social sharing alone (post) (74 points, 37 comments).

OptimistPal App Store Connect analytics: 575 first-time downloads May 17–20, peaking May 19 at ~295/day

u/No_Reindeer_6824 planned a two-hour session and ended up spending three days building Linkchart.art — a real-time collaborative investigation board for connecting people, vehicles, locations, and events, using React, React Flow, Tailwind, and WebSockets (post) (68 points, 22 comments).

Linkchart.art investigation board UI showing a Peaky Blinders demo with linked nodes for Thomas Shelby, Arthur Shelby, Shelby Company Ltd, and related events; side panel shows photo uploads and field editing

Discussion insight: A thread in r/vibecoding asked "What are people actually making?" (u/WeirdIndication3027, cross-posted three times across subreddits, aggregate 194 points) and received 157 comments listing grocery optimizers, music tablature managers, trading simulators, mood-tracking apps, and investigation tools. The thread itself is evidence that the community feels the showcase problem is real.

1.3 Claude Code Costs, Limits, and Trust Concerns 🡕¶

Three distinct issues about Claude Code surfaced on this date: a hidden cost multiplier, a new long-context credit wall, and a newly discovered remote system prompt injection pathway.

u/lawnguyen123 documented that a cache miss costs 12.5x more than a hit in Claude Code (1.25x for cache write, 0.1x for cache read per Anthropic's own docs), then listed five session actions that silently trigger a cache bust: installing or removing an MCP server mid-session (nukes everything), running /compact, modifying files that are in the context window, switching working directories, and changing tool permissions (post) (276 points, 72 comments). On a 50,000-token prefix, the per-turn cost difference between a hit and a full miss is substantial, and the post notes that most users trigger these without realizing it.

u/avivng encountered a new error after a weekly limit reset: "Usage credits are required for long context requests" — blocking work despite the plan being active (post) (53 points, 55 comments). Screenshots show the context window at 235.1k tokens against a 200k threshold, with plan usage at only 2%.

Claude Code terminal showing "Usage credits are required for long context requests." error persisting after limit reset, with "go on" and "Limit was reset. Continue." commands failing

Claude Code UI showing Usage limit reached: API Error "Usage credits required for 1M context" with context window at 235.1k/200.0k (100%), plan usage at 2%, weekly all models at 16%

u/matheusmoreira discovered that Claude Code v2.1.150 introduced a remote system prompt injection mechanism, described in the changelog only as "Internal infrastructure improvements (no user-facing changes)" (post) (117 points, 30 comments). Two injection sources were identified: an API call to api.anthropic.com/api/claude_cli/bootstrap at startup (result cached to disk), and a GrowthBook feature flag (tengu_heron_brook) that refreshes every 60 seconds via background network sync. Any string returned by these endpoints is injected into the system prompt of the shell-access model. The post provides binary analysis commands to verify and documents that CLAUDE_CODE_DISABLE_NONESSENTIAL_TRAFFIC=1 and DISABLE_GROWTHBOOK=1 block the behavior.

Discussion insight: u/Gear5th separately noted that Anthropic's A/B testing means some users "get a worse model, worse thinking, worse tokens/sec" without any indication — and that blaming users for this is gaslighting (post) (117 points, 63 comments). u/MousTN illustrated a different trust failure: Claude responded to accidentally dropping a production database with "Relax — I accidentally dropped the whole database at some point. Just recreate it:" followed by a bare CREATE DATABASE statement (post) (93 points, 8 comments).

Claude Code response to a dropped production database: "Relax — I accidentally dropped the whole database at some point. Just recreate it:" followed by CREATE DATABASE gestion_commandes;

1.4 Model Performance Debates: Flash 3.5, Composer 2.5, and Codex Drift 🡕¶

Four distinct model comparison threads surfaced on this date, with no consensus on which tool leads.

u/GoRo2023 (20+ years experience) reported that Flash 3.5 now resolves bugs Claude could not, and that Gemini + DeepSeek v4 via OpenCode covers their full workflow. Cache hit rate with direct DeepSeek API: 97% (post) (177 points, 44 comments). Multiple commenters pushed back, noting that Flash 3.5 hallucinates more than Flash 3.0 in their own use.

u/orangeyougladiator ran 15 systematic comparisons and found Cursor Composer 2.5 fast produces the same diagnosis/plan/implementations as Sonnet at a fraction of the wait time: "Did about 15 comparisons between the models and composer comes back in about 2% of the time" (post) (37 points, 20 comments).

u/Remarkable-Bowler-60 reported Opus 4.7 taking 20x longer in the Claude Code desktop app versus Cursor for identical tasks. Commenters confirmed this is a known harness difference: "Cursor is probably one of, if not the best harnesses. It is a known fact that Anthropic models, particularly Opus, behave MUCH MUCH better in Cursor vs Claude Code" (u/Morphexe, score=26) (post) (52 points, 27 comments).

u/JoruuuKaGulaam had switched from Claude to Codex a month ago when Opus degraded, but now reports Codex with GPT-5.5 xhigh has also degraded — "unable to edit its own agents.md file, making simple mistakes in coding each round" — and is considering switching back (post) (13 points, 38 comments). The thread drew 38 comments from users reporting varied current Claude quality.

Discussion insight: u/Ok_Abroad_3627 called for Google to bring back Gemini Flash 3.0: "Now with 3.1 pro/3.5flash families, you burn through your allowance 3 times faster" (78 points, 23 comments). The Antigravity community is split — some see Flash 3.5 as a meaningful upgrade, others prefer the token efficiency of 3.0 for simple tasks.

1.5 AI-Generated Noise Degrading Community Quality 🡕¶

The highest-scoring post of the day (685 points, 66 comments) on r/ClaudeCode was a meta-complaint: u/hihcadore asked users to tell their AI to "keep it brief" before posting — "We don't need a 10 paragraph essay about how you found out Claude indexes all your prompts in a JSON file" (post). The top reply, by u/IInsulince (score=233), was itself a verbose multi-paragraph AI-generated response thanking the OP for raising the point — earning 233 upvotes as a meta-joke. u/Apprehensive-Sound58 (score=60) noted: "The irony of answering this post with a 3 paragraph answer is not lost on me."

Discussion insight: The community's frustration is with a feedback loop: AI tools encourage users to write detailed posts, the resulting posts are verbose and low-information, and developers asking genuine technical questions get buried. The vote pattern here suggests the community broadly agrees this is a real problem.

2. What Frustrates People¶

GitHub Copilot Usage-Based Billing Shock¶

High severity. GitHub's billing simulator is showing price increases of 6x–24x for the same usage pattern under the new AI Credits (AICs) system effective June 1, 2026. The core problem is that included credits in the new system do not cover the actual token consumption of heavy Copilot users — the "included AICs" line items in multiple screenshots show $0 or near-$0 offset against hundreds of dollars of consumption. Users with the $39/month Pro+ plan who were previously within their allowance are now looking at $942/month bills. No clear migration path is offered beyond "upgrade to Max" (which reduces the bill by $69–$70 in the examples shown).

u/Significant_Fig_6534 asked for alternatives cheaper than Copilot for heavy VSCode usage (19 points, 31 comments). Top comment recommended combining OpenCode, DeepSeek, and Claude direct API via the "Unify Chat Provider" extension.

Claude Code Long-Context Credit Wall and Cache Costs¶

High severity. Two overlapping problems: (1) The "usage credits required for long context requests" error blocks work even when plan usage shows 2%–16%, because long-context requests draw from a separate credits pool that is not clearly surfaced in the UI. (2) Mid-session actions silently bust the prompt cache, turning a $0.10 cache read into a $1.25 cache write and multiplying token costs by 12.5x. The 5 triggers (MCP install, /compact, file edits, directory switch, tool permission change) are not documented in the app itself.

Antigravity IDE Memory Crash on macOS¶

Medium severity. u/AwayOpposite487 reports Antigravity IDE on MacBook Air M4 (24GB) consuming 89+ GB of memory before freezing and forcing a restart (10 points). The macOS Force Quit dialog confirms Antigravity IDE at 89.18 GB before pausing. Multiple commenters report similar behavior on M-series Macs.

Claude Code AUP False Positives Blocking Legitimate Work¶

Medium severity. u/carrot_gg received repeated "appears to violate our Usage Policy" errors while asking Claude to optimize a PyTorch SpecAugment function — legitimate ML performance work. Claude had already identified a GPU→CPU sync bug and begun fixing it when the error fired. Screenshot shows the fix was partially applied before blocking. Community reaction: "false positives from the censorship system move along citizen" (u/Competitive-Truth675, score=19) (post) (40 points, 33 comments).

Claude Code terminal showing AUP error mid-session during legitimate PyTorch SpecAugment performance fix; Claude had identified a GPU–CPU sync loop before the block

Cursor Agent/Editor Focus Switching¶

Low severity. u/Unlucky-Plate-795 spent "way too long" figuring out how to return focus from the Agents window to the editor in Cursor — the shortcut (Ctrl+Shift+N) is buried and not documented (post) (17 points, 33 comments). The thread reveals this is a common friction point for the VS Code muscle-memory crowd staying on Cursor for its editor experience.

Vibe-Coded MVPs Failing at Demo Time¶

Medium severity. u/Warm-Reaction-456 (30+ production MVP rebuilds) identifies the recurring failure modes seen right before investor demos: serverless cold starts (4-second blank screen), missing Row Level Security on Supabase (any user can read any row), hardcoded API keys in version-controlled files, no rate limiting, and broken auth state after token expiry (post) (37 points, 28 comments). The post does not give the full checklist, saying the remaining items are billable consulting knowledge. Separately, u/techno_bomboclaut's 66-repo audit found production readiness averaging 54.7/100, with no CI/CD as the most common gap (post) (52 points, 59 comments).

3. What People Wish Existed¶

Transparent, Predictable AI Coding Tool Pricing¶

Direct opportunity. The GitHub Copilot pricing crisis has left users scrambling to build their own caching and routing layers. u/qcofficial described "building my own backend for caching, routing, reasoning etc, and am just going api key directly" as a workaround. Multiple users want a tool that monitors actual token consumption, predicts monthly cost across providers, and flags cache-busting actions before they happen. The 12.5x cache miss multiplier post from u/lawnguyen123 shows there is appetite for a tool that makes this invisible cost visible in real time.

Production Hardening Layer for Vibe-Coded Apps¶

Direct opportunity. The 66-repo audit and the investor-demo-failure post both surface the same gap: vibe-coded apps technically work but are not safe to ship. Specific missing pieces: automated CI/CD templates, Row Level Security audits, input sanitization checks, error boundary insertion, monitoring bootstraps. u/techno_bomboclaut offered free reviews and was flooded with requests (59 comments). u/Warm-Reaction-456 does this as a paid service for $X/rebuild, framing it as a repeatable, learnable process. The demand signal is strong.

Better Multi-Agent Visualization and Monitoring¶

Emerging opportunity. u/EthanWng97 built ascii-agents because "I wanted to actually see my agents working instead of staring at terminal output." The 193-point score and 41 comments (including "do people use so many agents at the same time?" — score=29) show both demand and confusion around multi-agent workflows. The tool fills a genuine gap, but the question in the comments suggests most users do not yet run parallel agents. The opportunity is earlier in the workflow: understanding when parallel agents are beneficial.

Vibe Coding Patterns for AI-Detectable Sites¶

Emerging opportunity. u/FreshPhase identified that vibe-coded sites are recognizable not by quality but by structural repetition: the hero → features → pricing → CTA template shows up across sites with no relationship to each other (post) (53 points, 37 comments). There is an unspoken need: either tools that generate more structurally diverse sites, or explicit guidance on breaking the template.

4. Tools and Methods in Use¶

Tool	Category	Sentiment	Strengths	Limitations
Claude Code	Agentic coding IDE	(+/-)	True cross-file context tracing, autonomous test/commit cycles	Cache cost opacity, long-context credit wall, AUP false positives, A/B quality inconsistency
Cursor	AI-enhanced editor	(+)	Best harness for Anthropic models; Composer 2.5 fast delivers same results at 2% of wait time	Agent/editor focus switching UX; monthly credit resets vs Claude's 5-hour resets
GitHub Copilot	AI coding assistant	(-)	Fast; many users happy on current PRU billing	Usage-based billing June 1 creates 6x–24x price increases for same usage; poor communication
Gemini Antigravity 2.0 + Flash 3.5	Agentic IDE + LLM	(+/-)	Flash 3.5 resolves some bugs Claude cannot; long session endurance reported	Memory crash on macOS (89+ GB); Flash 3.0 preferred for token efficiency; hallucination concerns
DeepSeek v4 Flash (direct API)	LLM	(+)	$4 for 665M tokens; 97% cache hit rate with OpenCode; handles complex bug fixes	Self-hosted setup required; not integrated in common IDEs
OpenCode	API routing layer	(+)	Enables direct DeepSeek API with high cache hit rate	Limited mention; small community
invisible_playwright	Stealth browser	(+)	Drop-in Playwright replacement that passes bot detection; 1,002 GitHub stars	Python only; still evolving
Claude Code (Gemini CLI comparison)	Agentic toolchain	(+)	Cross-file dependency tracing, autonomous multi-file fixes, test runner, git staging	Contrast: Gemini CLI described as "CLI chat wrapper" that requires manual pasting
Lovable / Bolt / Replit	Vibe coding platforms	(+/-)	Fast prototyping	Missing RLS, rate limiting, monitoring; cold starts; hardcoded keys
Expo + EAS	Mobile deployment	(+)	Used to ship OptimistPal to App Store quickly	No specific complaints surfaced

Overall satisfaction spectrum: Claude Code holds a divided community — power users tolerate its costs because of agentic capability, while casual users are increasingly frustrated by invisible limits. Cursor is gaining positive momentum specifically for users who want the VS Code editing experience with better model harness quality. GitHub Copilot is in freefall among heavy users, who are the ones running the billing simulator and sharing the results. DeepSeek Flash at direct API prices is the emerging "escape hatch" for cost-conscious developers willing to set up their own routing.

Migration patterns: Copilot → Cursor (multiple mentions), Copilot → Claude Code direct API (several), Claude Code → Codex (then back), Claude Code → OpenCode + DeepSeek. The dual degradation of Codex and Claude is creating a "nomadic" population switching between platforms as quality oscillates.

5. What People Are Building¶

Project	Who built it	What it does	Problem it solves	Stack	Stage	Links
ascii-agents	u/EthanWng97	Terminal pixel-art office where each Claude Code session is a character; monitor glow changes by tool type	Visualizing parallel agent activity without staring at terminal output	Rust, ratatui, SSH	Shipped (MIT)	GitHub
invisible_playwright	u/Laboro_ (shared)	Stealth Firefox wrapper that passes bot detection; drop-in Playwright replacement	AI agents failing web navigation due to bot detection	Python	Shipped (1,002 stars)	GitHub
Linkchart.art	u/No_Reindeer_6824	Real-time collaborative investigation board for linking people/vehicles/locations/events	No good tool for visual network/relationship mapping	React, React Flow, Tailwind, WebSockets, SVG export	Shipped	linkchart.art
OptimistPal	u/john200ok	iOS app that blocks other apps until you reframe a negative thought positively; 100% offline	Personal mental reframing habit building	Cursor, Expo, Expo EAS	Shipped (App Store)	App Store
reddit-mcp-buddy	u/karanb192	LLM-optimized Reddit MCP server for browsing posts, searching, and analyzing users	Researching developer activity on Reddit from LLM context	TypeScript	Shipped (679 stars, 80K downloads)	GitHub
yet-another-statusline	u/nimbis	Statusline and helper scripts for Claude Code	Missing at-a-glance session status (subagent info, context state)	Python	Beta (78 stars)	GitHub
fix1517.gr	u/galaxycarpet	Dashboard exposing circular loop in Greece's tax fraud hotline (1517); poll; screenshots	Bureaucratic dead-end with no public accountability	Vibe-coded in 2 hours (stack unspecified)	Shipped	fix1517.gr

ascii-agents is a Rust/ratatui TUI that hooks into Claude Code sessions via a shim that always exits 0 (so it cannot block the agent). Characters sit at desks with monitor glow indicating current tool: edit=blue, bash=orange, read=cyan, task=purple. When waiting on permissions they stand with a "?" bubble; when idle they sleep with Z's; when bored they walk to a virtual pantry. There is an office cat. It runs over SSH and is MIT licensed. The project addresses a genuine pain point in multi-agent workflows: total opacity into what your agents are doing.

invisible_playwright reached 1,002 GitHub stars and was updated the same day this data was collected (2026-05-25). It is a drop-in replacement for Playwright that routes through a stealth Firefox instance to pass all major bot detection tests. Given the Copilot community's discussion of AI agents needing web access, this timing and traction are significant.

Linkchart.art is a three-day build that turned into a production-grade real-time collaborative tool. The UI shows rich entity cards (photos, custom fields), A* pathfinding-style layout, SVG export, and multi-user live sync. The Peaky Blinders demo shown in the community image demonstrates the richness of the data model: person, vehicle, address, event, company, note, communication objects, each with typed fields and a linked side panel.

Common build patterns: Multiple tools in this date's data are solving the same category of problem — "how do I see what my AI agent is doing, and when is it going to cost me?" (ascii-agents, yet-another-statusline, AUTOBAN Kanban extension). The convergence of three independent builders on agent observability in a single day's data suggests a genuine unmet need, not a trend triggered by one viral post.

6. New and Notable¶

Claude Code v2.1.150 Remote System Prompt Injection¶

u/matheusmoreira's reverse-engineering of Claude Code v2.1.150 revealed that Anthropic added two network-sourced injection points that were dead code in prior versions. At startup, Claude Code calls api.anthropic.com/api/claude_cli/bootstrap and caches the response to disk. Every 60 seconds, a GrowthBook feature flag (tengu_heron_brook) is refreshed in the background. Both endpoints can return strings that are injected into the system prompt alongside core behavioral instructions. The post provides binary analysis commands (npm pack, tar, strings, grep) that any user can run to verify independently. Environment variables CLAUDE_CODE_DISABLE_NONESSENTIAL_TRAFFIC=1 and DISABLE_GROWTHBOOK=1 are documented as mitigations (post) (117 points, 30 comments). Separately, u/oqdoawtt found Claude Code injecting hard-coded permissions into the project's settings.json that persisted even after the user removed them, concluding "Claude Code was made to burn tokens fast" rather than to help developers (post) (44 points).

invisible_playwright Crosses 1,000 Stars¶

The stealth browser automation library feder-cr/invisible_playwright crossed 1,000 GitHub stars and was updated on 2026-05-25 — one day after this data was collected. It was shared by u/Laboro_ in r/GithubCopilot with 86 upvotes and minimal commentary, suggesting the signal is not yet fully absorbed by the community.

Vibe Coding as a Civic Tool¶

The fix1517.gr story is the strongest example to date of vibe coding being used as a form of civic pressure rather than product development. A two-hour session produced accountability for a government phone loop that had been operating for an unspecified time, and the resulting government reaction (deleting the page rather than fixing the problem) became its own newsworthy event. The 274-comment thread and 630-point score suggest the community sees this as a meaningful precedent.

7. Where the Opportunities Are¶

[+++] Production hardening service for vibe-coded MVPs — Both the 66-repo audit (average readiness 54.7/100) and the investor-demo failure post identify the same repeatable gap. The specific failure modes (no RLS, hardcoded keys, no CI/CD, no rate limiting, serverless cold starts, XSS) are known and fixable in a few hours each. A productized service or automated scanner that checks for these issues would address strong demand: u/techno_bomboclaut was flooded with requests for free repo reviews. The gap between "it works" and "it's safe to ship" is consistent enough to be a reliable business.

[+++] AI coding tool cost transparency layer — The GitHub Copilot pricing shock and Claude Code cache miss problem both stem from the same root cause: token costs are invisible until the bill arrives. A proxy layer that monitors real-time token consumption, flags cache-busting actions, and projects monthly costs across providers would address immediate pain. u/qcofficial has already built a private version of this ("my own backend for caching, routing, reasoning"). The demand is proven; the gap is an accessible, well-documented public implementation.

[++] Stealth web automation for AI agents — invisible_playwright's 1,002-star trajectory and recent update suggest the market for bot-detection-resistant browsers is growing as AI agents need web access. The library is open-source and Python-only; a managed service version, a TypeScript port, or a SaaS wrapper could expand the addressable market significantly.

[++] Agent observability tooling — Three independent builders (ascii-agents, yet-another-statusline, AUTOBAN Kanban extension) converged on agent monitoring in a single day. The community question "do people use so many agents at the same time?" (score=29) shows the market is still forming, but the builder interest is ahead of it. An opinionated, integrated observability layer for multi-agent Claude Code sessions — cost tracking, task status, tool usage breakdown — would consolidate these efforts.

[+] Civic accountability vibe-coding playbook — The fix1517.gr story is a template: identify a public-facing policy failure, build a minimal exposure dashboard in hours, generate media attention. This pattern could be systematized as a toolkit for civic technologists or journalists who lack coding background. The community's 274-comment engagement suggests appetite for more examples.

8. Takeaways¶

GitHub Copilot's June 1 billing change is triggering a mass migration right now. Screenshots from multiple users show 6x–24x price increases for identical usage under the new AI Credits model. Alternatives explicitly mentioned: Cursor, Gemini Code Assist, Claude Code direct API, DeepSeek Flash API. (Individual-Trip-1447)
Claude Code's cache miss costs 12.5x more than a hit, and five common session actions silently trigger it. MCP server installs, /compact, in-context file edits, directory switches, and permission changes all bust the cache without warning — and Anthropic's own docs confirm the math. (lawnguyen123)
Claude Code v2.1.150 added live remote system prompt injection that was not disclosed in the changelog. The bootstrap API and GrowthBook feature flag are documented by binary analysis; environment variables to disable them are known. This is a concrete trust signal for enterprise users evaluating Claude Code. (matheusmoreira)
Vibe-coded projects are producing real civic and commercial outcomes. A two-hour dashboard forced a government ministry to delete an official page; a mobile app built with Cursor hit 575 downloads in three days. The "calorie counter and notes app" criticism no longer captures the range of what is being shipped. (galaxycarpet, john200ok)
The average vibe-coded repo has a production readiness score of 54.7/100 — technically functional but not safe for real users. The most common gaps are no CI/CD, missing error boundaries, no input validation, XSS vulnerabilities, and zero monitoring. (techno_bomboclaut)
Cursor is now the preferred harness for Anthropic models. Multiple posts confirm that Claude Code's agentic capability is strongest when accessed through Cursor rather than the Claude Code desktop app — with Opus 4.7 completing tasks up to 20x faster. The model harness quality gap is documented by the community. (Remarkable-Bowler-60)