The Hypothesis
There's significant value trapped in government processes. In a perfect world, we'd have AI-assisted reviews for every permit type, available 24/7. Humans stay in the loop—lending domain expertise, identifying where intervention is needed, coding their knowledge into systems.
The specific opportunity: NYC business permit approvals, specifically ALT-2 permits (Interior Alterations for commercial spaces—cafes, offices, salons). These are the most frequently filed permit type in NYC.
The timing: Zohran Mamdani's tech-forward administration receptive to new approaches is in early days of governance. His Executive Order #11 called for reviewing all city regulations to reduce the "Friction Tax" on small businesses—creating an opening for innovation.
Initial assumption: Permit delays create a "wait tax" worth billions to business owners. While permits are pending: businesses pay rent on closed spaces, employees may be on payroll without revenue, and opportunity cost compounds daily (~$400/day in carry costs).
With an assumed ~47-day average approval time and ~50,000 permits filed annually, this looked like massive trapped value. A talented data scientist could calculate the value locked in each business by zip code, business type, and square footage—the potential impact seemed enormous.
The belief: City government should serve citizens and businesses. Help them open ASAP. Publish requirements with clear expectations. If every municipality open-sourced their rules and published them with annotations and links (like Genius for government regulations), technology could unlock this. Regardless of political affiliation, race, or economic status—we should build government to suit its people.
The Build
Phase 1: Ideation (Gemini voice mode - 1-2 days)
Used Gemini conversationally to brainstorm ways a municipality could leverage tech to positively impact constituents. Looking for low-hanging fruit where AI could optimize processes and reduce wait times.
Workflow:
- Started conversations in voice mode
- For longer thoughts: switched to keyboard to type, then clicked microphone to have Gemini transcribe
- Let Gemini respond in text
- Clicked microphone again for "shallow voice mode"—asked it to respond verbally
- Continued same chat over 1-2 days
End of Phase 1: Asked Gemini to summarize the conversation and opportunity assessment. Specifically focused on ALT-2 Business Permits in NYC—chosen because I lived there and the new administration appeared receptive to innovation.
Lesson learned: Context window degradation likely reduced quality of later recommendations. When using voice for extended ideation sessions, periodically start fresh conversations to maintain quality.
What didn't work: Trying to test voice mode while driving. Not particularly efficient—need focused time for this kind of exploration.
Phase 2: Data Validation & Analysis (Claude chat)
Moved from Gemini to Claude for deeper data validation before building anything.
Used Claude to pull from NYC Open Data and analyze ALT-2 permit processing times. This is where assumptions started to crack.
Phase 3: Prototyping (Claude Code + Bolt.new - 1 day)
Tools progression:
- Claude chat - Data analysis, validation, strategy
- Claude Code in VS Studio - Building, file management, structure
- Bolt.new - Rapid front-end generation (first time using this tool)
Timeline:
- Total project: 1-2 days
- Bolt.new: Working prototype in first session
- Published: Same day as initial build
- Iterations: Back and forth between Claude Code (building) and Claude chat (explanations, GitHub backup guidance)
Cost: All tools on paid tier ($20/month each: Gemini Pro, Claude Pro, Bolt.new). Used majority of Bolt allocation in first day.
The test: Could someone with operator experience (not deep coding background) ship working prototypes using AI tools?
Result: Yes—published working site in 1 day.
What the Data Revealed
Initial hypothesis: ~47-day average approval time, ~50,000 permits/year, massive "wait tax" on small businesses.
Reality after Claude's analysis:
- 75% of ALT-2 permits process in <2 weeks (mostly renewals, no problem)
- 25% averaged 95 days (but reasons unclear)
- Affected volume: ~12,500 businesses/year, not 50,000
Critical feedback from Drew (data scientist friend): "Are businesses actually losing money while waiting?"
Good point: In Manhattan retail, landlords often pay for Tenant Improvements as part of lease negotiations. Business doesn't move in or pay rent until build-out is completed.
My realization: Lease structures vary significantly. Manhattan retail ≠ outer borough mom & pop shops. Sophisticated commercial tenants ≠ small bodegas. Mamdani's focus is likely outer borough small businesses where the tenant pays for build-outs and bears the cost of delays.
The data trust issue: Initial data returns told one story, but I'm not confident enough in the conclusions. Used Claude to pull from NYC Open Data, but didn't independently validate the analysis deeply enough. The 25% with 95-day delays might be due to physical world constraints—permits tabled because the applicant needs to make physical alterations (ADA compliance, structural corrections) before approval.
This is an example of where your "flight heading" could be a few degrees off because you trusted an assumption or decision made by AI without stress-testing it yourself.
Honest assessment: Problem exists but is narrower and more nuanced than initial hypothesis. Can't confidently say this is crisis-level until data is cleaner and independently validated.
What I Learned
Technical Skills
- Gemini voice mode: Excellent for rapid ideation, but context window degrades over multi-day conversations. Start fresh periodically.
- Voice workflow quirk: Switching between typing (for transcription) and clicking mic (for voice response) works, but feels clunky. Need focused environment—not while driving.
- Claude vs Claude Code: Chat interface good for analysis/strategy, Code in VS Studio for actual building.
- Bolt.new: Powerful for non-technical builders. First time using it, produced working front-end immediately. Lower barrier to prototyping than expected. Burns through monthly allocation fast.
- Tool chain that worked: Gemini (ideation) → Claude (validation) → Claude Code (structure) → Bolt (execution)
Process & Judgment
- Seek domain expert feedback early. Drew's landlord/TI insight shifted my understanding of who actually bears the cost of delays.
- Trust but verify. I relied on Claude's data analysis without independent validation. Should have dug deeper into NYC Open Data myself before building.
- Nuance matters. Manhattan retail ≠ outer borough mom & pop. Lease structures vary. One-size-fits-all analysis misses reality.
- Know when to pause. Drew suggested contacting Mamdani's team directly (they're hyper-online and responsive). I decided to wait—"you get one shot at a first impression." Better to refine or pivot before going public.
- When vibe coding, resist 1-shot website copy. People can tell when language isn't dialed in. Clearly articulating problems and solutions requires depth. I approached this with humility—shallow understanding can generate insights, but you need to acknowledge the gaps.
Product Thinking
- Simplification principle matters. Question all requirements. Throughout my career, I've refined workflows to be highly effective and comprehensive. First principle: simplify.
- Modern company-building approach works. Demo the prototype, gain supporters on social media, validate concept before building full product and paying for scale.
- Real unlock isn't technical. The tools are ready. A non-technical operator can now ship working prototypes to test ideas. The bottleneck isn't coding ability anymore—it's idea quality and data validation.
- Collaboration accelerates impact. Linking with developers, data scientists, and domain experts would make projects like this more rigorous.
What This Project Was Really Testing
As much about testing AI tools as testing the business hypothesis.
Questions I was answering:
- Can Gemini voice mode generate quality ideas?
- Can Claude validate those ideas with data?
- Can Claude Code + Bolt produce working websites?
- Can someone with CEO/COO skills (not deep technical background) ship prototypes?
Answers:
- Yes, but context window management matters
- Yes—Claude caught my wrong assumptions (when I dug deeper)
- Yes—working site in 1 day
- Yes—this is the unlock
The Decision
Status: Live prototype, but holding before public promotion.
Why:
- Data confidence too low to pitch publicly
- Needs independent validation of NYC Open Data
- Should segment analysis by geography (outer boroughs) and business type (tenant-responsible vs. landlord TI)
- Physical world constraints vs. bureaucratic delays still unclear
Drew's suggestion: Contact Mamdani's team directly—they're online and accessible.
My call: Wait. Either:
- Deepen data analysis and refine scope, OR
- Pivot to higher-impact civic tech opportunity
Portfolio value: Demonstrates that I validate assumptions, seek expert feedback, and kill or pause projects when data doesn't support them. The workflow proved itself even if this specific problem wasn't the right one.
If I Continued - What I'd Do Differently
1. Deeper data work
- Pull raw permit data from NYC Open Data myself
- Independently validate Claude's analysis
- Segment by borough, business type, lease structure
- Interview small business owners in outer boroughs about actual costs
2. Calculate actual "cost per day"
- Rent paid while closed (varies by neighborhood)
- Employee costs (if applicable—likely not for small shops)
- Revenue opportunity cost
- Segment by zip code, square footage, business type
3. Identify real bottlenecks
- Which permit types actually have systemic problems?
- Where is bureaucratic red tape vs. legitimate review needed?
- What can be automated vs. what requires human judgment?
- Why do 25% take 95 days? Physical corrections needed or process breakdown?
4. Add "prove it" session
There has to be a framework companies use (Anthropic, Y Combinator) for stress-testing core assumptions before building. I was more focused on proving the tools worked than proving the problem existed. Next time: prove the problem first.
What This Project Proved
- I can ship working prototypes in 1-2 days using AI tools
- I validate with data and expert feedback, not just assumptions
- I pause or kill projects when hypotheses don't hold
- I care about impact (serving citizens and businesses), not just building for building's sake
The meta-lesson: Technology should unlock government efficiency. We can and should build systems where humans lend expertise while AI handles routine processing. This specific implementation wasn't the right problem, but the approach and tools work.
The real unlock: The modern way to build—ship prototype, gather feedback, validate problem, iterate or pivot—works. And AI tools make this accessible to operators without deep technical backgrounds.
What's Next
Options being explored:
- Benefits Access Navigator - Help New Yorkers find and claim government benefits (SNAP, housing vouchers, healthcare)—higher impact, millions affected vs. thousands
- 311 Modernization - Block-by-block transparency map showing which neighborhoods get city services fast and which are ignored
- Deeper dive on this project - If right data partner or city contact emerges
Current status: GreenLight NYC remains live as portfolio piece. Demonstrates rapid prototyping, data-driven decision making, and willingness to pivot when evidence demands it.
Built: January 2026
Timeline: 2-4 sessions over 1-2 days + iterations
Tools: Gemini Pro ($20/mo), Claude Pro ($20/mo), Claude Code (VS Studio), Bolt.new ($20/mo)
Status: Live prototype, paused pending deeper validation or pivot decision