Deployed Engineer — Technical Owner of Five Enterprise Accounts

Since June 2025, I've owned the technical relationship for Fern's largest enterprise customers: Square, Cohere, Anduril, Intercom, and ElevenLabs. Each customer need flows through one pipeline: end-user reports a bug → I investigate and diagnose → I fix the underlying generator in the fern monorepo → the fix ships to the customer's published SDK. I own that loop end-to-end.

$583K
SDK ARR
(5 accounts)
25%
of All SDK
Revenue
400+
PRs to
Fern Repos
381
Support Tickets
Resolved
Account Impact
Ad-Hoc Sales & Expansion Support

Beyond the Five Accounts

Supported 100+ customers total through on-call rotations and ad-hoc support. 381 Pylon tickets resolved across the full customer base.

Including Payabli, Alchemy, Twilio, Deepgram, Vectara, Speechify, Pinecone, and more.
Incidents & Urgent Escalations
Remote Generation Outage Feb 25, 2026

Full outage of remote SDK generation. Ran point as on-call: triaged root cause (ECS failure reading API registry), managed the status page on incident.io, coordinated the fix, de-escalated. Simultaneously juggling other open on-call tickets.

SDK Version Number Incident Feb 26, 2026

SDK generation producing wrong version numbers — affecting Candid, Humansignal, and other customers. Identified that code I'd written for remote/local parity had a gap, shipped the fix (fern#12872), coordinated testing with teammates while simultaneously in a Cohere customer meeting.

TypeScript README Generation Failure Mar 9, 2026

TypeScript SDKs generating empty READMEs after a Docker base image change. Bisected the regression to the specific generator version, confirmed it reproduced in both local and remote modes, spun up Devin to fix, worked through dinner to keep it moving.

Context-switching under load Ongoing

Regularly deprioritized planned work to handle urgent customer escalations. Example: pulled off Square SDK work to urgently update Cohere's TypeScript SDK — "Got pulled into updating cohere's ts sdk since they seem to want that urgently. Will be revisiting the square sdk checks passing tomorrow." Weekend catch-up sessions to compensate for on-call interruptions during the week.

Flagging Problems & Driving Solutions
On-call handoff ownership gap Mar 2026

Flagged a systemic gap: "What happens when large effort bugs are uncovered from an on-call rotation? Who owns these? Who owns saying to the customer 'We aren't going to support this soon'?" Referenced the SPS Commerce $96k ARR thread as a concrete example. Directly led to Alex formalizing the handoff process.

Weekend on-call coverage gap Mar 2026

Noticed the sales eng on-call schedule wasn't active on weekends — customer issues were falling into a void until Monday. Flagged it, Chris agreed "we can't be ignoring things on weekend." Got myself admin access to Pylon and incident.io to design and implement the fix.

PR review policy for AI-authored code Feb 2026

Initiated an @here thread in #general-engineering asking what the team's actual practice was around self-approving AI-authored PRs. Created a data-gathering poll (behavior, not opinions). Got substantive responses from Alex, Patrick, Niels, Aditya, Sandeep. Team aligned on "100% confidence = self-approve, otherwise get human eyes."

Internal response time SLAs Feb 2026

Proposed implementing internal response time SLAs with incident.io integration so customer alerts don't get lost in Slack noise. Scoped the right questions: do SLAs apply to on-call only or account owners too? What about working hours?

Version incident → registry version lookup system Feb 2026

Turned the version number incident into permanent infrastructure: designed and shipped registry version lookups across all SDK languages (npm, PyPI, Maven, NuGet, RubyGems, crates.io, Go proxy) with private registry auth and GitHub release fallback (fern#12905, fiddle#617). Incident → systemic fix.

SSE multiline parsing — latent cross-language bug Aug 2025

Fixed Go SSE multiline data field parsing (fern#8604, fern#8630), then proactively flagged: "I think they may be broken in other languages too, but nobody has noticed because most SSE services return data in a single line." Identified a latent bug across the full generator matrix before any customer hit it.

Multi-auth test scaffolding Nov 2025

Discovered multi-auth wasn't working for Anduril. Built the integration test fixture (fern#10635), then assigned per-language fix tickets to the right people (Java→Tanmay, Python→Thomas, Go→Patrick). Created the scaffolding for others to fix.

Self-hosted docs deployment — structured remediation plan Oct 2025

Anduril's self-hosted docs had recurring deployment failures. Summarized the root causes (timeouts, concurrent pod errors, no-downtime breakage), then wrote a prioritized attack plan: shared AWS env → container startup fix → deployment time under 5 minutes → concurrent version errors. Drove deployment time from 10+ minutes to under 5.

Python README release — blast radius thinking Jun 2025

Found that a Python generator release broke README generation. Immediately asked: "Should we recall the release versions? Is there a way to only mark the python generator subpackage as recalled?" Thinking about customer impact and rollback strategy, not just the fix.

What I'm Investing In Now