DoorDash Used Copilot to Convert Its XCTest-Based iOS Test Suite to Swift Testing
Our take

DoorDash’s migration of a large iOS test suite from XCTest to Swift Testing, guided by GitHub Copilot and robust reliability checks, offers more than a technical case study—it presents a pragmatic blueprint for modernizing legacy codebases in the AI-augmented era. The project, as detailed by engineer Matheus Gois, achieved measurable performance gains without sacrificing stability, a balance many organizations struggle to strike. This isn't about chasing the latest framework for its own sake; it's a response to the tangible friction developers face with older tools, much like the professional grappling with a "needlessly complicated task" via a fragmented SharePoint ecosystem. That related piece underscores a universal truth: when workflows are burdened by outdated structures, productivity suffers. DoorDash’s approach—using AI to handle the syntactic heavy lifting while humans enforce semantic integrity—directly addresses that pain point.
The significance lies in the methodology. Copilot served as a force multiplier, generating initial conversion code at speed, but the safeguards—likely involving rigorous validation, incremental rollout, and performance benchmarking—ensured the migration didn't break existing functionality. This hybrid model respects the complexity of legacy systems while embracing innovation. It mirrors broader industry shifts, such as Anthropic’s recent reinstatement of OpenClaw and third-party agent usage on Claude subscriptions, albeit with caveats. Both stories highlight a maturing relationship with AI tools: we are moving past unbridled enthusiasm into an era of structured, safety-conscious integration. The goal is no longer just to use AI, but to use it wisely within established operational guardrails.
For readers, this case validates several key strategies. First, it demonstrates that AI pair programmers can effectively tackle large-scale, boilerplate-intensive refactoring, freeing human intellect for architectural review and edge-case handling. Second, it proves that performance improvements—faster test runs, reduced flakiness—are achievable without a risky "big bang" rewrite. Third, it signals a cultural shift: modernizing isn't a one-off project but an ongoing practice of aligning with language evolution, as seen in Swift’s own testing framework advancements. The performance gains are a welcome bonus, but the real win is establishing a repeatable process for technical debt reduction in an AI-supported workflow.
Looking ahead, the critical question isn't whether AI will assist in code migration—that ship has sailed—but how we will institutionalize the safeguards that make such assistance reliable. Will we develop standardized validation frameworks for AI-generated code conversions? How do we train teams to audit AI output effectively? DoorDash’s example sets a high bar, showing that the future of maintenance belongs not to the flashiest tool, but to the most disciplined integration of automation and human oversight. As AI capabilities grow, the organizations that thrive will be those that, like DoorDash, pair transformative technology with unwavering commitment to reliability—a lesson applicable far beyond test suites, perhaps even to building AI financial models in platforms like Sourcetable, where accuracy is paramount.

Using Copilot along with strong reliability safeguards, DoorDash migrated their iOS XCTest-based test suite to Swift Testing, thus modernizing a large test suite quickly, safely, and with measurable performance gains, says DoorDash engineer Matheus Gois.
By Sergio De SimoneRead on the original site
Open the publisher's page for the full experience