Why R&D AI Pilots Stall and How to Get Out of Pilot Purgatory
This content was originally discussed in the presentation, From PoC to Production: Bridging the Gap in Scientific Software Development.
Software & AI
Scientific Software Development, Legacy Software Modernization, UI/UX,
Predictive Modeling, Custom Simulations, Web Applications,
Multimodal Knowledge Systems, API Development
Data Systems
Data Engineering, Process Engineering, Data Pipelining and Augmentation,
Workflow Automation and Redesign, Scientific Data Management Systems,
Data Capture Systems, High Volume Data Management, Database Design
Strategy & Design
R&D AI Transformation, R&D Digital Transformation, Strategic Roadmap Development,
Data System Design, Process Analysis
Infrastructure
Technical Upskilling for Scientists & Engineers, R&D Systems Integration,
R&D IT and Data Ops
Core Technologies
Machine Learning, Deep Learning, Baysian Optimization, Generative
Adversarial Networks, Graph Neural Networks
Advanced Modeling & Systems
Reasoning Models, Multi-Scale Modeling, Surrogate Modeling,
Simulation, Image Processing, Agentic AI Systems
Language & Generative AI
Natural Language Processing, Foundation Models, Generative AI,
Large Language Models
Discovery & Development
Property Prediction, Formulation Optimization, Structure Generation,
Materials Discovery, Materials Compatibility
Data Insights
Text Data Mining, Automated Data Analysis, Time Series Analysis,
Multimodal Search, Literature and Patent Search, Dashboards, Data Visualizations
Decision Support
Chatbots, Predictive Maintenance, Preventative Maintenance, AI
Recommendation Systems
A Technical Framework for Materials by Design | View the recording for this timely webinar on Materials by Design for enterprise R&D.
3 min read
Enthought Updated on May 21, 2026
This content was originally discussed in the presentation, From PoC to Production: Bridging the Gap in Scientific Software Development.
AI tool use among physical scientists jumped from 57% to 84% in a single year (Wiley, 2025), and the 2024 Nobel Prizes in both Physics and Chemistry recognized AI-driven research, which is a historically fast turn from cutting-edge technique to mainstream validation. The investment has followed: AI in materials discovery is projected to grow from $536 million today to $5.58 billion by 2034. By most measures, the technology has arrived.
And yet Gartner predicted that over 30% of generative AI projects would be abandoned after proof-of-concept in 2025, an S&P report found that 42% of companies scrapped most of their AI initiatives that year, and the average organization abandoned roughly 46% of proofs-of-concept before they ever reached production. In R&D settings, this failure mode is easy to recognize: the formulation optimizer that only works on one dataset, the property predictor that only one person knows how to run, the Jupyter notebook for processing instrument data that nobody has touched in six months. Models that perform well on public benchmarks and fail immediately when pointed at real company data. This is what is commonly called pilot purgatory, and it is where most AI investment quietly disappears.
AI tools make it fast and easy to build a prototype, so many teams skip the harder question of which problem to solve first. A researcher gets excited about a new technique, builds a PoC around it, and then tries to find a business problem it solves. That is backwards.
A working prototype is not a production tool. The code from early prototyping typically carries significant technical debt. Getting from "works on my laptop" to "anyone in the organization can rely on it" requires skills in version control, containerization, cloud infrastructure, CI/CD pipelines, and user feedback loops. These are not typical parts of a researcher's toolkit. Boston Consulting Group found that only 26% of companies have the capabilities to move beyond proof of concept.
A 2024 Forrester survey of 500 enterprise data leaders found that 73% named data quality and completeness as their primary barrier to AI success, ranking above model accuracy, compute costs, and talent shortages. Only 20% of business-critical information lives in structured data. The rest lives in synthesis notes, PDF reports, email threads, instrument output files, and lab notebooks. Companies with strong data integration achieve more than 10x the ROI from AI tools compared to those with poor integration.
Phase 1: Ideation. Scientists and developers work together to identify the highest-value problems. Data gaps get surfaced before building starts.
Phase 2: Prototyping. Scientist-led, rapid iteration, AI-assisted development. Organize and label key data, build something testable, get it in front of real users.
Phase 3: Validation. Test against real data. Get eyes on the code. Make sure data integration holds up under realistic conditions.
Phase 4: Hardening. Containerize, document, secure, set up CI/CD. Unglamorous work. Also what separates a prototype from something people can actually rely on.
Phase 5: Deployment. The tool is live, but iteration continues. Collect user feedback. Feed findings back into new ideation.
Scaling is 80% of the work. If your project plan ends at the prototype, you are planning for pilot purgatory.
At a materials company, domain experts were spending roughly 10 hours per week manually extracting particle data from SEM micrographs. The process was subjective and the results were hard to use downstream.
During the discovery phase Enthought identified it as a high-value, tractable problem. We built a prototype web app using the Segment Anything Model (SAM) for automatic segmentation, combined with an image labeling interface so researchers could generate labeled training data and watch segmentation quality improve in real time. Post-segmentation calculations produced quantitative outputs usable by other analysis tools.
The result was that time on this task dropped from roughly 10 hours per week to 30 minutes per week. Experts shifted from manual extraction to analysis and experiment design.
That outcome was not built on the algorithm alone. It came from a clear discovery process, close collaboration with domain experts, and getting the data infrastructure right before development began.
Learn the fundamentals (Git, command line, containerization, testing). Use AI coding tools to accelerate, but make sure you understand what they produce. Build with deployment in mind from the start. Document as if someone else will maintain it, because they probably will. Tie your work to measurable value.
Budget for all five phases, not just the first two. Build cross-functional teams that pair domain scientists with software engineers and users. Set explicit metrics for when a prototype should be scaled or retired. Killing a project that is not working is often the right call. The problem is when projects linger in pilot purgatory consuming resources without delivering value.
Fix your data first. This is the highest-leverage investment you can make. Invest in digital infrastructure before and alongside AI integration. Measure and communicate the value of tools, even small wins. Sustained ROI justifies continued investment.
Questions? Reach us at info@enthought.com
This content was originally discussed in the presentation, From PoC to Production: Bridging the Gap in Scientific Software Development.
Why Science, Why Now? The frontier AI companies have placed a coordinated bet on science. OpenAI launched FrontierScience1 and signed an MOU with the...
The conversation around AI in scientific R&D has shifted dramatically in a short period of time. Fields like chemistry, materials science, and life...