AI DEVELOPMENT PLATFORM
One Platform for the Entire LLM Development Lifecycle
COMPANY
Microsoft
ROLE
Product Designer
TOOL
Figma
TIMELINE
6 Months
Prompt Flow — Industrializing Prompt Engineering
From creative prompt experiments to enterprise- ready AI workflows
Prompt Flow is Microsoft’s internal platform that standardizes the end-to-end prompt engineering workflow.
It enables data scientists, AI developers, and product teams to collaboratively build, evaluate, and operationalize LLM-driven solutions with the same rigor as traditional software engineering.
My Delieverable
Role Coverage
Conducted research, including interviews, and marketing analysis.
Formulated strategic concepts, and defined project scope.
Crafting low to high-fidelity wireframes, and delivering the final UI design
Workflow Type
Authoring, Evaluation, Deployment
Impact
Faster iteration cycles, Traceable quality metrics, Scalable deployment
AI Engineers / Data Scientists / PMs
Product Overview
Create a New Prompt Flow: Select a pre-built template or blank page, Enhance the efficiency of prompt engineers.
Show Prompt Flow in Visual Way: DAG view reflect nodes in a flattened view.
Add and Set a Node: After adding an LLM node, users can set parameters, and edit prompt.
Link Nodes: Link each node by editing inputs, avoid confusion if there are multiple inputs in one node.
Test Prompt & Debug: Run flow to test prompt or use natural language directly in chat mode, to help prompt engineers understand the performance.
Run Specific Node to Check Performance
Run the Whole Flow to Check Performance
Collaborate with Team Members: In the workspace, check teammates' prompt flow, duplicate them as a reference, template, and help debug.
CHALLENGES OF TARGET USER - PROMPT ENGINEER
• High Learning Curve
Non-technical users often struggle with the complexity of AI tools, requiring more intuitive interfaces and simplified workflows.
• Lack of Collaboration
Developers and non-technical team members face communication barriers, resulting in delays and misaligned goals during AI integration.
• Limited Customization Options
Users need flexible tools to personalized AI outputs for diverse use cases, which are often restricted in existing products.
• Inefficient Iteration Process
Current workflows make it difficult to test and refine AI models quickly, hindering real-time innovation.
• Resource Constraints
Startups and smaller teams often lack the technical resources to fully leverage advanced AI functionalities.
HYPOTHESIS & GOALS
Providing an integrated, end-to-end platform for the entire LLM lifecycle, accessible to technical and non-technical users.
User Goals
Business Goals
Enable users to effectively design, test, and deploy AI workflows with ease, supporting non-technical users through an intuitive interface.
Position Prompt Flow as the leading LLM platform, driving Azure AI adoption and expanding its reach to diverse user groups.
PLANNING & PROJECT MILESTONE
Providing an integrated, end-to-end platform for the entire LLM lifecycle, enabling efficient development, deployment, and management of AI models.
Research
UNDERSTAND PROMPT ENGINEERING DOMAIN & INDUSTRY
Existing prompt engineering tools either prioritize simplicity for experimentation or advanced features for professionals, but fail to provide an integrated, user-friendly solution for the entire workflow.
Product Opportunity Gap
What we MUST HAVE
Existing tools either lack comprehensive functionality or are too complex, creating a need for a user-friendly, all-in-one platform for the LLM lifecycle.
A comprehensive platform with intuitive interfaces and advanced capabilities to support end-to-end LLM workflows effectively.
“I usually wonder if my prompts is good enough, and testing it in real-time can be challenging.”
#1 Simplified Prompt Validation
We started to play with 15+ prompt engineering products and summarized the insights from different aspects such as features, user flow, and UI pattern.
PAINPOINTS FROM INTERVIEW
What do our users struggle/concern with the most?
Uncertainty About Prompt Accuracy
Fragmented Workflow Across Multiple Tools
Onboarding Challenges for Non-Technical Users
KEY INSIGHTS FROM PAINPOINTS
DESIGN PRINCIPLE REVIEW
66% of participants expressed a lack of confidence in the accuracy of the prompts they create. Many noted the high cost and effort associated with repeated testing and refinement, leading to frustration and inefficiency.
80% of participants struggled with the need to switch between various tools for prompt writing, testing, refinement, deployment, and monitoring. This disjointed process made it difficult to maintain efficiency and focus, creating additional cognitive and operational burdens.
60% of participants highlighted difficulties in getting started with prompt engineering. Non-technical users often feel overwhelmed by the technical complexity and lack of guided workflows, frequently relying on external expertise or pre-existing templates to overcome initial barriers.
“I create prompts using CDEX, but deploying my chatbot with third-party products often takes significant time and effort.”
#2 Integrated Workflow for Efficiency
“I’m not a developer and I don’t actually know how to do with a blank sheet, so I ask help form my team’s expertise.”
#3 Accessible Onboarding for Non-Technical Users
We are focusing on providing transparent workflows and actionable insights throughout the prompt engineering process.
Empower users through intuitive control
Prompt Flow addresses user challenges by combining end-to-end integration with an intuitive interface that simplifies complex workflows. By unifying prompt creation, testing, and deployment in one platform, it minimizes inefficiencies and enhances productivity.
Define the MVP
DECISION MAKING METHODOLOGY
To align user goals with business goals while considering constraints,
We defined feature deliverables for the MVP stage using Product Roadmap.
Prioritization
MVP features
User Goals
We scoped features based on research insights and got feedback from designers, engineers, and product managers. My task is to design Prompt Flow development and evaluation.
Initialization
User guide
Workspace
Prompt Flow Template
Experimentation
Create & modify Prompt flow
Run flow against sample data
Evaluate prompt
Bulk Evaluation
Run flow with bulk dataset
Evaluate prompt
Production
Deploy & monitor flow
Expand Accessibility Across User Segments
Empower Non-Technical Users
Enhance Workflow Efficiency
Business Goals
Accelerate Adoption
The solution directly addresses core user needs
Essential features needed to launch MVP
Engineer resources & technical feasibility
💡How we decide on MVP features
Design Goals
Develop Intuitive, visually guided experiences for technical and non-technical users
Create an integrated workflow to eliminate the need for multiple tools
NARROW DOWN THE PROBLEM FOR THE MVP
How might we design a prompt evaluation system that provides clear metrics, simplifies the evaluation process, and supports technical and non-technical users to optimize workflow?
User Flow Iteration
After testing the initial user flow with developers, we discovered friction around the separate configuration steps for “Set Bulk Run” and “Set Evaluation.” To address this, I merged these two actions into a unified configuration step—Set Bulk Run & Evaluation Flow. This adjustment reduces one screen and step, resulting in a ~20% decrease in the workload related to bulk settings, while also simplifying the mental model for users.
Key Screens
AzureML Toolkit Based on Fluent 2 Web UI
I collaborated with fellow AzureML Design team members to build the AzureML Toolkit using the Fluent 2 Web UI framework.
Why it works
The toolkit enabled a consistent and cohesive design language across multiple AzureML functionalities, ensuring a unified user experience.