Dataiku ML Summit Singapore - 2025
Contents
This blog post is a brief summary of the ML Summit hosted by Dataiku on July 17th 2025, at Singapore.

Summit Notes
Welcome Address
Speaker: Randy Goh, Area VP, Dataiku
Key Points
- The welcome address mainly spoke about the various features of Dataiku present ones and the ones in the roadmap
- Dataiku has 800 firms as its customers across the world
- Velvet Sundown - 100% AI generated band that shows the importance of AI generated content in the current world
Overall Vision and Keynote: Agents Beyond Chaos
Speaker: Sophie Dionnet, SVP, Product & Business Solutions, Dataiku
Key Points
- We are just scratching the surface of:
- True AI transformation
- Real AI analytics
- AI deployment
- AI agents are everywhere:
- News from consulting firms
- Cloud providers
- LLM vendors
- Major categories:
- Agentic capabilities from AI infrastructure providers:
- Power
- Expertise
- Maintenance
- New startups
- Agents and agent builders in enterprise applications:
- Connected
- Siloed
- Extensible
- Agentic capabilities from AI infrastructure providers:
- Enterprise Applications (EAs) are evolving:
- From business rules + databases + transactions ? AI assistants
- Still fit for purpose
- But raise concerns:
- Cost
- New silos
- Relevance for your ecosystem
- Cross-application assistants:
- Leading enterprises are longing for a new class of AI-first applications
- AI-Extended Data Model
- AI Process Automation:
- Example: Automating a marketing content creation process, including legal review
- The enterprise AI stack is growing more sophisticated to meet that need:
- Cross-application assistants
- AI-extended data model
- AI-powered processes
- Key components:
- LLM Ops
- AI Governance
- AI Agent Builder
- Analytics:
- Business Intelligence
- Last-mile ETL
- Models:
- ML Ops
- Data Science / ML Platform
- Agents
- Universal AI Platform:
- Multi-cloud
- Multi-platform
- Multi-model
- Unified governance
- Agents:
- Agent creation
- Agent control
- Catalysts for going deeper
- Models:
- Model development
- Model operations
- Analytics:
- Data preparation
- Insights
- Agents introduce a new type of intelligence to the enterprise:
- Tasks:
- Agents become the primary UI
- Roles:
- Machine-like tasks are automated
- Tasks:
- Even simple agents are difficult:
- Limited functionality
- Inaccurate data
- Lack of robustness
- Hallucinations
- No way to monitor ongoing quality of outputs
- Multi-agent systems have become a chaotic mess:
- Mistakes compound in cascades of errors
- Agents created in siloed systems have become ungovernable
- Three common problems:
- Fragile connections to systems
- Lack of accountability
- Impossible to diagnose failures
- Now is the time to get agentic architecture and governance right:
- Enterprise orchestration
- Fully governed creation
- Continuous optimization
- Dataiku orchestrates:
- Analytics
- Models
- Agents
[Panel] AI in the Force
Moderator: Sophie Dionnet Panelists:
- Kian Boon Lim (HTX)
- Daniel Seet (SCDF)
- Sean Tan (MHA)
Key Points
- Paramedics deal with around 40 different protocols for various cases.
- AI is being used to:
- Analyze case feedback
- Prioritize the relevant protocols based on case patterns
- AI is being used to:
- Robotics
- Real-time incident management
- Chunk the AI project
Agentic AI: Enterprise Automation, Safety, and Scalability
Speaker: Dr. Clifton Phua, IMDA
Key Points
- Agentic AI marks a shift:
- From task-based human work
- To goal-driven AI agents
- National AI Strategy:
- Singapore’s strategy focuses on:
- Operationalizing AI
- Scaling real-world AI systems
- Singapore’s strategy focuses on:
- Virtual agents for citizen services:
- Customer service AI agents
- Merchant assistant agents
- Project Vend:
- Autonomous office shop pilot (June 2025)
- Demand response
- Excessive generosity
- AI hallucinations
- Resulted in a 20% value loss in one month
- Ultimately deemed a failure
- Demonstrated how safe experiments can reveal important AI failure modes
- Autonomous office shop pilot (June 2025)
- Implementation challenges:
- Technical limitations:
- Complex reasoning and long-term memory are still hard for AI agents
- ~20% behind human-level performance
- System integration:
- Difficulty connecting AI agents to existing APIs and legacy systems
- Trust and safety:
- Hard to ensure agent reliability
- Deployment is difficult:
- Requires practical constraints and guardrails
- ROAI (Return on AI Investment) is unclear
- Technical limitations:
- Emerging solutions:
- Technical limitations:
- Specialized agents for domain tasks
- Multi-agent architectures
- System integration:
- Open frameworks
- Standardized agent communication protocols
- Trust and safety:
- Embedded guardrails
- Sandbox testing environments
- Technical limitations:
- Recommendations:
- Start with small pilots
- Co-develop solutions with domain experts
- System-level risks:
- Goal misalignment
- Conflicting decisions
- Potential data breaches
- Solutions:
- Shared policy layers
- System-wide observability
- Collaborative innovation
- Speaker note:
- Dr. Phua created the entire slide deck using an AI agent
Technical Keynote: Multi Agent Orchestration
Speaker: Jed Dougherty, VP Platform Strategy, Dataiku
Key Points
- Provides orchestration for multi-agent frameworks at various scales:
- Team scale
- Department scale
- Enterprise scale
- Roadmap:
- Agent orchestration for reliable, composite AI
- Analytics embedded across the enterprise
- Operationalized AI governance
- EU AI Act readiness
[Fireside Chat] Order in Chaos: Bringing AI to Fruition
Moderator: Grant Case, Dataiku Speaker: Dr. David R. Hardoon, Standard Chartered Bank
Key Points
- Key question: How do we increase AI propagation?
- Single most critical safeguard for agentic AI:
- Need for simplification
- Example: If “country” is missing in a transaction, can you build an agent that handles just that?
- What is meaningful oversight?
- Start with simple, repetitive tasks and apply agentic AI
- Consider augmenting human capabilities — what more can we do?
- Most undervalued competency:
- Start with the “known knowns”
- 12 to 18 months outlook:
- Hyper-personalized experiences are becoming ubiquitous
- Emergence of “invisible banking”
AI That Makes Sense: Sensemaking Systems 2.0
Speaker: Tan Boon Leong, ST Engineering
Key Points
- Organization: ST Engineering
- Four foundational components:
- Use case
- Agile process
- Tools
- Skills
- Timeline:
- 2024 – Agentic AI: Text to Action
- 2025 – Embodied AI
- Focus: Human and machine collaboration
- Virtual assistant
- Co-pilot
- Digital mentor
- Shift from task automation to decision augmentation
- Sensemaking: Going beyond dashboards
- Multimodal data ingestion
- Contextual grounding
- Event intelligence
- Explainable recommendations
- Use cases:
- Pattern of Life Analysis:
- Identify non-compliant vessels operating near or within Singapore waters
- Maritime security – detect suspicious behavior
- Uses deep learning (sequence-to-sequence models)
- MRO Product DNA:
- How to digitalize maintenance and repair operations
- Knowledge graph construction:
- Often hard to start
- Use GenAI to:
- Extract relationships
- Represent them in graph form
- Enable natural language querying
- Pattern of Life Analysis:
- Reflections:
- Value creation by enhancing current systems to the next level
- Importance of upskilling
Building Trust in AI
Speaker: Shameek Kundu, AI Verify Foundation
Key Points
- AI Verify Foundation (AIVF) is a subsidiary of IMDA
- Message: Make AI boring again
- Reference: Spain electricity catastrophe
- Guidelines are great, but:
- It’s essential to ensure AI is reliable on the ground
- GenAI testing:
- Much focus today is on frontier risk
- Critical to ensure models are not misaligned
- Shift in focus:
- From safety to reliability
- Models must be tested as part of end-to-end systems
- Lessons learned:
- 17 companies participated with real-life GenAI applications
- Paired with professional testers to evaluate effectiveness
- AIVF’s mission: Make AI model testing reliable
- Global AI Assurance Pilot:
- Develop testing norms and practices
- Establish foundations for a viable assurance market
- Contribute to roadmaps for AI testing tools
The Agentic AI Revolution: Transforming Business Vision into Tangible Value
Speaker: Catherine Ko, Amazon Q GTM Lead – APJ, AWS
Key Points
- The Age of AI:
- Pre-2020s: Innovation cycles took 18–24 months
- GenAI era: Innovation cycles have accelerated to 4–6 months
- Agents are now capable of performing longer tasks autonomously
- Amazon is investing $100B in AI in 2025
- Evolution into Agentic AI:
- Generative AI:
- Follows a set of rules
- Automates repetitive tasks
- Generative AI Agents:
- Achieve singular goals
- Handle broader ranges of tasks
- Automate entire workflows
- Agentic AI Tasks:
- Fully autonomous
- Built on multi-agent systems
- Mimic human logic and reasoning
- Generative AI:
- Businesses are creating value with AI agents:
- Workplace productivity
- Business workflows
- Innovation and research
- Example: NFL
- Agents accelerate content production
- Agents are transforming Amazon’s internal systems:
- 10,000 apps migrated
- 4,500 developer years saved
- $260M in savings
- Example: Cognizant
- AI agents used to streamline mortgage workflows
- Example: Moody’s
- AI application built to generate comprehensive risk reports
- Time reduced from 1 week to 1 hour
- Amazon Q:
- Industry-leading specialized agents
- Amazon Q Developer:
- AWS AI-powered assistant for software development
- AWS Transform Initiatives:
- Java migration
- .NET modernization
- COBOL to Java
- Agent Demo:
- Agentic IDE released at Kiro.dev
- Features:
- Generative AI assistant
- Brainstorming support
- Agents can automate complex workflows
- Key takeaway: Put together the building blocks for agent-based automation
- Amazon Q Index in Asana:
- Asana integrates Q Index to pull data from various applications
- “Buy for Me” feature:
- Rolled out using Amazon AI
- Future interfaces:
- Designed to support data scraping by AI agents
Operationalising Data Quality for Agentic AI with Dataiku
Speaker: Sophie Dionnet
Key Points
- The agentic world has an Achilles’ heel: Data Quality
- Trusted data is the cornerstone of agent efficiency
- There is increasing demand for:
- Less intermediated access to data
- Trusted data depends on:
- Data quality
- Transparency:
- Understanding the lineage of the data
- Documentation
- Ownership
- Virtuous circle for building data trust:
- Define your rules
- Monitor dashboards
- Troubleshoot issues
- Repeat
- Any outliers are sent back (feedback loop)
- Column-level data lineage:
- Recently added feature (a few months ago)
- Issue resolution:
- Troubleshoot any form of data issues
- Documentation practices:
- Exists at all levels:
- Collection
- Project
- Dataset
- Column
- Uses:
- Expansible wikis
- Discoverable knowledge
- Exists at all levels:
- Metadata generation:
- Use LLMs to assist in generating metadata
- Evolving data roles:
- From Data Stewards to Data Influencers
- Share and reuse golden datasets
- Publication workflows
- Secured sharing mechanisms
- From Data Stewards to Data Influencers
- Dataiku provides tools to operationalize and scale data quality efforts
[Panel] The Future of Enterprise AI Infrastructure
Moderator: Grant Case Panelists:
- Geoff Soon (Mistral AI)
- Ettikan K Karuppiah (NVIDIA)
Key Points
- FSI Agent AI Blueprint:
- NVIDIA builds blueprints for AI transformation
- Today’s AI-native companies are born in the generative AI era
- These are transformation-focused companies
- NVIDIA’s position:
- Horizontal player in the AI ecosystem
- Builds:
- System libraries
- Application-specific libraries
- Uses microservices as building blocks
- Blueprints are composed over these microservices
- Vertical-specific template applications help FSI build AI applications
- LLMs:
- Train models with data ? output parameters (e.g., 7B model)
- Open-weight models:
- Make weights accessible to customers
- Enables auditing and control
- Pretrain models to fit specific domains
- Agents:
- Designed for end-to-end transparency
- Mistral AI:
- Advocates for open-source and open-weight models
- What makes NVIDIA’s blueprint different:
- Enterprise AI factory:
- Tests and validates hardware
- Software is tested and validated
- Includes:
- Frameworks
- Applications for FSI models
- Hardened models for specific environments
- NVIDIA microservices:
- Hundreds of models
- 400+ libraries
- Can be composed into complete applications
- Includes:
- Pre-processing libraries
- Post-processing libraries
- Retriever microservice
- Reranker microservice
- Reasoning engine ready for deployment
- Enterprise AI factory:
- Mistral AI on infrastructure:
- Customer of NVIDIA
- On-prem deployment has become cost-effective
- Dataiku capabilities:
- Helps manage prompts to ensure governance
- Supports LLM mesh: choose any LLM you want
- Empowers individuals with:
- Access to data
- Capabilities to integrate data and agents
- Enables tighter integration between enterprise data and agents
- Blueprint implementation:
- Dataiku serves as an orchestration platform
- Manages data access and control
- Simplifies workflow implementation when driving use cases
- Example use case: AML (Anti-Money Laundering)
- Use the right datasets
- Data governance essential
- Use NIMS:
- Guardrails at input and output
- Retrain LLMs using enterprise datasets
- Trace and track communication between agents:
- Important for explainability
- Open-weight LLMs:
- The gap between public cloud and on-prem has narrowed
- Shift from personal productivity to enterprise transformation
- Focus on:
- Specific models
- Strong governance
- Moving from human interface to agent interface
- For highly regulated industries:
- Work closely with data owners
- Understand what data and applications can be used in agentic AI
- Identify low-hanging fruit for automation
- Collaborate with LLM providers
- Do a quick PoC
- Keep human-in-the-loop as part of oversight
- Work closely with data owners
[Panel] From Promise to Practice: AI, GenAI & Agents
Moderator: TC Gan Panelists:
- Sandeep Moturi (MSD)
- Melvyn Peh (STMicroelectronics)
- Ann Tey (NTU)
Key Points
- Bio/Life Sciences Vertical:
- Current processes involve tests like angiograms
- New techniques such as digital twin:
- Capture a signature of your heart
- Predict long-term heart function using AI
- GenAI accelerates classical AI:
- Awareness ? Diagnosis ? Treatment
- Apply AI on digital twin simulations
- Use case from South Korea:
- Boosted sales by 10%
- Purchased property data, mapped lat/long of Seoul
- Identified that 20% of potential customers were not being targeted
- No AI involved – pure business analytics
- Use case with AI for pricing injections:
- Inputs:
- Social media sentiment
- Past sales data
- Doctor sentiment
- Create a model to:
- Maximize sales
- Minimize reputational risk
- Visually identify the pricing “sweet spot”
- Inputs:
- GANs used for image generation:
- Competing models generate optimal visual outputs
- High Tech Manufacturing:
- In the semiconductor industry:
- Data has always been present
- Each product passes through hundreds of machines
- AI makes machine interactions intelligent
- Helps engineers predict and respond proactively
- Legacy challenge:
- Data was siloed
- Now with Dataiku, systems can integrate and interact
- Data availability is no longer the bottleneck
- Chatbots:
- Support users by accessing the knowledge base
- Vision:
- Move towards dark factories
- Employ humanoids for autonomous operations
- NVIDIA provides tools for synthetic data creation
- Physical AI:
- The bleeding edge
- Use LLMs to generate training data for robots
- In the semiconductor industry:
- Higher Learning:
- Use cases across admin, faculty, and research:
- Connect data across silos
- Use it for machine learning applications
- Example:
- Identify students at risk of failing
- GenAI used to respond to helpdesk queries
- Physical AI in research:
- Example: solar panel chip design
- Use LLMs to instruct robots to create advanced chips
- NTU scale:
- 30,000 students and 3,000 staff
- Focus on democratizing AI within the university
- Tools:
- Use AI Co-pilot tools to democratize access and usability
- Use cases across admin, faculty, and research:
- AI Security & Governance:
- Emerging role: AI Guardian
- Similar to a security guard or police department for AI
- Need for a governing body or MCP (Model Control Panel)
- Responsible for oversight, policy, and standards
- Emerging role: AI Guardian
- Closing Thought:
- The entire AI ecosystem needs to grow together
Idea to Impact: How Dataiku Powers End-to-End AI
Speakers:
- TC Gan
- Ming Zuo Liew
Key Points
- Overview:
- Agents are everywhere
- Enabling a new class of enterprise applications
- But building those applications can be chaotic
- Market trend:
- 25% of firms doing GenAI will deploy an agent in 2025
- That number is expected to reach 50% in two years
- Universal AI Platform:
- Designed for agent creation and control
- As applications and agents multiply, things get messy:
- Multiple teams building multiple agents
- Leads to complexity and technical debt
- Demo theme:
- Not just about innovation, but about rapid creation
- AI project lifecycle:
- POC ? Pilot ? Product ? Critical Product
- Live Demo:
- Example: Singapore Airlines
- From data ingestion to dashboard insights
- Steps:
- Ingest XML data
- Prepare using recipes
- Use GenAI for:
- Entity extraction
- Dashboard creation
- Ask questions on top of the results
- Example: Singapore Airlines
- Enterprise Orchestration:
- Coordinate multi-agent and multi-model setups with security and guardrails
- Components:
- LLM Mesh:
- Securely manage connections to multiple LLMs
- Configure specific LLMs for specific use cases
- Centrally manage all LLM connections
- Sage Guard:
- Control panel for risk
- Detect PII in prompts
- Detect prompt injection attempts
- Agent Connect: Connect and manage agents
- Cost Guard: Monitor and optimize usage costs
- Quality Guard: Ensure output quality
- Trace Explorer: Observe agent behavior and lineage
- LLM Mesh:
- Analogy: Think of an F1 race setup
- Each car = an AI Agent
- F1 Pit Lane = LLM Mesh infrastructure
- Where agents go for updates and orchestration
- Sage Guard = Safety crew
- Cost Guard = Race strategist
- Quality Guard = Performance analyst
Summit pics













































