One Cerebral
Posts
How Startups Can Use Agentic AI

How Startups Can Use Agentic AI

Co-Founder/CTO at Flip AI: Sunil Mallya

Jiten Patel
February 28, 2025

Credit and Thanks: 
Based on insights from The TWIML AI Podcast with 
Sam Charrington.

Key Learnings

Flip AI emphasizes the importance of integrating diverse data types for effective AI observability, enhancing operational efficiency.
The CoMELT framework combines code, metrics, events, logs, and traces, providing a comprehensive approach to training LLMs.
A mixture of experts (MOE) approach allows for specialized models tailored to specific tasks, improving performance and adaptability.
Establishing clear roles and interfaces within teams fosters accountability and enhances workflow efficiency in complex systems.
The focus on high-quality test sets over training sets ensures that models are rigorously evaluated for real-world applicability.

_{Today’s Podcast Host:}_{Sam Charrington}

Title

An Agentic Mixture of Experts for DevOps

Guests

Sunil Mallya

Guest Credentials

Sunil Mallya is the co-founder and CTO of Flip AI, a company focused on enterprise AI agents for observability, which he started in February 2022. His impressive career includes roles as Head of Amazon Comprehend at AWS, Principal Deep Learning Scientist for AWS DeepRacer, and co-founder/CTO of Neon Lab, a neuroscience and ML venture-funded startup. Mallya has over 15 years of experience in AI, ML, and technology, with 25+ patents filed in machine learning and reinforcement learning, and has delivered more than 50 machine learning talks at top-tier industry and academic conferences.

Podcast Duration

1:14:39

Read Time

Approx. 5 mins

Deep Dive

Sunil Mallyal shares insights into the innovative approaches his company is taking to address the challenges of AI observability. Flip AI is focused on creating an observability layer that enhances operational efficiency for developers, particularly in the realm of machine learning operations (MLOps). Mallyal's journey from working on AWS's Deep Racer team to founding Flip AI illustrates the importance of recognizing emerging trends and pivoting accordingly. This adaptability is crucial for startup founders who must remain agile in a rapidly evolving tech landscape.

Central to Flip AI's approach is the concept of CoMELT, which stands for the integration of code, metrics, events, logs, and traces. This framework allows for a comprehensive understanding of operational pain points, enabling the training of large language models (LLMs) on domain-specific data. Mallyal emphasized that the training data is not just a collection of logs but a curated dataset that reflects the unique challenges faced by developers. For founders, this highlights the importance of investing time in curating high-quality training data that is relevant to their specific domain. By doing so, they can enhance the performance of their AI models and reduce the time spent on debugging and root cause analysis (RCA).

The integration of time series data with LLMs is another key theme discussed. Mallyal explained that traditional models often struggle with time series data, which is critical for understanding trends and anomalies in operational metrics. By developing a hybrid approach that combines traditional models with LLMs, Flip AI can provide more accurate insights into system performance. Founders should consider how they can leverage different types of data to improve their AI solutions, recognizing that a one-size-fits-all approach may not yield the best results.

Mallyal also touched on the use of vision language models (VLMs) in data interpretation. While VLMs have shown promise in identifying visual patterns, Mallyal noted that they can struggle with the nuances of operational data. This serves as a reminder for founders to carefully evaluate the tools they choose for their projects, weighing the trade-offs of different technologies. By understanding the strengths and limitations of various models, founders can make informed decisions that align with their specific needs.

The Agents, Actors, and Director framework introduced by Mallyal provides a structured approach to managing complex workflows in AI systems. This framework delineates clear roles and responsibilities, allowing teams to operate more efficiently. Founders can draw from this by implementing similar structures within their organizations, ensuring that team members are empowered to take ownership of their tasks. The emphasis on well-defined interfaces and repeatable patterns is essential for building scalable systems, and founders should prioritize establishing these principles in their own operations.

Mallyal's insights into the mixture of experts (MOE) approach in model training further illustrate the need for flexibility and adaptability in AI development. By leveraging multiple models tailored to specific tasks, startups can enhance their operational capabilities while minimizing the risks associated with relying on a single model. This approach encourages founders to think critically about model selection and to be open to integrating new technologies as they emerge.

The discussion also highlighted the importance of a robust test set in evaluating model performance. Mallyal emphasized that while training data is important, the focus should be on creating a representative test set that accurately reflects the challenges the model will face in real-world applications. Founders should adopt a similar mindset, ensuring that their evaluation processes are rigorous and that they are prepared to iterate on their models based on performance feedback.

In summary, Mallyal's experiences and insights provide a wealth of knowledge for startup founders. By focusing on the integration of diverse data types, establishing clear frameworks for team collaboration, and prioritizing high-quality training and test data, founders can position their startups for success in the competitive AI landscape. The lessons learned from Flip AI's journey serve as a valuable guide for those looking to innovate and excel in the field of AI observability and beyond.

Actionable Insights

Curate domain-specific training data to improve the accuracy and relevance of AI models in your startup.
Implement a structured framework for team collaboration, defining clear roles and responsibilities to enhance productivity.
Leverage a mixture of experts approach in model training to optimize performance across various tasks.
Prioritize the development of a robust test set to evaluate model performance and ensure reliability in real-world scenarios.
Encourage a culture of proactive problem-solving within your team, allowing members to take ownership of their tasks and contribute to operational success.

Mind Map

Key Quote

"Machines talk to each other with APIs and they express pain in logs."

Future Trends & Predictions

As AI technology continues to evolve, the integration of various data modalities will become increasingly important for startups aiming to enhance their operational efficiency. The trend towards more specialized models, such as those utilizing the MOE approach, will likely gain traction as companies seek to optimize their AI systems for specific tasks. Additionally, the emphasis on observability and root cause analysis will drive the development of more sophisticated tools that enable organizations to proactively address issues before they escalate, ultimately leading to improved service reliability and customer satisfaction.

Check out the podcast here:

_{Latest in AI}

1. Hugging Face has released SmolVLM2, a family of open-source Vision Language Models (VLMs) that are claimed to be the smallest ever released. The new models include SmolVLM-256M and SmolVLM-500M, with the 256M parameter version being touted as the world's smallest VLM, designed for efficient on-device inference while maintaining strong multimodal performance.

2. Crunchbase has developed an AI tool that predicts startup success by analyzing proprietary data generated by its 80 million users. The system uses signals such as profile edits, employee searches for investor profiles, and spikes in investor interest to predict when startups will raise funding, get acquired, or go public.

3. Microsoft's AI2BMD model advances protein dynamics simulation with near ab initio accuracy, efficiently simulating proteins with over 10,000 atoms. This development could unlock new capabilities in biomolecular modeling, especially for processes where high accuracy is needed, such as protein-drug interactions.

_{Startup World}

1. Mercor, an AI-powered hiring startup, has raised $100 million in Series B funding, bringing its valuation to $2 billion. The company's platform automates the entire hiring process, from resume screening to AI-powered interviews, and claims to predict job performance better than traditional methods.

2. AI programming startup Codeium is reportedly raising a new round of funding at a valuation close to $2.85 billion, more than doubling its value from six months ago when it was valued at $1.25 billion. The round is said to be led by Kleiner Perkins, with Codeium's Annual Recurring Revenue (ARR) estimated at approximately $40 million.

3. Together AI, a San Francisco-based AI cloud platform backed by Salesforce and Nvidia, has raised $305 million in a Series B funding round led by General Catalyst and co-led by Prosperity7 Ventures, valuing the company at $3.3 billion.

❝

_Analogy

Building a successful AI startup is like assembling a symphony orchestra. Sunil Mallyal’s Flip AI doesn’t rely on a single instrument but harmonizes diverse elements—code, metrics, logs, and traces—into a cohesive observability layer. Just as a conductor ensures each section plays its part, founders must integrate different technologies, data types, and frameworks to create a well-tuned system. The key isn’t just collecting data but curating the right dataset, much like selecting skilled musicians for the orchestra. Success comes from orchestrating these moving parts with precision, adaptability, and a deep understanding of how each contributes to the final masterpiece.

What did you think of today's email?

Your feedback helps me create better emails for you!

Loved it

It was ok

Terrible

Thanks for reading, have a lovely day!

Jiten-One Cerebral

_{All summaries are based on publicly available content from podcasts. One Cerebral provides complementary insights and encourages readers to support the original creators by engaging directly with their work; by listening, liking, commenting or subscribing.}

Reply

or to participate.