• 870: OpenAI’s “Deep Research”: Get Days of Human Work Done in Minutes
    Mar 14 2025
    In this Five-Minute Friday, Jon Krohn looks into what he considers the world’s most powerful research tool to date, OpenAI’s Deep Research. Find out how OpenAI trained Deep Research to compile literature reviews of limitless topics, what similar tools are on the market, and where Jon sees the tool as having real-world value including how he uses it daily. Additional materials: www.superdatascience.com/870 Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information.
    Show More Show Less
    17 mins
  • 869: AI Should Make Humans Wiser (But It Isn’t), with Varun Godbole
    Mar 11 2025
    Jon Krohn talks to Varun Godbole about AI prompt engineering, generative wisdom, and AI generalists in this episode all about the interrelationships between humans and AI. Additional materials: www.superdatascience.com/869 This episode is brought to you by the Dell AI Factory with NVIDIA and by ODSC, the Open Data Science Conference. Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information.
    Show More Show Less
    1 hr and 20 mins
  • 868: In Case You Missed It in February 2025
    Mar 7 2025
    How to start a successful tech company, and how you can get started with DBT, TabPFN and BAML: Jon Krohn rounds up his favorite moments from February in this episode of “In Case You Missed It”. Additional materials: www.superdatascience.com/868 Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information.
    Show More Show Less
    27 mins
  • 867: LLMs and Agents Are Overhyped, with Dr. Andriy Burkov
    Mar 4 2025
    The realities of Agentic AI, AGI, and chatbots that don’t hallucinate: Andriy Burkov talks to Jon Krohn about AI in 2025. Best known for his concise machine learning modelling books, author and AI influencer Andriy Burkov also talks about his latest publication in the series, The Hundred-Page Language Learning Models Book. Additional materials: www.superdatascience.com/867 This episode is brought to you by the Dell AI Factory with NVIDIA. Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information.
    Show More Show Less
    1 hr and 33 mins
  • 866: Bringing Back Extinct Animals like the Wooly Mammoth and Dodo Bird
    Feb 28 2025
    Jon Krohn addresses a question for the ages: How close are we, really, to Jurassic Park? Dallas-based biotech company Colossal Biosciences is developing technology that aims to return previously extinct animals like the dodo and wooly mammoth to earth and, crucially, pull many others like the white rhino back from the brink of extinction. Additional materials: www.superdatascience.com/866 Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information.
    Show More Show Less
    8 mins
  • 865: How to Grow (and Sell) a Data Science Consultancy, with Cal Al-Dhubaib
    Feb 25 2025
    Jon Krohn talks to Cal Al-Dhubaib about the extraordinary success of AI and machine learning solutions provider Pandata, his ironclad hack for any company to define their core values, and how to attract and secure loyal clients. Cal thinks tech professionals make two critical mistakes in their careers: The first is that they too-often enjoy being the gatekeepers of their work rather than educating their clients and coworkers as to the details of their projects and why it benefits the company. The second is that tech professionals don’t show vulnerability, whether that means not knowing a topic or not fully understanding how a business works. This issue, Cal says, can spell the difference between a startup’s success and failure. Learn how tech startups can make an ironclad strategy for their future in this episode of The SuperDataScience Podcast. This episode is brought to you by ODSC, the Open Data Science Conference. Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information. In this episode you will learn: (09:32) How to scale a successful data science consultancy (22:25) How Pandata navigates highly regulated environments (27:59) How to tackle tech illiteracy in business (36:32) What skills Cals looks for in new hires (35:56) How to sell on a tech company Additional materials: www.superdatascience.com/865
    Show More Show Less
    54 mins
  • 864: OpenAI’s o3-mini: SOTA reasoning and exponentially cheaper
    Feb 21 2025
    Jon Krohn investigates OpenAI’s new release, o3-mini, in this five-minute Friday, where he walks through the reasoning model’s capabilities and performance, cross-examining them against other major-league players, DeepSeek-R1, GPT-4o and Claude 3.5 Sonnet. Additional materials: www.superdatascience.com/864 Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information.
    Show More Show Less
    8 mins
  • 863: TabPFN: Deep Learning for Tabular Data (That Actually Works!), with Prof. Frank Hutter
    Feb 18 2025
    Jon Krohn talks tabular data with Frank Hutter, Professor of Artificial Intelligence at Universität Freiburg in Germany. Despite the great steps that deep learning has made in analysing images, audio, and natural language, tabular data has remained its insurmountable obstacle. In this episode, Frank Hutter details the path he has found around this obstacle even with limited data by using a ground-breaking transformer architecture. Named TabPFN, this approach is vastly outperforming other architectures, as testified by a write up of TabPFN’s capabilities in Nature. Frank talks about his work on version 2 of TabPFN, the architecture’s cross-industry applicability, and how TabPFN is able to return accurate results with synthetic data. This episode is brought to you by ODSC, the Open Data Science Conference. Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information. In this episode you will learn: (05:57) All about the TabPFN architecture (21:27) Use cases for Bayesian inference (35:07) On getting published in Nature (44:03) How TabPFN handles time series data (51:52) All about Prior Labs Additional materials: www.superdatascience.com/863
    Show More Show Less
    1 hr and 6 mins