2 min readfrom Machine Learning

TabPFN-3 just released: a pre-trained tabular foundation model for up to 1M rows [R][N]

Our take

Today marks the release of TabPFN-3, an advanced pre-trained tabular foundation model capable of handling up to 1 million rows in a single forward pass, eliminating the need for training or tuning. Building on the success of its predecessors, TabPFN-3 offers significant improvements in scale, speed, and accuracy, achieving a remarkable 93% win rate over classical machine learning on TabArena. With versatile deployment options and enhanced capabilities, this model empowers users to transform their data-driven workflows.

The recent release of TabPFN-3 marks a significant advancement in the realm of tabular data processing, further solidifying the model's place in the evolving landscape of AI-native spreadsheet technology. TabPFN, which allows users to make predictions on tabular data with a single forward pass—eliminating the need for training, hyperparameter tuning, and extensive preprocessing—has already garnered substantial attention with its predecessor models, TabPFN-2.5 and TabPFNv2, drawing over three million downloads and powering over 200 published applications. This new iteration enhances its capabilities, particularly with its ability to handle up to one million rows on a single H100 GPU, ten times larger than its predecessor. Such scalability not only accelerates data processing but also empowers users to tackle more complex datasets, which is crucial as organizations increasingly rely on data-driven insights.

The implications of TabPFN-3 extend far beyond mere performance enhancements. Its introduction of a reduced KV cache and row-chunked inference positions it as a practical solution for users who might have previously felt restricted by legacy tools. This model doesn’t just streamline the prediction process; it redefines it, allowing users to engage with their data in ways that were previously unattainable. For those exploring modern approaches to data, such as the insights shared in Using Polars Instead of Pandas: Performance Deep Dive, TabPFN-3 offers a compelling alternative that prioritizes both speed and accuracy. The reported win rate of 93% over classical machine learning methods on platforms like TabArena is a testament to its effectiveness, demonstrating how AI can simplify and enrich the data analysis experience.

Moreover, the introduction of the Thinking Mode API and calibrated quantile regression features showcases TabPFN-3’s versatility. It not only enhances prediction accuracy but also expands the model's applicability to adjacent tasks like time-series analysis and interpretability. As organizations continue to face challenges in managing vast amounts of data, the ability to derive actionable insights quickly and accurately will be paramount. This aligns with the themes discussed in our article on 5 Useful Python Scripts for Time Series Analysis, where innovative tools are essential for navigating complex data landscapes. The potential for TabPFN-3 to lift the capabilities of various analytical tasks presents a transformative opportunity for users looking to enhance their data strategies.

Looking ahead, the accessibility and deployment options provided by TabPFN-3—including API integration, enterprise licensing, and open-source weights—are likely to foster a broader adoption of AI in data management. This democratization of technology invites users from diverse backgrounds to explore its capabilities without the heavy lifting typically associated with AI implementations. As we move toward a future where data-driven decision-making becomes increasingly integral to business success, the question remains: how will organizations adapt to and leverage these advancements? The path forward is not just about adopting new tools but about rethinking workflows and embracing a future focused on innovation and productivity. The developments brought by TabPFN-3 could very well be the catalyst for a new era in data management, one that is more intuitive, efficient, and aligned with user needs.

TabPFN-3 was released today, the next iteration of the tabular foundation model, originally published in Nature.

Quick recap for anyone new to TabPFN: TabPFN predicts on tabular data in a single forward pass - no training, no hyperparameter search, no tuning. Built on TabPFN-2.5 (Nov 2025) and TabPFNv2 (Nature, Jan 2025), which together crossed 3M downloads and 200+ published applications.

What's new:

  • Scale: 1M rows on a single H100 (10x larger than 2.5).A reduced KV cache (~8GB per million rows per estimator) and row-chunked inference make this practical on a single GPU
  • Speed: 10x-1000x faster inference than previous versions. 120x on SHAP via KV caching
  • Thinking Mode (API only): test-time compute pushes predictions further via one-time extra fitting at inference. Beats every non-TabPFN method on TabArena by over 200 Elo, including 4-hour-tuned AutoGluon 1.5 extreme. Gap more than doubles to 420 Elo on the larger-data slice.
  • Accuracy: it has a 93% win rate over classical ML on TabArena
  • Many-class: native non-parametric retrieval decoder supporting up to 160 classes
  • Calibrated quantile regression: bar-distribution regression head produces calibrated quantile predictions in a single forward pass
  • Lifts adjacent tasks: time-series, interpretability, and new SOTA on relational benchmarks.
  • 3 deployment paths: API, enterprise licensing, and open-source weights (permissive for research and academic evaluation)

You can try it here or read the model report here. Happy to answer questions in the comments.

submitted by /u/rsesrsfh
[link] [comments]

Read on the original site

Open the publisher's page for the full experience

View original article

Tagged with

#real-time data collaboration#generative AI for data analysis#Excel alternatives for data analysis#rows.com#real-time collaboration#enterprise data management#natural language processing for spreadsheets#big data management in spreadsheets#conversational data analysis#intelligent data visualization#data visualization tools#big data performance#data analysis tools#spreadsheet API integration#data cleaning solutions#machine learning in spreadsheet applications#enterprise-level spreadsheet solutions#AI-native spreadsheets#row zero#cloud-based spreadsheet applications