Jan Hendrik Metzen
Research Scientist at Prior Labs · Tabular Foundation Models · Scalable Pretraining
I am a Research Scientist at Prior Labs, where I develop the next generation of tabular foundation models. My work focuses on scalable pretraining, novel architectures, and scaling laws. I contribute to TabPFN-3, which scales tabular foundation models to datasets with up to one million training rows while substantially accelerating training and inference.
Previously, I was a Senior AI Researcher in Aleph Alpha Research’s Foundation Models team, working on efficient LLM pretraining, tokenizer-free architectures, and large-scale optimization. Before that, I was a Senior Expert at the Bosch Center for Artificial Intelligence, where my research centered on robust and reliable computer vision, neural architecture search, and synthetic data.
My broader interests span efficient and reliable machine learning, AutoML, and open-source software. I am an ELLIS member, an ELIZA Industrial Fellow, and an Area Chair for NeurIPS 2026. I have also contributed to scikit-learn, including its probability-calibration, kernel-ridge, and Gaussian-process modules.
I live in Böblingen and work in Freiburg. Outside research, I enjoy spending time with my wife and our three children. I am always happy to connect with people working on tabular foundation models, scalable pretraining, efficient or tokenizer-free architectures, and open-source machine learning. Contact me.
Research Interests
Tabular Foundation Models and Scalable Pretraining
I work on foundation models for structured data that learn from diverse (synthetic) datasets during pretraining and can then fit entirely new datasets in context—without fine-tuning—effectively turning model fitting into fast, direct inference. My current focus is scaling these models to substantially larger datasets, understanding their scaling behavior, and improving the efficiency of both pretraining and inference. This work includes TabPFN-3, which extends tabular foundation models to datasets with up to one million training rows.
Efficient and Tokenizer-Free Architectures
I am interested in architectures that make large-scale pretraining and inference more efficient. At Aleph Alpha Research, I worked on tokenizer-free language models, domain adaptation, and large-scale optimization. Our Hierarchical Autoregressive Transformer (HAT) uses a hierarchical architecture in which the encoder and decoder operate on bytes, while the backbone operates on regex-defined words; we scaled this approach to models with up to 70 billion parameters. Building on HAT, SOMBRERO learns the sequence aggregation end to end instead of relying on predefined word boundaries.
Reliable and Robust Machine Learning
My work on reliable computer vision studied both how models fail and how to make them safer. We used generative models to uncover systematic errors of image classifiers on rare subgroups and systematic errors of object detectors, and identified vulnerabilities of Transformer-based networks to adversarial patch and token attacks.
We developed architectures that are certifiably robust against patch attacks for image classification and semantic segmentation, as well as adversarial-training methods for universal perturbations and universal adversarial patches. I also worked on test-time adaptation under domain shift and studied how shape-biased representations affect robustness to common image corruptions.
AutoML and Neural Architecture Search
The vast design space of neural networks and the diversity of inference hardware make manual architecture design difficult to scale. Hardware-aware neural architecture search can automate this process, improving design efficiency and reducing the cost of AI development. For an overview, see our survey on Neural Architecture Search and our survey on Neural Architecture Search for Dense Prediction Tasks in Computer Vision.
My work in this area also includes efficient architecture search through network morphisms, multi-objective NAS via Lamarckian evolution, and meta-learning neural architectures for few-shot learning. We also collected practical lessons for making search more effective in our Bag of Tricks for Neural Architecture Search.
I also co-developed AutoCLIP, which automatically tunes zero-shot classifiers for vision-language models and improves performance across a broad range of domains.
News
| May 13, 2026 | We released the TabPFN-3 technical report and model weights on Hugging Face, scaling tabular foundation models to datasets with up to one million training rows. |
|---|---|
| Mar 16, 2026 | As part of my previous role at Aleph Alpha Research, I contributed to A Family of LLMs Liberated from Static Vocabularies, presenting tokenizer-free architectures for efficient pretraining, adaptation, and inference. The HAT model family is available on Hugging Face. |
| Feb 1, 2026 | I joined Prior Labs as a Research Scientist, working on the next generation of tabular foundation models. |
| Aug 12, 2024 | I launched SVGStud.io, an AI-powered tool for searching, generating, and customizing scalable vector graphics. |