Skip to main content

Jan Hendrik Metzen

Research Scientist at Prior Labs · Tabular Foundation Models · Scalable Pretraining

Portrait of Jan Hendrik Metzen

I am a Research Scientist at Prior Labs, where I develop the next generation of tabular foundation models. My work focuses on scalable pretraining, novel architectures, and scaling laws. I contribute to TabPFN-3, which scales tabular foundation models to datasets with up to one million training rows while substantially accelerating training and inference.

Previously, I was a Senior AI Researcher in Aleph Alpha Research’s Foundation Models team, working on efficient LLM pretraining, tokenizer-free architectures, and large-scale optimization. Before that, I was a Senior Expert at the Bosch Center for Artificial Intelligence, where my research centered on robust and reliable computer vision, neural architecture search, and synthetic data.

My broader interests span efficient and reliable machine learning, AutoML, and open-source software. I am an ELLIS member, an ELIZA Industrial Fellow, and an Area Chair for NeurIPS 2026. I have also contributed to scikit-learn, including its probability-calibration, kernel-ridge, and Gaussian-process modules.

I live in Böblingen and work in Freiburg. Outside research, I enjoy spending time with my wife and our three children. I am always happy to connect with people working on tabular foundation models, scalable pretraining, efficient or tokenizer-free architectures, and open-source machine learning. Contact me.

Research Interests

Tabular Foundation Models and Scalable Pretraining

I work on foundation models for structured data that learn from diverse (synthetic) datasets during pretraining and can then fit entirely new datasets in context—without fine-tuning—effectively turning model fitting into fast, direct inference. My current focus is scaling these models to substantially larger datasets, understanding their scaling behavior, and improving the efficiency of both pretraining and inference. This work includes TabPFN-3, which extends tabular foundation models to datasets with up to one million training rows.

Efficient and Tokenizer-Free Architectures

I am interested in architectures that make large-scale pretraining and inference more efficient. At Aleph Alpha Research, I worked on tokenizer-free language models, domain adaptation, and large-scale optimization. Our Hierarchical Autoregressive Transformer (HAT) uses a hierarchical architecture in which the encoder and decoder operate on bytes, while the backbone operates on regex-defined words; we scaled this approach to models with up to 70 billion parameters. Building on HAT, SOMBRERO learns the sequence aggregation end to end instead of relying on predefined word boundaries.

Reliable and Robust Machine Learning

My work on reliable computer vision studied both how models fail and how to make them safer. We used generative models to uncover systematic errors of image classifiers on rare subgroups and systematic errors of object detectors, and identified vulnerabilities of Transformer-based networks to adversarial patch and token attacks.

We developed architectures that are certifiably robust against patch attacks for image classification and semantic segmentation, as well as adversarial-training methods for universal perturbations and universal adversarial patches. I also worked on test-time adaptation under domain shift and studied how shape-biased representations affect robustness to common image corruptions.

AutoML and Neural Architecture Search

The vast design space of neural networks and the diversity of inference hardware make manual architecture design difficult to scale. Hardware-aware neural architecture search can automate this process, improving design efficiency and reducing the cost of AI development. For an overview, see our survey on Neural Architecture Search and our survey on Neural Architecture Search for Dense Prediction Tasks in Computer Vision.

My work in this area also includes efficient architecture search through network morphisms, multi-objective NAS via Lamarckian evolution, and meta-learning neural architectures for few-shot learning. We also collected practical lessons for making search more effective in our Bag of Tricks for Neural Architecture Search.

I also co-developed AutoCLIP, which automatically tunes zero-shot classifiers for vision-language models and improves performance across a broad range of domains.

News

May 13, 2026 We released the TabPFN-3 technical report and model weights on Hugging Face, scaling tabular foundation models to datasets with up to one million training rows.
Mar 16, 2026 As part of my previous role at Aleph Alpha Research, I contributed to A Family of LLMs Liberated from Static Vocabularies, presenting tokenizer-free architectures for efficient pretraining, adaptation, and inference. The HAT model family is available on Hugging Face.
Feb 1, 2026 I joined Prior Labs as a Research Scientist, working on the next generation of tabular foundation models.
Aug 12, 2024 I launched SVGStud.io, an AI-powered tool for searching, generating, and customizing scalable vector graphics.

Selected Publications

2026

  1. TabPFN-3: Technical Report
    Léo Grinsztajn, Klemens Flöge, Oscar Key, Felix Birkel, Philipp Jund, Brendan Roof, Mihir Manium, Shi Bin Hoo, Magnus Bühler, Anurag Garg, Dominik Safaric, Jake Robertson, Benjamin Jäger, Simone Alessi, Adrian Hayler, Vladyslav Moroshan, Lennart Purucker, Philipp Singer, Alan Arazi, Julien Siems, Jan Hendrik Metzen, Georg Grab, Nick Erickson, Siyuan Guo, Eliott Kalfon, Simon Bing, David Salinas, Clara Cornu, Lilly Charlotte Wehrhahn, Diana Kriuchkova, Kursat Kaya, Lydia Sidhoum, Marie Salmon, Jerry Chen, Madelon Hulsebos, Yann LeCun, Samuel Müller, Bernhard Schölkopf, Sauraj Gambhir, Noah Hollmann, and Frank Hutter
    2026

2024

  1. AutoCLIP: Auto-tuning Zero-Shot Classifiers for Vision-Language Models
    Jan Hendrik Metzen, Piyapat Saranrittichai, and Chaithanya Kumar Mummadi
    Transactions on Machine Learning Research, 2024
  2. Label-free Neural Semantic Image Synthesis
    Jiayi Wang, Kevin Alexander Laube, Yumeng Li, Jan Hendrik Metzen, Shin-I Cheng, Julio Borges, and Anna Khoreva
    In European Conference on Computer Vision (ECCV), 2024
  3. Feature Distillation Improves Zero-Shot Transfer from Synthetic Images
    Niclas Popp, Jan Hendrik Metzen, and Matthias Hein
    Transactions on Machine Learning Research, 2024

2023

  1. Identification of Systematic Errors of Image Classifiers on Rare Subgroups
    Jan Hendrik Metzen, Robin Hutmacher, N Grace Hua, Valentyn Boreiko, and Dan Zhang
    In International Conference on Computer Vision, 2023

2022

  1. Give Me Your Attention: Dot-Product Attention Considered Harmful for Adversarial Patch Robustness
    Giulio Lovisotto, Nicole Finnie, Mauricio Munoz, Chaithanya Kumar Mummadi, and Jan Hendrik Metzen
    In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022