---
slug: 2026-arxiv-2604.18837-qsvm-tabular-benchmarks
title: 'Benchmarking Quantum Kernel Support Vector Machines Against Classical Baselines
  on Tabular Data: A Rigorous Empirical Study with Hardware Validation'
source: arXiv
link: https://arxiv.org/abs/2604.18837
category: app
triage_date: '2026-05-16'
triager: port_until_julien_parallel
bootstrap_date: '2026-06-07'
---

# Dossier: Benchmarking Quantum Kernel Support Vector Machines Against Classical Baselines on Tabular Data: A Rigorous Empirical Study with Hardware Validation

| Field        | Value |
|--------------|-------|
| Slug         | `2026-arxiv-2604.18837-qsvm-tabular-benchmarks` |
| Source       | arXiv |
| Link         | [https://arxiv.org/abs/2604.18837](https://arxiv.org/abs/2604.18837) |
| Category     | app |
| Triaged      | 2026-05-16 by port_until_julien_parallel |
| Bootstrapped | 2026-06-07 |

## Sourcing (Romain)

### Domain classification

`machine learning` (primary). This is a quantum machine learning
methodology and benchmarking study: quantum kernel support vector
machines (QSVMs) for supervised binary classification, with fidelity
kernels evaluated on simulators and IBM superconducting hardware.

The nine datasets are mostly biomedical and physical-science tabular
benchmarks (breast cancer, diabetes, ionosphere radar, sonar, etc.), so
there is an incidental `other` flavour, but those are standard SVM
benchmark sets chosen for their statistical profiles, not the subject of
study. The contribution is squarely QML, not a domain application; treat
the data domains as context only.

### Expert persona for Julien

A quantum machine learning researcher fluent in quantum kernel methods:
fidelity/overlap kernels $k_Q(\bm{x},\bm{z}) = |\braket{\psi(\bm{x})}{\psi(\bm{z})}|^2$,
data-encoding feature-map circuits (ZZFeatureMap and double-feature
rotation encodings), and kernel-target alignment for trainable kernels.
Should have hands-on Qiskit experience on IBM superconducting hardware
and calibrated intuition about qubit count versus feature dimension,
post-transpilation depth and two-qubit (CX/CZ) gate counts, shot budgets,
and depolarising noise models. The hardware fingerprint is stated
explicitly in Section 4.4 and Appendix C, so the expert needs to read it
off cleanly rather than re-derive circuit constructions.

### Related prior work (brief bibliography)

- Havlíček et al., "Supervised learning with quantum-enhanced feature
  spaces," *Nature* 567, 209 (2019). DOI: 10.1038/s41586-019-0980-2 -
  origin of the ZZFeatureMap fidelity kernel on IBM superconducting
  hardware; the canonical same-primitive-on-IBM reference.
- Peters et al., "Machine learning of high dimensional data on a noisy
  quantum processor," *npj Quantum Information* 7, 161 (2021). DOI:
  10.1038/s41534-021-00498-9 - same primitive (quantum kernel SVM) on
  Google Sycamore; the direct cross-vendor portability comparison.
- Schnabel & Roth, "Quantum kernel methods under scrutiny: a
  benchmarking study," *Quantum Machine Intelligence* 7, 58 (2025). DOI:
  10.1007/s42484-025-00273-5 - closest methodological predecessor
  (nested CV, statistical testing on QSVMs); the paper positions itself
  against it.
- Bowles, Ahmed & Schuld, "Better than classical? The subtle art of
  benchmarking quantum machine learning models," arXiv:2403.07059 (2024)
  - benchmarking discipline for QML against strong classical baselines
  across datasets.
- Thanasilp et al., "Exponential concentration in quantum kernel
  methods," *Nature Communications* 15, 5200 (2024). DOI:
  10.1038/s41467-024-49287-w - the mechanistic limit (kernel
  concentration with qubit count) behind the paper's spectral analysis.
- Huang et al., "Power of data in quantum machine learning," *Nature
  Communications* 12, 2631 (2021). DOI: 10.1038/s41467-021-22539-9 -
  classical-surrogate matching; frames when quantum kernels can beat
  classical methods at all.

### Difficulty estimate

`low`. The full LaTeX source sits in the dossier (`paper-source/`, with
sections plus appendices A–D); it is an open arXiv preprint, no paywall.
The hardware fingerprint is handed over explicitly: IBM ibm\_fez (Heron
r2, 156 qubits), native gate set $\{rz, sx, cz, x\}$, transpiler
`optimization_level=3`, 1\,024 shots per kernel entry, and a depolarising
noise model with $p_{1q} = 10^{-3}$ / $p_{2q} = 10^{-2}$ for the noisy
path. No exotic primitives - standard QSVM with four feature maps and
three backends (ideal statevector, noisy density matrix, hardware). The
main effort is breadth (four feature maps $\times$ three backends, plus
per-qubit-count circuit tables in Appendix B), not depth; each datum is
stated, so the extraction is volume rather than difficulty.