Q-score benchmark
Benchmarking results
List of acronyms:
AC: Advanced Compilation method
CE: Constructor Evaluation (checked if the evaluation is done by the chip manufacturer)
EM: Error mitigation
ER: Erdös-Renyi
HS: Hybrid Solver
QA: Quantum Annealer
QC: Quantum Circuit
QCE: Quantm Computer Emulation
SA: Simulated Annealing
SP: Scientific paper (checked if a scientific paper explain the results)
TS: Tabu Search
UD: Unit Disk
Date & Ref | CE | SP | Problem | Graph class | #Instances | Shots | Time limit | Method | CPU / QPU | Score | Comment |
---|---|---|---|---|---|---|---|---|---|---|---|
2021/02 [1] | x | Max-Cut | ER G(n, p=1/2) | 100 | 2048 | / | QCE | Grid connectivity Depolarizing noise $$\epsilon_1=0.4 %$$$$\epsilon_2=2 %$$ | 11 | ||
2021/02 [1] | x | Max-Cut | ER G(n, p=1/2) | 100 | 2048 | / | QCE | Full connectivitydepolarizing noise$$\epsilon_1=0.4 %$$$$\epsilon_2=2 %$$ | 21 | ||
2022/08 [2] | x | Max-Cut | ER G(n, p=1/2) | 100 | 60s | TS | i7-7600u | 2300 | |||
2022/08 [2] | x | Max-Cut | ER G(n, p=1/2) | 100 | 60s | SA | i7-7600u | 5800 | |||
2022/08 [2] | x | Max-Cut | ER G(n, p=1/2) | 100 | 60s | QA | D-Wave 2000Q | 70 | |||
2022/08 [2] | x | Max-Cut | ER G(n, p=1/2) | 100 | 60s | QA | D-Wave Advantage | 140 | |||
2022/08 [2] | x | Max-Cut | ER G(n, p=1/2) | 100 | 60s | QA | D-Wave hybrid solver | 12500 | |||
2022/07 [3] | x | Max-Cut | ER G(n, p=1/2) | 500 for n=610 to for n=16 | 1000 | / | QCE | Pasqal Pulser library | 18 | ||
2022/07 [3] | x | MIS | UD G(n, p=1/2) | 500 for n=610 to for n=16 | 1000 | / | QCE | Pasqal Pulser library | 18 | ||
2023/02 [4] | x | Max-clique | ER G(n, p=1/2) | 100 | 60s | TS | i7-7600u | 4900 | |||
2023/02 [4] | x | Max-clique | ER G(n, p=1/2) | 100 | 60s | SA | i7-7600u | 9100 | |||
2023/02 [4] | x | Max-clique | ER G(n, p=1/2) | 100 | 60s | QA | D-Wave 2000Q | 70 | |||
2023/02 [4] | x | Max-clique | ER G(n, p=1/2) | 100 | 60s | QA | D-Wave Advantage | 110 | |||
2023/02 [4] | x | Max-clique | ER G(n, p=1/2) | 100 | 60s | HS | D-Wave hybrid solver | 12500 | |||
2023/02 [4] | x | Max-clique | ER G(n, p=1/2) | 100 | 60s | QC | QuTech Starmon-5 | 5 | |||
2023/02 [4] | x | Max-clique | ER G(n, p=1/2) | 100 | 60s | QC | IBM Gadalupe | 5 | |||
2023/10 [5] | x | Max-Cut | ER G(n, p=1/2) | 100 | / | QC | IQM-20 | 8 | |||
2024/02 [6] | x | x | Max-Cut | ER G(n, p=1/2) | 100 | 2048 | / | QC | IQM Spark | 6 | |
2024/02 [7] | x | Max-Cut | ER G(n, p=1/2) | 100 | 2048 | / | QC | IQM-20 | 11 | ||
2024 [8] | x | Max-Cut | / | / | QC | Quandela Altaïr | 6 | EM & AC |
Motivation
The Q-score protocol, introduced by S. Martiel et al. in 2021 [1] aims to establish an application-centric, hardware agnostic, and scalable figure of merit for assessing the performance of quantum computers. A key motivation behind this protocol is to identify challenging real-world optimization problems whose solution processes could be accelerated through quantum computing.
Protocol
The Q-score protocol measures the maximum size of Maxcut instances that can be solved effectively on a quantum computer. The main idea is to pick a class of random graphs (e.g., Erdös-Renyi graphs with \(n\) nodes and a specific edge probability) and to analytically derive the maximum cut size \(C_{max}(n)\) and average random cut size \(C_{rand}(n)\) for this class. The average expectation value obtained from the sampling of the quantum computer \(C^Q(n)\) is used to compute the ratio \(\beta(n)\):
\[\beta(n) = \frac{C^Q(n) - C_{rand}(n)}{C_{max}(n) - C_{rand}(n)}\]The quantum computer passes the Q-score for instances of size \(n\) if \(\beta(n) > \beta^*\) (\(\beta^* \in ]0,1[\)), where \(\beta^*\) represents the level of difficulty of the test. The formula used to compute the ratio depends on the class of studied graphs. For their experiments, the authors arbitrarily set \(\beta^*=0.2\).
The final value of the score is defined by:
\[n^* = \max \{ n \in \mathbb{N}, \beta(n) > \beta^* \}\]This score can be extended to other problems, it only depends on the ability of finding derivation of the values \(C_{max}(n)\) and \(C_{rand}(n)\).
Assumptions
- The Q-score does not have runtime limit, but considers that the classical algorithm implementation runs in polynomial time. Subsequent studies has proposed a 60s runtime limit for each instance (see [4]).
- no assumptions on compilation processing (but the implementation should run in polynomial time).
- no assumptions on error mitigation technique (but the implementation should run in polynomial time).
Configuration of experiments
- Size of instance set recommended: 100
- Number of shots recommended: 2048
- Default level of difficulty: \(\beta^* = 0.2\)
Implementations
Several implementations of the Q-Score are available on GitHub:
- The initial implementation created by the authors of the Q-score is available here.
- A tutorial implementation from the QCMet software repository [9] is available here.
- An implementation from the IQM Benchmarks suite is available here.
Extensions to the Q-score
Max-clique extension
In [4], the authors extend the Q-Score to the Max-clique problem. For Erdös-Renyi graphs \(G(n, p)\), the average random cost \(C_{rand}(n)\) is set to (proof in [4]):
\[C_{rand}(n) = \sum_{i=1}^n i \times (1-p^i)p^{i(i-1)/2}\]For this kind of problem, \(C_{max}\) is set to (proof in [10]):
\[C_{max}(n) = 2 \log_2(n) - 2 \log_2(\log_2(n)) + 2 \log_2\left(\frac{e}{2}\right) +1\]An implementation of the Q-score Max-clique is available here.
Empirical optimal and random average cost
In [3], the authors propose determining \(C_{max}(n)\) for arbitrary problems using exact methods, thereby circumventing the need to derive theoretical bounds for both \(C_{max}\) and \(C_{rand}\). While this approach expands the range of problems to which the Q-score can be applied, it does so at the cost of reduced scalability.
References
- [1]S. Martiel, T. Ayral, and C. Allouche, “Benchmarking quantum coprocessors in an application-centric, hardware-agnostic, and scalable way,” IEEE Transactions on Quantum Engineering, vol. 2, pp. 1–11, 2021.
- [2]W. van der Schoot, D. Leermakers, R. Wezeman, N. Neumann, and F. Phillipson, “Evaluating the Q-score of Quantum Annealers,” in 2022 IEEE International Conference on Quantum Software (QSW), 2022, pp. 9–16.
- [3]W. da S. Coelho, M. D’Arcangelo, and L.-P. Henry, “Efficient protocol for solving combinatorial graph problems on neutral-atom quantum processors,” arXiv preprint arXiv:2207.13030, 2022.
- [4]W. van der Schoot, R. Wezeman, N. M. P. Neumann, F. Phillipson, and R. Kooij, “Q-score Max-Clique: The First Quantum Metric Evaluation on Multiple Computational Paradigms,” arXiv preprint arXiv:2302.00639, 2023.
- [5]IQM, “Finland launches a 20-qubit quantum computer - development towards more powerful quantum computers continues.” 2023 [Online]. Available at: https://www.meetiqm.com/newsroom/press-releases/finland-launches-a-20-qubit-quantum-computer
- [6]J. Rönkkö et al., “On-premises superconducting quantum computer for education and research,” EPJ Quantum Technology, vol. 11, no. 1, p. 32, 2024.
- [7]IQM, “IQM quantum computers achieves a new benchmark result on 20-qubit quantum computer.” 2024 [Online]. Available at: https://www.meetiqm.com/newsroom/press-releases/iqm-achieves-20-qubit-benchmark-result
- [8]Quandela, “Introducing Quandela cloud 2.0.” 2024 [Online]. Available at: https://www.quandela.com/cloud/
- [9]D. Lall et al., “A review and collection of metrics and benchmarks for quantum computers: definitions, methodologies and software,” arXiv preprint arXiv:2502.06717, 2025.
- [10]D. W. Matula, The largest clique size in a random graph. Department of Computer Science, Southern Methodist University Dallas, Texas …, 1976.