BLAS Performance Comparisons
Discussions center on benchmarking a new matrix library against optimized BLAS implementations such as OpenBLAS, MKL, and Eigen, debating whether it actually outperforms them on various hardware, and questioning the value of alternatives to established linear algebra libraries.
Sample Comments
Why exactly is this better than atlas / blas / any library using it, e.g. Eigen?
BLAS is a very well optimized library. A lot of it is in Fortran, which can be faster than C, and it is very heavily used in scientific computing. BLAS implementations also have routines that were hand-tuned in assembly. It's not magic, but the amount of work that has gone into it is not something you would want to replicate.
Are OpenBLAS and MKL not well optimized lol? They literally compared against OpenBLAS/MKL and posted the results in the article. As someone already mentioned, this implementation is faster than MKL even on an Intel Xeon with 96 cores. Maybe you missed the point, but the purpose of the article was to show HOW to implement matmul with NumPy-like performance without Fortran/assembly code, NOT how to write a BLIS-competitive library. So the article and the code look good to me.
It would help to see performance benchmarks against blas or armadillo, etc.
I always hear this "fast matrix operations" argument from MATLAB users, but don't they both use BLAS? The difference can only be marginal.
Changed my matrix library to a BLAS binding for machine learning: 150x speedup.
Uses BLAS but no mention of cuBLAS to speed things up? Does that mean the linear algebra wasn't a big enough component to be worth optimizing?
Eagerly awaiting matrix libraries written in pure Python that outperform BLAS.
You don't just call DGEMM from vendor BLAS?
Don't all high performance math libraries have the option of LAPACK interfaces?
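Several comments above hinge on the same point: a "just call DGEMM" binding beats a hand-rolled loop by orders of magnitude. A minimal sketch of that gap, using NumPy (whose `@` operator dispatches to the GEMM routine of whatever BLAS NumPy was built against, e.g. OpenBLAS or MKL) versus a textbook triple loop; the matrix size and timing harness here are illustrative choices, not from the discussion:

```python
import time
import numpy as np

def naive_matmul(A, B):
    """Textbook O(n^3) matrix multiply: no blocking, no vectorization."""
    n, k = A.shape
    k2, m = B.shape
    assert k == k2
    C = np.zeros((n, m))
    for i in range(n):
        for j in range(m):
            s = 0.0
            for p in range(k):
                s += A[i, p] * B[p, j]
            C[i, j] = s
    return C

rng = np.random.default_rng(0)
n = 96  # kept small so the pure-Python loop finishes quickly
A = rng.standard_normal((n, n))
B = rng.standard_normal((n, n))

t0 = time.perf_counter()
C_naive = naive_matmul(A, B)
t_naive = time.perf_counter() - t0

t0 = time.perf_counter()
C_blas = A @ B  # BLAS dgemm under the hood
t_blas = time.perf_counter() - t0

# Both paths compute the same product; only the speed differs.
assert np.allclose(C_naive, C_blas)
print(f"naive: {t_naive:.3f}s  BLAS: {t_blas:.5f}s")
```

Even at this tiny size the BLAS path is typically orders of magnitude faster, which is the gap the "150x speedup" comment above is describing; the exact ratio depends on the BLAS build and hardware.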