JSON Parsing Performance

The cluster centers on debates about whether JSON parsing is a performance bottleneck, especially for large documents, with frequent mentions of fast libraries like simdjson achieving GB/s speeds and advice on profiling and alternatives.

➡️ Stable 0.5x Open Source
2,085
Comments
19
Years Active
5
Top Authors
#110
Topic ID

Activity Over Time

2008
6
2009
11
2010
28
2011
42
2012
44
2013
59
2014
64
2015
101
2016
114
2017
89
2018
78
2019
213
2020
200
2021
231
2022
171
2023
236
2024
189
2025
184
2026
25

Keywords

CPU S3 JS MM UTF8 END lemire.me MB DDTHH XML json parsing parse parser streaming bottleneck file slow data performance

Sample Comments

keypusher Jul 24, 2011 View on HN

If JSON parsing is a bottleneck in your application, you're doing it wrong.

wereHamster Jun 23, 2020 View on HN

Is JSON parsing still a bottleneck? https://github.com/simdjson/simdjson

craigching Nov 5, 2023 View on HN

Probably anywhere that requires parsing large JSON documents. Off the shelf JSON parsers are notoriously slow on large JSON documents.

ec109685 May 27, 2019 View on HN

What is the reason not to use the micro optimized JSON implementation if parsing becomes your bottleneck?

numlock86 Aug 6, 2020 View on HN

I guess it depends in your use-case. Looks like this was primarily made for large JSON files and not the typical small JSON payloads you encounter with HTTP bodies and the like. On top of that JSON.parse() is pretty heavily optimized already. Profiling is key.

glangdale Feb 21, 2019 View on HN

There are some quite big JSON files out there; you might also be interested in parsing megabytes but not spending more than 1ms to get through it.

loeg Nov 5, 2023 View on HN

You might also move to something other than JSON if parsing it is a significant part of your workload.

chadaustin Mar 11, 2020 View on HN

It can be a big chunk of the cost of parsing JSON.

pjmlp Dec 9, 2014 View on HN

If at least they had used a JSON parser instead of a full blown (slow) language!

323 Nov 26, 2022 View on HN

Further evidence is the fact that optimized SIMD JSON or UTF8 libraries exist. If I/O was the bottleneck, there wouldn't be a need to parse JSON using SIMD.