Expand description
sonic-rs
A fast Rust JSON library based on SIMD. It has some references to other open-source libraries like sonic_cpp, serde_json, sonic, simdjson and rust-lang.
Requirements/Notes
- Support x86_64 or aarch64. Note that the performance in aarch64 is low and it need to optimize.
- Rust nightly version. Because we use the
packed_simdcrate. - Not validating the UTF-8 when parsing from slice by default. You can add the
utf8feature to enable the validation. The performance loss is about 3% ~ 10%.
Features
- Serde into Rust struct as
serde_jsonandserde - Parse/Serialize JSON for untyped document, and document can be mutable
- Get specific fields from a JSON with blazing performance
- Use JSON as a lazied array or object iterator
Benchmark
Architecture: x86_64
Model name: Intel(R) Xeon(R) Platinum 8260 CPU @ 2.40GHzDeserialize Struct
cargo bench --bench deserialize_struct --features utf8 -- --quiet
twitter/sonic_rs::from_slice
time: [718.60 µs 724.47 µs 731.05 µs]
twitter/simd_json::from_slice
time: [1.0325 ms 1.0486 ms 1.0664 ms]
twitter/serde_json::from_slice
time: [2.3070 ms 2.3271 ms 2.3506 ms]
twitter/serde_json::from_str
time: [1.3797 ms 1.3996 ms 1.4237 ms]
citm_catalog/sonic_rs::from_slice
time: [1.3413 ms 1.3673 ms 1.3985 ms]
citm_catalog/simd_json::from_slice
time: [2.3324 ms 2.4122 ms 2.4988 ms]
citm_catalog/serde_json::from_slice
time: [3.0485 ms 3.0965 ms 3.1535 ms]
citm_catalog/serde_json::from_str
time: [2.4495 ms 2.4661 ms 2.4836 ms]
canada/sonic_rs::from_slice
time: [4.3249 ms 4.4713 ms 4.6286 ms]
canada/simd_json::from_slice
time: [8.3872 ms 8.5095 ms 8.6519 ms]
canada/serde_json::from_slice
time: [6.5207 ms 6.5938 ms 6.6787 ms]
canada/serde_json::from_str
time: [6.6534 ms 6.8373 ms 7.0402 ms]Deserialize Untyped
cargo bench --bench deserialize_value --features utf8 -- --quiet
twitter/sonic_rs_dom::from_slice
time: [624.60 µs 631.67 µs 639.76 µs]
twitter/simd_json::slice_to_borrowed_value
time: [1.2524 ms 1.2784 ms 1.3083 ms]
twitter/serde_json::from_slice
time: [4.1991 ms 4.3552 ms 4.5264 ms]
twitter/serde_json::from_str
time: [3.0258 ms 3.1086 ms 3.2005 ms]
twitter/simd_json::slice_to_owned_value
time: [1.8195 ms 1.8382 ms 1.8583 ms]
citm_catalog/sonic_rs_dom::from_slice
time: [1.8528 ms 1.8962 ms 1.9452 ms]
citm_catalog/simd_json::slice_to_borrowed_value
time: [3.5543 ms 3.6127 ms 3.6814 ms]
citm_catalog/serde_json::from_slice
time: [9.0163 ms 9.2052 ms 9.4167 ms]
citm_catalog/serde_json::from_str
time: [8.0306 ms 8.1450 ms 8.2843 ms]
citm_catalog/simd_json::slice_to_owned_value
time: [4.2538 ms 4.3171 ms 4.3990 ms]
canada/sonic_rs_dom::from_slice
time: [5.2105 ms 5.2761 ms 5.3474 ms]
canada/simd_json::slice_to_borrowed_value
time: [12.557 ms 12.773 ms 13.031 ms]
canada/serde_json::from_slice
time: [14.875 ms 15.073 ms 15.315 ms]
canada/serde_json::from_str
time: [14.603 ms 14.868 ms 15.173 ms]
canada/simd_json::slice_to_owned_value
time: [12.548 ms 12.637 ms 12.737 ms]Serialize Untyped
cargo bench --bench serialize_value -- --quiet
twitter/sonic_rs::to_string
time: [380.90 µs 390.00 µs 400.38 µs]
twitter/serde_json::to_string
time: [788.98 µs 797.34 µs 807.69 µs]
twitter/simd_json::to_string
time: [965.66 µs 981.14 µs 998.08 µs]
citm_catalog/sonic_rs::to_string
time: [805.85 µs 821.99 µs 841.06 µs]
citm_catalog/serde_json::to_string
time: [1.8299 ms 1.8880 ms 1.9498 ms]
citm_catalog/simd_json::to_string
time: [1.7356 ms 1.7636 ms 1.7972 ms]
canada/sonic_rs::to_string
time: [6.5808 ms 6.7082 ms 6.8570 ms]
canada/serde_json::to_string
time: [6.4800 ms 6.5747 ms 6.6893 ms]
canada/simd_json::to_string
time: [7.3751 ms 7.5690 ms 7.7944 ms]Serialize Struct
cargo bench --bench serialize_struct -- --quiet
twitter/sonic_rs::to_string
time: [434.03 µs 448.25 µs 463.97 µs]
twitter/simd_json::to_string
time: [506.21 µs 515.54 µs 526.35 µs]
twitter/serde_json::to_string
time: [719.70 µs 739.97 µs 762.69 µs]
canada/sonic_rs::to_string
time: [4.6701 ms 4.7481 ms 4.8404 ms]
canada/simd_json::to_string
time: [5.8072 ms 5.8793 ms 5.9625 ms]
canada/serde_json::to_string
time: [4.5708 ms 4.6281 ms 4.6967 ms]
citm_catalog/sonic_rs::to_string
time: [624.86 µs 629.54 µs 634.57 µs]
citm_catalog/simd_json::to_string
time: [624.10 µs 633.55 µs 644.78 µs]
citm_catalog/serde_json::to_string
time: [802.10 µs 814.15 µs 828.10 µs]Get from JSON
cargo bench --bench get_from -- --quiet
twitter/sonic-rs::get_from_str
time: [79.432 µs 80.008 µs 80.738 µs]
twitter/gjson::get time: [344.41 µs 351.36 µs 362.03 µs]Usage
Serde into Rust Type
Directly use the Deserialize or Serialize trait, recommended use sonic_rs::{Deserialize, Serialize}.
use sonic_rs::{Deserialize, Serialize};
// or use serde::{Deserialize, Serialize};
#[derive(Serialize, Deserialize)]
struct Person {
name: String,
age: u8,
phones: Vec<String>,
}
fn main() {
let data = r#"{
"name": "Xiaoming",
"age": 18,
"phones": [
"+123456"
]
}"#;
let p: Person = sonic_rs::from_str(data).unwrap();
assert_eq!(p.age, 18);
assert_eq!(p.name, "Xiaoming");
let out = sonic_rs::to_string_pretty(&p).unwrap();
assert_eq!(out, data);
}
Get a field from JSON
Get a specific field from a JSON with the pointer path. The return is a LazyValue, which is a wrapper of a raw JSON slice. Note that the JSON must be valid and well-formed, otherwise it may return unexpected result.
use sonic_rs::{get_from_str, pointer, JsonValue, PointerNode};
fn main() {
let path = pointer!["a", "b", "c", 1];
let json = r#"
{"u": 123, "a": {"b" : {"c": [null, "found"]}}}
"#;
let target = get_from_str(json, path.iter()).unwrap();
assert_eq!(target.as_raw_str(), r#""found""#);
assert_eq!(target.as_str().unwrap(), "found");
let path = pointer!["a", "b", "c", "d"];
let json = r#"
{"u": 123, "a": {"b" : {"c": [null, "found"]}}}
"#;
// not found from json
let target = get_from_str(json, path.iter());
assert!(target.is_err());
}
Parse and Serialize into untyped Value
Parse a JSON as a document, and the document is mutable.
use sonic_rs::value::{dom_from_slice, Value};
use sonic_rs::PointerNode;
use sonic_rs::{pointer, JsonValue};
fn main() {
let json = r#"{
"name": "Xiaoming",
"obj": {},
"arr": [],
"age": 18,
"address": {
"city": "Beijing"
},
"phones": [
"+123456",
]
}"#;
let mut dom = dom_from_slice(json.as_bytes()).unwrap();
// get the value from dom
let root = dom.as_value();
// get key from value
let age = root.get("age").as_i64();
assert_eq!(age.unwrap_or_default(), 18);
// get by index
let first = root["phones"][0].as_str().unwrap();
assert_eq!(first, "+123456");
// get by pointer
let phones = root.pointer(&pointer!["phones", 0]);
assert_eq!(phones.as_str().unwrap(), "+123456");
// convert to mutable object
let mut obj = dom.as_object_mut().unwrap();
let value = Value::new_bool(true);
obj.insert("inserted", value);
assert!(obj.contains_key("inserted"));
}
JSON Iterator
Parse a object or array JSON into a iterator. The item of iterator is the LazyValue, which is wrapper of a raw JSON slice.
use bytes::Bytes;
use sonic_rs::{to_array_iter, JsonValue};
fn main() {
let json = Bytes::from(r#"[1, 2, 3, 4, 5, 6]"#);
let iter = to_array_iter(&json);
for (i, v) in iter.enumerate() {
assert_eq!(i + 1, v.as_u64().unwrap() as usize);
}
let json = Bytes::from(r#"[1, 2, 3, 4, 5, 6"#);
let mut iter = to_array_iter(&json);
for _ in iter.iter() {}
// deal with errors when invalid json
let ret = iter.take_result();
assert_eq!(
ret.as_ref().err().unwrap().to_string(),
"Expected this character to be either a ',' or a ']' while parsing at line 1 column 17"
);
}
Contributing
Please read CONTRIBUTING.md for information on contributing to sonic-rs.
Re-exports
Modules
Macros
Structs
- This type represents all possible errors that can occur when serializing or deserializing JSON data.
Enums
- JsonPointer reprsents a json path. You can use
jsonpointer!["a", "b", 1]represent a json path. It means that we will get the json field from.a.b.1. Note: the key in jsonpointer should be unescaped.
Traits
- PointerTrait is a trait for the node in json pointer path.
Functions
Type Aliases
- Alias for a
Resultwith the error typesonic_rs::Error.