Merging v1.3 development changes into master #420

ugorji · 2025-05-27T14:46:35Z

codec v1.3 uses generics and monomorphization to get much improved performance without the need for seldom-used codecgen. Performance numbers are impressive.

Removing codecgen dramatically reduces the work involved in making changes (where each change has to be effectively supported in codecgen mode).

Actions: - remove codecgen (we can get most of the perf guarantees by better inlining/generics) - remove other modules: go/ go/codec/bench go/codec/codecgen (keeping only go/codec) - remove sort helpers (use generics and slices.XXX) - remove cross-lib benchmarks out into repo: go-codec-bench - refactor testing and benchmarks for easier sharing with go-codec-bench

…rmance Key changes: - try to parameterize around: (En|De)coder --> (En|Dec)Driver --> (IO/Byte)(Reader|Writer) - global shared state is managed in Handle - Do not store state: only support restoring to initial state - For side encode/decode, each (En|De)coder will keep a byte encoder/decoder for side functions We now have simple format working. We will then upgrade other drivers and enable them.

This allows us easily exclude files when working on a specific format.

Use generics to refactor some code out to helper parameterized functions for sharing with other formats.

decoder.init(Handle) -> driver.init(Handle) --> reader.init() decoder.reset() --> driver.reset() ?? is this needed??? maybe not. We only ever call .reset decoder.reset(bytes/io) --> driver.reset(bytes/io) --> reader.reset(bytes/io) driver.init() will call Make() on its reader/writer decoder.init() will call Make() on its driver Move side(En|De)coder, rtidFn(s) into the shared(En|De)coder, so that it is used by drivers, decoder.

- remove isBytes() from decReader and encWriter. Instead, at creation of a decoder, we will (based on the context), determine whether bytes or io. - clean up how code is shared between formats. keep all those shared code at bottom of file, so we just change the names.

The reflect types are needed for some tests

We don't need to build a different one for each encoder/decoder. Instead, all encoder/decoder share it globally, and it is initialized at startup.

It is mostly used for extensions. Folks without extensions shoulc not have to pay the price for it.

- multiple panicHdl.errorf callers should be replaced by a method that doesn't call fmt.Errorf - sideEncode/sideDecode should use a reflect.Value directly (as opposed to wrapping it in a interface{}) - instead of creating a new string for a single byte, look it up in a pre-allocated string and substring that

This dramatically reduces alloc and improves run time.

…c monomorphization

Per GC guide, it's best to put pointers and values containing pointers (e.g. interfaces, some slices of pointers, fields that are values containing pointers, etc) together around the top of a struct. This reduces GC pressure, since a value will only be scanned up to the last pointer containing part of it.

…p referencing generics everywhere

…n.go)

…es, etc

…mentary

…-bufio-1024

…other parameters

…stly - readn{2,3,4,8} return was escaping to heap as we called readb internally - - now call readxb, so we just return a slice and convert that directly to an array - fillbuf was previously pre-allocating if it saw that less than bufsize may be available on a write. - - now, fillbuf only allocates when z.wc == len, and not enough left in capacity to handle another bufsize write. Both changes improved performance for binary handles using bufio by a fair amount (~10-15%).

Instead, benchmarks pass in -tzc or configure testZeroCopy=true directly

…ncode/encodeValue methods

- detach2Bytes no longer tries to decode into a secondary slice - decodeBytesInto will let you know if you decoded into a diff slice (dBytesIntoState), and takes a mustFit parameter to ensure that you can copy the bytes into it (or raise an error) - introduce tmpCopyBytes for copying field names (of < 56 chars) into a temp slice for transient use - encode(...): do not handle nil or reflect.Value (encodeValue handles all that)

- introduce usableStructFieldNameBytes that may copy the struct field to prevent overwrite - simplify isCanTransient and its use - use aliases in helper_(unsafe|not_unsafe) - simplify oneShotAddrRV to just check if flagCanTransient is set

- use regular for-loop and not range-over-int - update go.mod to 1.21

…f file

…vel modifications

Ugorji Nwoke and others added 30 commits March 1, 2025 11:06

codec: v1.3: remove go/ module

6e8ea50

codec: separate codec_test.go into format specific files

c4d074d

This allows us easily exclude files when working on a specific format.

codec: refactor some shared functions out of simple.go

411f5d2

Use generics to refactor some code out to helper parameterized functions for sharing with other formats.

codec: add encodeAs/decodeAs to support json's extension support

81aff20

codec: fix json support

0f0a67d

codec: cleanup

87b863b

- remove isBytes() from decReader and encWriter. Instead, at creation of a decoder, we will (based on the context), determine whether bytes or io. - clean up how code is shared between formats. keep all those shared code at bottom of file, so we just change the names.

codec: fixes for fast-path using type parameters

e0f11a4

codec: fixed support for all formats: binc, json, msgpack, simple, cbor

d1ef255

codec: eliminate escape of a slice when looking up enc fn

1d7e0a5

codec: update fastpath to support storing the reflect types during init

415293c

The reflect types are needed for some tests

codec: support benchmarking

81218e5

codec: use global vars for the fastpathES/DS values

3c5dc81

We don't need to build a different one for each encoder/decoder. Instead, all encoder/decoder share it globally, and it is initialized at startup.

codec: lazy initialize side Decoder

ed2a5fe

It is mostly used for extensions. Folks without extensions shoulc not have to pay the price for it.

codec: nit

0da812a

codec: remove unused isBytes methods on format drivers

5a677ee

codec: comment out driverStateManager's (restore|capture)State methods

d925103

codec: removed old commented blocks from simple.go

bf22cb1

codec: callMake calls through an interface and not reflect

3c33118

This dramatically reduces alloc and improves run time.

codec: cleanups, use better nil value for reset

d7196d5

codec: nit (remove comments)

90c2a32

codec: comment out some delegate methods done to try and force generi…

fd91e07

…c monomorphization

codec: nit

f0bce77

codec: make bigenWriter[T] a field of encDriver, so that we don't kee…

7fd8db4

…p referencing generics everywhere

codec: rename rtidfn.go to enc_rtidfn.go (for symmetry with dec_rtidf…

384ae68

…n.go)

codec: remove bigenWriter[T] and just inline calls to writen directly

809b935

ugorji added 29 commits May 20, 2025 14:53

codec: missed some places for renaming decoderBase.string --> detach2Str

83d6b7d

codec: notJsonType is only type with isJson method outside jsonHandle

6f37e7e

codec: update generated files

a19d73f

codec: tests: do not run tests in parallel if they are mutating Handl…

dc390dc

…es, etc

codec: nit: mv commented out code to bottom of file, and add some com…

410fc29

…mentary

codec: set debugLogging=true (we need it a lot during dev)

d569253

codec: nit

9dc79d6

codec: binc: report right bytesAttachState for DecodeStringAsBytes

d62a08a

codec: z_all_test: use defers to reset mutated Handle settings

6796b49

codec: bench: consistently call benchmark for bufio by bufsize ie use…

1185a74

…-bufio-1024

codec: bench: configure handles during reinit also

b61a8af

codec: introduce bytesOK and bytesOKs for getting bytes and ignoring …

a928055

…other parameters

codec: json: lens of array is a constant - no need to cache it elsewhere

de320f8

codec: nit on renaming benchmark functions, et al

ccff962

codec: bench: do not save cmd-line args outside of func scope

1f23000

codec: updated mono generated files

603488a

codec: bench: do not hardcode ZeroCopy=true

5176cbc

Instead, benchmarks pass in -tzc or configure testZeroCopy=true directly

codec: remove circularRefChecker.pushRV and handle it inline within e…

c2c102a

…ncode/encodeValue methods

codec: streamline transient use

56225c4

- introduce usableStructFieldNameBytes that may copy the struct field to prevent overwrite - simplify isCanTransient and its use - use aliases in helper_(unsafe|not_unsafe) - simplify oneShotAddrRV to just check if flagCanTransient is set

codec: change name to TestBenchOnePass and run it even if no verbose

b37cced

codec: changes to support go1.21 minimum

33c6f5a

- use regular for-loop and not range-over-int - update go.mod to 1.21

codec: support growslice from go1.21

3da8748

codec: rename file to goversion_check_supported

aa5c9fd

codec: ensure that codec package is only used for go1.21+

7dd8b91

codec: clean up comments, removing some and moving others to bottom o…

5d57afb

…f file

codec: updated generated mono files

d2493f9

Resolve merge conflict by deleting unnecessary files and accepting de…

fc87261

…vel modifications

ugorji merged commit c121472 into master May 27, 2025
2 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Merging v1.3 development changes into master #420

Merging v1.3 development changes into master #420

Uh oh!

ugorji commented May 27, 2025

Uh oh!

Uh oh!

Uh oh!

Merging v1.3 development changes into master #420

Merging v1.3 development changes into master #420

Uh oh!

Conversation

ugorji commented May 27, 2025

Uh oh!

Uh oh!

Uh oh!