dev notes

Dev Note

These are general free form note with pointers to good jumping to point to under stand the llama.cpp codebase.

(@<symbol> is a vscode jump to symbol code for your convenience. Also making a feature request to vscode to be able to jump to file and symbol)

Where are the definitions?

GGUF file structure spec (WARN: As of 2024-06-11 the llama.cpp implementation is the canonical source for now)

All of the gguf structure can be found in gguf.c unless stated otherwise

GGUF Structure Of Interest	gguf.c reference	vscode search line
Overall File Structure	`struct gguf_context`	`@gguf_context`
File Header Structure	`struct gguf_header`	`@gguf_header`
Key Value Structure	`struct gguf_kv`	`@gguf_kv`
Tensor Info Structure	`struct gguf_tensor_info`	`@gguf_tensor_info`

Element of Interest (Think of this as an index lookup reference)

Please use this as an index not as canonical reference. The purpose of this table is to allow you to quickly locate major elements of the gguf file standard.

GGUF Elements Of Interest	c name	c type	gguf.c reference	vscode search line
Magic	magic	`uint8_t[4]`	`struct gguf_header`	`@gguf_header`
Version	version	`uint32_t`	`struct gguf_header`	`@gguf_header`
Tensor Count	n_tensors	`uint64_t`	`struct gguf_header`	`@gguf_header`
Key Value Count	n_kv	`uint64_t`	`struct gguf_header`	`@gguf_header`
Key Value Linked List	kv	`gguf_kv *`	`struct gguf_context`	`@gguf_context`
Tensor Info Linked List	infos	`gguf_tensor_info *`	`struct gguf_context`	`@gguf_context`
Key Value Entry - Key	gguf_kv.key	`gguf_str`	`struct gguf_kv`	`@gguf_kv`
Key Value Entry - Type	gguf_kv.type	`gguf_type`	`struct gguf_kv`	`@gguf_kv`
Key Value Entry - Type	gguf_kv.value	`gguf_value`	`struct gguf_kv`	`@gguf_kv`
Tensor Info Entry - Name	gguf_tensor_info.name	`gguf_str`	`struct gguf_tensor_info`	`@gguf_tensor_info`
Tensor Info Entry - Tensor shape dimension count	gguf_tensor_info.n_dim	`uint32_t`	`struct gguf_tensor_info`	`@gguf_tensor_info`
Tensor Info Entry - Tensor shape sizing array	gguf_tensor_info.ne	`uint64_t[GGML_MAX_DIMS]`	`struct gguf_tensor_info`	`@gguf_tensor_info`
Tensor Info Entry - Tensor Encoding Scheme / Strategy	gguf_tensor_info.type	`ggml_type`	`struct gguf_tensor_info`	`@gguf_tensor_info`
Tensor Info Entry - Offset from start of 'data'	gguf_tensor_info.offset	`uint64_t`	`struct gguf_tensor_info`	`@gguf_tensor_info`

Also just note that these values are not actually part of gguf but is there for internal usage and is calculated during model loading. Aka it's for the writing/reading api

GGML Elements Of Interest	c name	c type	gguf.c reference	vscode search line
Alignment	alignment	`size_t`	`struct gguf_context`	`@gguf_context`
Offset Of 'Data' From Beginning Of File	offset	`size_t`	`struct gguf_context`	`@gguf_context`
Size Of 'Data' In Bytes	size	`size_t`	`struct gguf_context`	`@gguf_context`

Users Guide

Useful information for users that doesn't fit into Readme.

Technical Details

These are information useful for Maintainers and Developers which does not fit into code comments

Github Actions Main Branch Status

Click on a badge to jump to workflow. This is here as a useful general view of all the actions so that we may notice quicker if main branch automation is broken and where.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly