Issue: Replace iterrows().to_dict() with apply(...).tolist() for better performance

https://github.com/DataKitchen/dataops-testgen/blob/418ec3a21ed3c7653e8859c97e6a06a37140ed0a/testgen/ui/views/data_catalog.py#L279
Current implementation:
`data = [row.to_dict() for _, row in df.iterrows()]`
Recommended replacement:
`data = df.apply(lambda row: row.to_dict(), axis=1).tolist()`
Using iterrows() introduces overhead because each row is returned as a Series object and to_dict() is repeatedly called in pure Python. This approach creates a large number of temporary objects and results in slow performance when the DataFrame becomes large.

By contrast, df.apply(lambda row: row.to_dict(), axis=1) keeps the row-wise transformation within Pandas' optimized Cython internals. Although still row-based, this method reduces Python-level overhead and improves performance while preserving the same output structure: List[Dict[str, Any]].


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Issue: Replace iterrows().to_dict() with apply(...).tolist() for better performance #35

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue: Replace iterrows().to_dict() with apply(...).tolist() for better performance #35

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions