[WIP] Add TrueQuery QA Generation #10559

askliar · 2025-12-15T18:39:00Z

This PR introduces a QA generation procedure: a comprehensive LLM-powered pipeline for generating high-quality question-answer pairs from text documents. The system is designed to create training data for retrieval-augmented generation (RAG) systems and question-answering models.

puririshi98 · 2025-12-16T01:27:01Z

First step: address whatever complaints the CI linter has: https://results.pre-commit.ci/run/github/106024057/1765827433.VRbWJ22KTYipZNbGq422sQ
Then update the changelog:
https://github.com/askliar/pytorch_geometric/blob/master/CHANGELOG.md

puririshi98 · 2025-12-16T01:32:51Z

Upon quick review, no glaring issues. Looking forward to the final restructuring before reviewing deeper and also involving Vibhor for review.
One thing I would ask: when the final restructuring is done, please attach a log with a successful run of the full example in the latest pyg container: https://catalog.ngc.nvidia.com/orgs/nvidia/containers/pyg/tags

codecov · 2025-12-16T01:33:58Z

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 81.35%. Comparing base (c211214) to head (9919739).
⚠️ Report is 142 commits behind head on master.

Additional details and impacted files

@@            Coverage Diff             @@
##           master   #10559      +/-   ##
==========================================
- Coverage   86.11%   81.35%   -4.76%     
==========================================
  Files         496      511      +15     
  Lines       33655    37615    +3960     
==========================================
+ Hits        28981    30603    +1622     
- Misses       4674     7012    +2338

Andrii Skliar added 2 commits December 15, 2025 19:34

initial truequery_qa_gen commit

6373868

Merge branch 'master' of https://github.com/askliar/pytorch_geometric

c21a068

askliar requested a review from puririshi98 as a code owner December 15, 2025 18:39

askliar changed the title ~~WIP: Add TrueQuery QA Generation~~ Draft: Add TrueQuery QA Generation Dec 15, 2025

pre-commit-ci bot and others added 7 commits December 15, 2025 18:40

[pre-commit.ci] auto fixes from pre-commit.com hooks

a3806bb

for more information, see https://pre-commit.ci

cleanup file structure

742ed00

Merge branches master and master of https://github.com/askliar/pytorch_geometric

b8ada09

[pre-commit.ci] auto fixes from pre-commit.com hooks

1e2cca6

for more information, see https://pre-commit.ci

Update imports in qa_gen.py to include TypedDict and OpenAI module

54b3c89

Merge branch 'master' of https://github.com/askliar/pytorch_geometric

9e946e8

[pre-commit.ci] auto fixes from pre-commit.com hooks

9919739

for more information, see https://pre-commit.ci

askliar changed the title ~~Draft: Add TrueQuery QA Generation~~ [WIP] Add TrueQuery QA Generation Dec 16, 2025

askliar marked this pull request as draft December 16, 2025 09:27

improve formatting

c4e3570
