question on lambdarank_num_pair_per_sample

Hello

Thanks for this amazing package. I have a question about the parameters for XGBRanker. I have read the documentation several times (https://xgboost.readthedocs.io/en/latest/tutorials/learning_to_rank.html) and I am still not sure whether I am setting lambdarank_num_pair_per_sample correctly.

My setup is simple: about 300 groups, each with 50 documents. In each group, about half of the documents have relevance = 1 and the rest have relevance = 0 (so I use rank:map). When training the ranker, shouldn't we use all possible pairs in each group, given my small sample? That is, shouldn't lambdarank_num_pair_per_sample be set to a very high value? This seems at odds with the documentation. I am using the mean pair method, not topk.
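To make the numbers concrete, here is a rough sketch of the pair counts for the setup described above. It is plain Python and does not call xgboost. The helper `count_informative_pairs` is a hypothetical name introduced for illustration. Using the per-document partner count as the value for lambdarank_num_pair_per_sample is only an assumption about what "all possible pairs" would mean in the mean setting, not a documented recommendation.

```python
from itertools import combinations


def count_informative_pairs(labels):
    """Count unordered document pairs whose relevance labels differ.

    Only such pairs carry a ranking preference; pairs with equal
    labels give the ranker nothing to order.
    """
    return sum(1 for a, b in combinations(labels, 2) if a != b)


# One group as described in the question: 50 documents, half relevant.
group_labels = [1] * 25 + [0] * 25
n_groups = 300

pairs_per_group = count_informative_pairs(group_labels)
total_pairs = n_groups * pairs_per_group

# For each document, how many documents in its group have a different label.
partners_per_doc = max(
    sum(1 for other in group_labels if other != label)
    for label in set(group_labels)
)

# Parameter names as they appear in the XGBoost learning-to-rank docs.
# ASSUMPTION: setting the per-document sample count to the number of
# opposite-label partners is one way to approximate "all pairs"; sampled
# pairs may still repeat, so full coverage is not guaranteed.
params = {
    "objective": "rank:map",
    "eval_metric": "map",
    "lambdarank_pair_method": "mean",
    "lambdarank_num_pair_per_sample": partners_per_doc,
}

print(pairs_per_group, total_pairs, partners_per_doc)
```

With binary labels split evenly, each document has only 25 partners with a different label, so "a very high value" would in this sketch mean something on the order of 25 rather than thousands. The dictionary could be passed as keyword arguments to XGBRanker, assuming a version of xgboost that supports these parameters.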

Thanks for clarifying this point.

question on lambdarank_num_pair_per_sample #11707
