question on lambdarank_num_pair_per_sample #11707

@randomgambit

Hello

Thanks for this amazing package. I have a question about the parameters of XGBRanker. I have read the documentation several times (https://xgboost.readthedocs.io/en/latest/tutorials/learning_to_rank.html) and I am still not sure whether I am setting lambdarank_num_pair_per_sample correctly.

My setup is simple: about 300 groups, each with 50 documents, and about half of the documents in each group have relevance = 1 while the rest have relevance = 0 (so I use rank:map as the metric). When training the ranker, shouldn't we use all possible pairs in each group, given my small sample? That is, shouldn't lambdarank_num_pair_per_sample be set to a very high value? This seems at odds with the documentation. I am using the mean pair method, not topk.
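
To make the question concrete, here is a rough sketch of the pair counts as I understand them. The helper names (count_discordant_pairs, sampled_pairs) are my own, the labels are a synthetic stand-in for one of my groups, and the assumption that the mean method samples about n_docs * lambdarank_num_pair_per_sample pairs per group is only my reading of the tutorial, so it may be wrong.

```python
from itertools import combinations


def count_discordant_pairs(labels):
    """Count document pairs in one group whose relevance labels differ."""
    return sum(1 for a, b in combinations(labels, 2) if a != b)


def sampled_pairs(n_docs, num_pair_per_sample):
    """Pairs sampled per group under the mean method.

    Assumption (my reading of the tutorial, not confirmed): each document
    gets num_pair_per_sample sampled pairs, so the total is the product.
    """
    return n_docs * num_pair_per_sample


# Synthetic stand-in for one group: 50 documents, half of them relevant.
group_labels = [1] * 25 + [0] * 25
n_docs = len(group_labels)
all_pairs = count_discordant_pairs(group_labels)  # 25 * 25 = 625

# Compare the sampled-pair budget with the number of informative pairs.
for k in (1, 8, 13, 50):
    print(f"k={k}: sampled={sampled_pairs(n_docs, k)}, possible={all_pairs}")

# Parameters for the setup described above; the value 13 is only an
# illustration of "roughly enough to cover all pairs", not a recommendation.
params = {
    "objective": "rank:map",
    "lambdarank_pair_method": "mean",
    "lambdarank_num_pair_per_sample": 13,
}
```

If that reading is right, a value around 13 would already give a budget of about as many pairs as exist in a group (650 sampled versus 625 possible), so a "very high" value may not be needed. The dictionary would be passed as keyword arguments, e.g. xgboost.XGBRanker(**params).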

Thanks for clarifying this point.
