Skip to content

Where is the InfoSeek-Evaluation set? #10

@nv-aken12

Description

@nv-aken12

Hi, thank you for your great work!!
I read InfoSeek and InfoFlow papers.

In the second paper (InfoFlow), I found a paragraph that states InfoSeek has an Evaluation set containing 300 queries:

InfoSeek-Evaluation The InfoSeek-Evaluation set contains 300 high-quality, human-checked
samples to evaluate agentic deep search capability. Qwen2.5-72B-Instruct with a CoT prompting
achieves lower than 8% accuracy in this evaluation set.

However, I could not find this evaluation set in the InfoSeek repository or its HuggingFace repository.
Could you please let me know where I can access the InfoSeek-Evaluation set?

Thank you for your time, and I look forward to your response.

References:

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions