beir/touche

목차

1. 사용법

1.1. 모든 데이터 순회

1.2. 개별 데이터 접근

2. 속성

2.1. doc

2.2. query

2.3. qrel

3. 통계

4. 인용

5. 출처

6. 라이센스



1. 사용법

1.1. 모든 데이터 순회

from hamu_tool.dataset import DataLoader

loader = DataLoader.load('beir/touche')

for doc in loader.get_docs():
    print(doc.id, doc.text, doc.title, doc.stance, doc.url)
    break

for query in loader.get_queries():
    print(query.id, query.text, query.description, query.narrative)
    break

for qrel in loader.get_qrels('[mode]'):
    print(qrel.qid, qrel.did, qrel.score)
    break

1.2. 개별 데이터 접근

from hamu_tool.dataset import DataLoader

loader = DataLoader.load('beir/touche')

doc = loader.get_doc('[did]')
print(doc)

query = loader.get_query('[qid]')
print(query)

qrel = loader.get_qrel('[mode]', '[qid]')
print(qrel)

2. 속성

2.1. doc

속성자료형
idstr
textstr
titlestr
stancestr
urlstr

2.2. query

속성자료형
idstr
textstr
descriptionstr
narrativestr

2.3. qrel

속성자료형
qidstr
didstr
scoreint
  • [mode]: test

3. 통계

수치
TaskArgument Retrieval
DomainMisc.
# Query49
# Doc382,544
# Qreltest2,403
Average Rel D/Qtest49.04
Average Query Length (words)6.55
Average Doc Length (words)286.46

4. 인용

@inproceedings{bondarenko2020overview,
  title = "Overview of Touch{\'e} 2020: argument retrieval",
  author = {Bondarenko, Alexander and Fr{\"o}be, Maik and Beloucif, Meriem and Gienapp, Lukas and Ajjour, Yamen and Panchenko, Alexander and Biemann, Chris and Stein, Benno and Wachsmuth, Henning and Potthast, Martin and others},
  booktitle = "Experimental IR Meets Multilinguality, Multimodality, and Interaction: 11th International Conference of the CLEF Association, CLEF 2020, Thessaloniki, Greece, September 22--25, 2020, Proceedings 11",
  pages = "384--395",
  year = "2020",
  organization = "Springer"
}
@article{Thakur2021Beir,
  title = "BEIR: A Heterogenous Benchmark for Zero-shot Evaluation of Information Retrieval Models",
  author = "Thakur, Nandan and Reimers, Nils and Rücklé, Andreas and Srivastava, Abhishek and Gurevych, Iryna", 
  journal = "arXiv preprint arXiv:2104.08663",
  month = "4",
  year = "2021",
  url = "https://arxiv.org/abs/2104.08663",
}

5. 출처


6. 라이센스