NTU NL2KB (Natural Language to Knowledge Base) Resource

Introduction

A total of 7,139 Chinese relation patterns covering 1,087 DBpedia properties were extracted and verified by human annotators. This resource can be used for knowledge base construction and knowledge base retrieval (e.g., question answering).

Format

Each line of the UTF-8 encoded, tab-separated file is one relation pattern. The first column is the property as defined in DBpedia. The second column is the pattern in Chinese. The third and fourth columns are the support and confidence values of the pattern, respectively. The last column records whether the annotators judged the pattern correct (T) or wrong (F).

In the example below, "<實體一> 執導 <實體二>" (literally "<entity 1> directs <entity 2>") is a Chinese pattern that suggests the DBpedia property producer, with a support of 103 and a confidence of 0.112. The final T indicates that human annotators validated this mapping.

producer <實體一> 執導 <實體二> 103 0.112 T
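
The five-column layout described above can be read with a short script. The following is a minimal sketch in Python, not an official loader: the names RelationPattern and read_patterns are illustrative, and it assumes the file has no header row, that the support value is an integer, and that rows without exactly five columns can be skipped.

```python
import csv
import io
from typing import Iterator, NamedTuple, TextIO


class RelationPattern(NamedTuple):
    """One row of the pattern file (field names are illustrative)."""
    prop: str          # DBpedia property, e.g. "producer"
    pattern: str       # Chinese pattern with entity placeholders
    support: int       # assumed to be an integer count
    confidence: float  # confidence value of the pattern
    verified: bool     # True for "T", False for anything else


def read_patterns(handle: TextIO) -> Iterator[RelationPattern]:
    """Yield one RelationPattern per well-formed tab-separated line."""
    # QUOTE_NONE keeps any quote characters inside a pattern as literal text.
    reader = csv.reader(handle, delimiter="\t", quoting=csv.QUOTE_NONE)
    for row in reader:
        if len(row) != 5:
            continue  # assumption: skip blank or malformed lines
        prop, pattern, support, confidence, label = row
        yield RelationPattern(
            prop, pattern, int(support), float(confidence), label.strip() == "T"
        )


# The example line from this page, with tabs between the columns.
sample = "producer\t<實體一> 執導 <實體二>\t103\t0.112\tT\n"
patterns = list(read_patterns(io.StringIO(sample)))
print(patterns[0])
```

To read a downloaded file instead of the in-memory sample, pass a handle opened with encoding="utf-8" and newline="" to read_patterns; the file name depends on which download you choose.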

Download

All patterns in traditional Chinese and simplified Chinese.

Demo

Visit our demonstration system by clicking here.

How to Cite This Resource

Please cite the following paper when referring to NL2KB in academic publications.

Sheng-Lun Wei, Yen-Pin Chiu, Hen-Hsen Huang, and Hsin-Hsi Chen (2016). “NL2KB: Resolving Vocabulary Gap between Natural Language and Knowledge Base in Knowledge Base Construction and Retrieval.” Proceedings of the 26th International Conference on Computational Linguistics (COLING 2016), December 11-16, 2016, Osaka, Japan.