MTab4DBpedia: Semantic Annotation for Tabular Data with DBpedia
Search relevant entities from DBpedia (2016-10)
The query is Tokyo, and get 20 relevant entities.
Command:
curl --request POST --header "Content-Type: application/json" --data '{"q":"Tokyo", "limit":20}' https://dbpedia.mtab.app/api/v1/search
Get entity information from DBpedia (2016-10). The responded object include DBpedia title, mapping to Wikidata, Wikipedia, label, Aliases, types, pagerank score, entity statements, and literal statements.
Get information of the entity Hideaki Takeda
Command:
curl --request POST --header "Content-Type: application/json" --data '{"q":"Hideaki Takeda"}' https://dbpedia.mtab.app/api/v1/info
Table annotation with MTab4DBpedia.
Please refer m_main.py on how to use it.
Submit annotation file (CEA, CTA, CPA), then get the results.
Please refer m_main.py on how to use it.
Annotate numerical column of tables with Knowledge Graph properties
Command:
curl --request POST --header "Content-Type: application/json" --data '{"values":[1.50, 1.51, 1.52, 1.53, 1.54], "limit": 5}' https://dbpedia.mtab.app/api/v1/num
or please refer m_main.py for other examples.
Clone MTab4DBpedia, and open project
git clone https://github.com/phucty/mtab4dbpedia.git
cd mtab4dbpedia
Create conda environment, activate, and install mtab4dbpedia
conda create -n mtab4dbpedia python=3.6
conda activate mtab4dbpedia
pip install -r requirements.txt
Other setup:
Change DIR_ROOT in m_setting.py to your project directory. Current value is (This is the directory in my laptop)
DIR_ROOT = "/Users/phuc/git/mtab4dbpedia"
Decompress data files
data/semtab_2019_dbpedia_2016-10.zip
data/semtab_org.zip
python exp_semtab.py
To make a fair evaluation, it is important to have the same target DBpedia version because DBpedia change overtime. Additionally, using up-to-date resources also could yield a higher performance since data is more complete than older version. It is unfair with the previous study used the older version of DBpedia.
How to adapt the dataset with DBpedia 2016-10?
Open resources for reproducibility:
Adapt Ground Truth:
Phuc Nguyen, Hideaki Takeda, MTab: Tabular Data Annotation, NII Open House June 2021. [video]
Phuc Nguyen, Ikuya Yamada, Hideaki Takeda, MTabES: Entity Search with Keyword Search, Fuzzy Search, and Entity Popularities, In The 35th Annual Conference of the Japanese Society for Artificial Intelligence (JSAI), 2021. [video]
If you find MTab4DBpedia tool useful in your work, and you want to cite our work, please use the following referencee:
@inproceedings{2019_mtab4dbpedia,
author = {Phuc Nguyen and
Natthawut Kertkeidkachorn and
Ryutaro Ichise and
Hideaki Takeda},
title = {MTab: Matching Tabular Data to Knowledge Graph using Probability Models},
booktitle = {SemTab@ISWC 2019},
volume = {2553},
pages = {7--14},
publisher = {CEUR-WS.org},
year = {2019},
}
Phuc Nguyen (phucnt@nii.ac.jp
)