Creating Bengali Freebase Using Wikidata

Abstract: Freebase is a large collaborative knowledge base of general, structured information for public use. Its structured data was harvested from many sources, including individual, user-submitted wiki contributions. Its aim is to create a global resource so that people (and machines) can access common information more effectively; most of that information is available only in English. In this research work, we develop a technique for creating a Freebase for the Bengali language. The number of Bengali articles on the internet is growing day by day, so a structured data store in Bengali has become necessary. It consists of different types of concepts (topics) and the relationships between those topics. These cover areas such as popular culture (e.g. films, music, books, sports, television), location information (restaurants, geolocations, businesses), scholarly information (linguistics, biology, astronomy), birthplaces (of poets, politicians, actors, actresses), and general knowledge (Wikipedia). It will be helpful for relation extraction and other Natural Language Processing (NLP) work on the Bengali language. In this work, we identified a technique for creating the Bengali Freebase and built a collection of Bengali data. We applied the SPARQL query language to extract information from Bengali-language sources such as Wikidata, which is typically available in RDF (Resource Description Framework) triple format.
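The abstract does not reproduce the authors' queries, so the following is only a minimal sketch of the kind of SPARQL extraction it describes: asking Wikidata for people and their birthplaces, restricted to Bengali (`bn`) labels, and turning the result rows into subject-predicate-object triples. The function names `build_birthplace_query` and `bindings_to_triples`, the `place_of_birth` predicate string, and the hand-written sample response are assumptions made for illustration; `P31` (instance of), `Q5` (human), and `P19` (place of birth) are standard Wikidata identifiers, and the `wd:`, `wdt:`, and `rdfs:` prefixes are assumed to be predefined, as they are on the public Wikidata Query Service.

```python
# Public Wikidata SPARQL endpoint (not contacted in this sketch).
WIKIDATA_ENDPOINT = "https://query.wikidata.org/sparql"


def build_birthplace_query(limit=10):
    """Return a SPARQL query for (person, birthplace) pairs with Bengali labels."""
    return f"""
SELECT ?personLabel ?placeLabel WHERE {{
  ?person wdt:P31 wd:Q5 ;
          wdt:P19 ?place .
  ?person rdfs:label ?personLabel . FILTER(LANG(?personLabel) = "bn")
  ?place  rdfs:label ?placeLabel .  FILTER(LANG(?placeLabel) = "bn")
}}
LIMIT {int(limit)}
""".strip()


def bindings_to_triples(result_json, predicate="place_of_birth"):
    """Convert SPARQL JSON results into (subject, predicate, object) label triples."""
    rows = result_json["results"]["bindings"]
    return [
        (row["personLabel"]["value"], predicate, row["placeLabel"]["value"])
        for row in rows
    ]


# Hand-written sample in the SPARQL 1.1 JSON results layout (illustrative only,
# not output captured from the endpoint): Rabindranath Tagore, born in Kolkata.
SAMPLE_RESPONSE = {
    "head": {"vars": ["personLabel", "placeLabel"]},
    "results": {
        "bindings": [
            {
                "personLabel": {"type": "literal", "xml:lang": "bn", "value": "রবীন্দ্রনাথ ঠাকুর"},
                "placeLabel": {"type": "literal", "xml:lang": "bn", "value": "কলকাতা"},
            }
        ]
    },
}

if __name__ == "__main__":
    print(build_birthplace_query(limit=5))
    for triple in bindings_to_triples(SAMPLE_RESPONSE):
        print(triple)
```

To run the query for real, the string from `build_birthplace_query` would be sent to `WIKIDATA_ENDPOINT` with a JSON results format requested; the parsing step stays the same.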

Authors: Rukaiya Habib; Mahmuda Ferdous; Md Musfique Anwar (Department of Computer Science and Engineering, Jahangirnagar University, Dhaka, Bangladesh; Department of Computer Science and Engineering, University of Development Alternative, Dhaka, Bangladesh)

Affiliation: Department of Computer Science and Engineering

Source: Journal of Computer and Communications, 2023, Issue 5, pp. 151-160 (10 pages)

Keywords: Knowledge-Base; Structured Data; NLP; RDF

Classification: TP3 [Automation and Computer Technology: Computer Science and Technology]
