A Corpus-based Study of Reporting Verbs in Citation Texts Using Natural Language Processing

Keywords:

reporting verb, citation analysis, sentiment analysis, NLP, scientometrics

Abstract

In scientific writing, authors often cite other research to support their opinions and findings. The choice of reporting verb plays an important role in these citations. Reporting verbs may carry varying degrees of strength depending on the context in which they are used. Therefore, compiling the reporting verbs that authors use in various contexts into a dataset can provide a basis for corpus-based analysis of citations and the reasons behind them. Sentiment analysis techniques can categorize a citation as Positive, Negative or Neutral. Natural Language Processing (NLP) techniques can automatically tag the verbs used in a citation with high accuracy. This paper presents a sentiment-based study that builds a corpus of reporting verbs from citations by categorizing the citation texts of a selected dataset into these three sentiments. Using NLP techniques, reporting verbs are extracted from these citation texts and their frequencies are calculated. The study also analyzes the verbs extracted for each sentiment.
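
The extraction and counting step described in the abstract might look roughly like the sketch below. It is an illustration under stated assumptions, not the study's implementation: the citations are assumed to be already labelled with a sentiment, a small hypothetical lookup table (`REPORTING_VERBS`) stands in for a part-of-speech tagger and lemmatizer, and the sample sentences are invented.

```python
import re
from collections import Counter, defaultdict

# Hypothetical mini-lexicon mapping inflected forms to a base reporting verb.
# Assumption: a real pipeline would use a POS tagger and lemmatizer instead.
REPORTING_VERBS = {
    "show": "show", "shows": "show", "showed": "show", "shown": "show",
    "claim": "claim", "claims": "claim", "claimed": "claim",
    "propose": "propose", "proposes": "propose", "proposed": "propose",
    "argue": "argue", "argues": "argue", "argued": "argue",
}


def verb_frequencies(labelled_citations):
    """Count reporting verbs per sentiment label.

    labelled_citations: iterable of (citation_text, sentiment) pairs.
    Returns a dict mapping each sentiment to a Counter of base verbs.
    """
    counts = defaultdict(Counter)
    for text, sentiment in labelled_citations:
        # Lowercase and split into alphabetic tokens.
        for token in re.findall(r"[a-z]+", text.lower()):
            base = REPORTING_VERBS.get(token)
            if base is not None:
                counts[sentiment][base] += 1
    return counts


# Invented example data, for illustration only.
SAMPLE = [
    ("Smith et al. showed that the method improves accuracy.", "Positive"),
    ("Lee claims the approach scales to large corpora.", "Negative"),
    ("Khan proposed a rule-based tagger.", "Neutral"),
    ("Earlier work shows promising results.", "Positive"),
]

freq = verb_frequencies(SAMPLE)
for sentiment, counter in freq.items():
    print(sentiment, counter.most_common())
```

Keeping one counter per sentiment makes it straightforward to compare which verbs dominate Positive, Negative and Neutral citations, which is the kind of per-sentiment analysis the abstract mentions.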

Published

2020-06-18 — Updated on 2020-07-02

How to Cite

1. A Corpus-based Study of Reporting Verbs in Citation Texts Using Natural Language Processing. Corporum [Internet]. 2020 Jul 2 [cited 2024 Apr 23];2(1):25-36. Available from: https://journals.au.edu.pk/ojscrc/index.php/crc/article/view/42