Identifying negation cues and their scope in a text is an important subtask of information extraction that can benefit other natural language processing tasks,including but not limited to medical data mining,relation ...Identifying negation cues and their scope in a text is an important subtask of information extraction that can benefit other natural language processing tasks,including but not limited to medical data mining,relation extraction,question answering and sentiment analysis.The tasks of negation cue and negation scope detection can be treated as sequence labelling problems.In this paper,a system is presented having two components:negation cue detection and negation scope detection.In the first phase,a conditional random field(CRF) model is trained to detect the negation cues using a lexicon of negation words and some lexical and contextual features.Then,another CRF model is trained to detect the scope of each negation cue identified in the first phase,using basic lexical and contextual features.These two models are trained and tested using the dataset distributed within the* Sem Shared Task 2012 on resolving the scope and focus of negation.Experimental results show that the system outperformed all the systems submitted to this shared task.展开更多
Identifying negative or speculative narrative frag- ments from facts is crucial for deep understanding on natu- ral language processing (NLP). In this paper, we firstly con- struct a Chinese corpus which consists of...Identifying negative or speculative narrative frag- ments from facts is crucial for deep understanding on natu- ral language processing (NLP). In this paper, we firstly con- struct a Chinese corpus which consists of three sub-corpora from different resources. We also present a general framework for Chinese negation and speculation identification. In our method, first, we propose a feature-based sequence labeling model to detect the negative or speculative cues. In addition, a cross-lingual cue expansion strategy is proposed to increase the coverage in cue detection. On this basis, this paper presents a new syntactic structure-based framework to identify the linguistic scope of a negative or speculative cue, instead of the traditional chunking-based framework. Experimental results justify the usefulness of our Chinese corpus and the appropriateness of our syntactic structure-based framework which has showed significant improvement over the state-of-the-art on Chinese negation and speculation identification.展开更多
The automatic detection of negation is a crucial task in a wide-range of natural language processing(NLP)applications,including medical data mining,relation extraction,question answering,and sentiment analysis.In this...The automatic detection of negation is a crucial task in a wide-range of natural language processing(NLP)applications,including medical data mining,relation extraction,question answering,and sentiment analysis.In this paper,we present a syntactic path-based hybrid neural network architecture,a novel approach to identify the scope of negation in a sentence.Our hybrid architecture has the particularity to capture salient information to determine whether a token is in the scope or not,without relying on any human intervention.This approach combines a bidirectional long shortterm memory(Bi-LSTM)network and a convolutional neural network(CNN).The CNN model captures relevant syntactic features between the token and the cue within the shortest syntactic path in both constituency and dependency parse trees.The Bi-LSTM learns the context representation along the sentence in both forward and backward directions.We evaluate our model on the Bioscope corpus,and get 90.82%F-score(78.31%PCS)on the abstract sub-corpus,outperforming features-dependent approaches.展开更多
Identifying negation scopes in a text is an important subtask of information extraction, that can benefit other natural language processing tasks, like relation extraction, question answering and sentiment analysis. A...Identifying negation scopes in a text is an important subtask of information extraction, that can benefit other natural language processing tasks, like relation extraction, question answering and sentiment analysis. And serves the task of social media text understanding. The task of negation scope detection can be regarded as a token-level sequence labeling problem. In this paper, we propose different models based on recurrent neural networks (RNNs) and word embedding that can be successfully applied to such tasks without any task-specific feature engineering efforts. Our experimental results show that RNNs, without using any hand-crafted features, outperform feature-rich CRF-based model.展开更多
基金Supported by the National High Technology Research and Development Programme of China(No.2015AA015407)the National Natural Science Foundation of China(No.61273321)the Specialized Research Fund for the Doctoral Program of Higher Education(No.20122302110039)
文摘Identifying negation cues and their scope in a text is an important subtask of information extraction that can benefit other natural language processing tasks,including but not limited to medical data mining,relation extraction,question answering and sentiment analysis.The tasks of negation cue and negation scope detection can be treated as sequence labelling problems.In this paper,a system is presented having two components:negation cue detection and negation scope detection.In the first phase,a conditional random field(CRF) model is trained to detect the negation cues using a lexicon of negation words and some lexical and contextual features.Then,another CRF model is trained to detect the scope of each negation cue identified in the first phase,using basic lexical and contextual features.These two models are trained and tested using the dataset distributed within the* Sem Shared Task 2012 on resolving the scope and focus of negation.Experimental results show that the system outperformed all the systems submitted to this shared task.
基金This research was supported by the National Natural Science Foundation of China (Grant Nos. 61373097, 61272259 and 61272260). Special thanks to Zhancheng Chen, Zhong Qian, and the anonymous reviewers for insightful comments and suggestions.
文摘Identifying negative or speculative narrative frag- ments from facts is crucial for deep understanding on natu- ral language processing (NLP). In this paper, we firstly con- struct a Chinese corpus which consists of three sub-corpora from different resources. We also present a general framework for Chinese negation and speculation identification. In our method, first, we propose a feature-based sequence labeling model to detect the negative or speculative cues. In addition, a cross-lingual cue expansion strategy is proposed to increase the coverage in cue detection. On this basis, this paper presents a new syntactic structure-based framework to identify the linguistic scope of a negative or speculative cue, instead of the traditional chunking-based framework. Experimental results justify the usefulness of our Chinese corpus and the appropriateness of our syntactic structure-based framework which has showed significant improvement over the state-of-the-art on Chinese negation and speculation identification.
基金Project supported by the National Natural Science Foundation of China(Grant Nos.61632011,61772153,71490722)Hei-longjiang philosophy and social science research project(16TQD03)。
文摘The automatic detection of negation is a crucial task in a wide-range of natural language processing(NLP)applications,including medical data mining,relation extraction,question answering,and sentiment analysis.In this paper,we present a syntactic path-based hybrid neural network architecture,a novel approach to identify the scope of negation in a sentence.Our hybrid architecture has the particularity to capture salient information to determine whether a token is in the scope or not,without relying on any human intervention.This approach combines a bidirectional long shortterm memory(Bi-LSTM)network and a convolutional neural network(CNN).The CNN model captures relevant syntactic features between the token and the cue within the shortest syntactic path in both constituency and dependency parse trees.The Bi-LSTM learns the context representation along the sentence in both forward and backward directions.We evaluate our model on the Bioscope corpus,and get 90.82%F-score(78.31%PCS)on the abstract sub-corpus,outperforming features-dependent approaches.
文摘Identifying negation scopes in a text is an important subtask of information extraction, that can benefit other natural language processing tasks, like relation extraction, question answering and sentiment analysis. And serves the task of social media text understanding. The task of negation scope detection can be regarded as a token-level sequence labeling problem. In this paper, we propose different models based on recurrent neural networks (RNNs) and word embedding that can be successfully applied to such tasks without any task-specific feature engineering efforts. Our experimental results show that RNNs, without using any hand-crafted features, outperform feature-rich CRF-based model.