Syntactic Role Identification of Mathematical Expressions

abstract

2017 IEEE. This paper presents a prediction algorithm to infer the syntactic role (SR) of mathematical expressions (ME), or SRme, in ME-plaintext mixed sentences. SRme is a predicted syntax label of ME, which could be integrated into any constituent parser to improve their accuracy in sentence parsing. SRME is based upon three features of ME placement in a sentence: properness of Sentence structure (feature F3), properties of ME (feature F2), and PoS of the Local neighbor plain text (feature F1). An inside-outside inspired algorithm is proposed for SRME by maximizing the probability of a relaxed parsing tree. Features in F2 was found to fit into both exponential and Poisson distributions, which could fuse with other features to re-weight the prediction rule that improves the prediction precision for SRme as a noun phrase (noun modifier) by 3.6% (18.7%). F1, F2, and F3 were found to complement each other. Significant discriminative patterns on the part-of-speech (PoS) of the neighbor plaintext are adopted to build a Nave Bayesian classifier, which is fused with the F3 baseline that improved the precision of the prediction of SRme as a sentence by 10%. The overall error rate of the SRME prediction algorithm was found to be 15.1% based on an experiment using a public ME-plaintext mixed parsing tree data set provided by Elsevier.

name of conference

2017 Twelfth International Conference on Digital Information Management (ICDIM)

authors

Liu, Jyh

published proceedings

2017 TWELFTH INTERNATIONAL CONFERENCE ON DIGITAL INFORMATION MANAGEMENT (ICDIM)

author list (cited authors)

Wang, X., Lin, J., Vrecenar, R., & Liu, J.

citation count

2

complete list of authors

Wang, Xing||Lin, Jason||Vrecenar, Ryan||Liu, Jyh-Charn

publication date

January 2017

publisher

Institute of Electrical and Electronics Engineers (IEEE) Publisher

keywords

Mathematical Expressions
Me-plaintext Mixed
Nlp
Parsing
Syntactic Role

Digital Object Identifier (DOI)

10.1109/icdim.2017.8244676

International Standard Book Number (ISBN) 13

9781538606643

start page

179

end page

184

volume

2018-January

URL

http://dx.doi.org/10.1109/icdim.2017.8244676

Syntactic Role Identification of Mathematical Expressions Conference Paper

Overview

abstract

name of conference

authors

published proceedings

author list (cited authors)

citation count

complete list of authors

publication date

publisher

Research

keywords

Identity

Digital Object Identifier (DOI)

International Standard Book Number (ISBN) 13

Additional Document Info

start page

end page

volume

Other

URL