Publication:
Bridging from syntactic to statistical methods: Classification with automatically segmented features from sequences

Loading...
Thumbnail Image
Identifiers
Publication date
2015-11-01
Defense date
Advisors
Tutors
Journal Title
Journal ISSN
Volume Title
Publisher
Elsevier
Impact
Google Scholar
Export
Research Projects
Organizational Units
Journal Issue
Abstract
To Integrate The Benefits Of Statistical Methods Into Syntactic Pattern Recognition, A Bridging Approach Is Proposed: (I) Acquisition Of A Grammar Per Recognition Class (Ii) Comparison Of The Obtained Grammars In Order To Find Substructures Of Interest Represented As Sequences Of Terminal And/Or Non-Terminal Symbols And Filling The Feature Vector With Their Counts (Iii) Hierarchical Feature Selection And Hierarchical Classification, Deducing And Accounting For The Domain Taxonomy. The Bridging Approach Has The Benefits Of Syntactic Methods: Preserves Structural Relations And Gives Insights Into The Problem. Yet, It Does Not Imply Distance Calculations And, Thus, Saves A Non-Trivial Task-Dependent Design Step. Instead It Relies On Statistical Classification From Many Features. Our Experiments Concern A Difficult Problem Of Chemical Toxicity Prediction. The Code And The Data Set Are Open-Source. (C) 2015 Elsevier Ltd. All Rights Reserved.
Description
Keywords
syntactic pattern recognition, grammatical inference, feature segmentation, smiles parser, feature extraction
Bibliographic citation
Sidorova, J., García, J. (2015). Bridging from syntactic to statistical methods: Classification with automatically segmented features from sequences. Pattern Recognition, 48, pp. 3749-3756. https://doi.org/10.1016/j.patcog.2015.05.001