Publication: Automatic classification of web images as UML static diagrams using machine learning techniques
dc.affiliation.dpto | UC3M. Departamento de Informática | es |
dc.affiliation.grupoinv | UC3M. Grupo de Investigación: Knowledge Reusing | es |
dc.contributor.author | Moreno Pelayo, Valentín | |
dc.contributor.author | Génova Fuster, Gonzalo | |
dc.contributor.author | Alejandres Sánchez, Manuela | |
dc.contributor.author | Fraga Vázquez, Anabel | |
dc.contributor.funder | European Commission | en |
dc.contributor.funder | Ministerio de Economía y Competitividad (España) | es |
dc.date.accessioned | 2022-02-21T10:24:07Z | |
dc.date.available | 2022-02-21T10:24:07Z | |
dc.date.issued | 2020-04-01 | |
dc.description.abstract | Our purpose in this research is to develop a method to automatically and efficiently classify web images as Unified Modeling Language (UML) static diagrams, and to produce a computer tool that implements this function. The tool receives a bitmap file (in different formats) as an input and communicates whether the image corresponds to a diagram. For pragmatic reasons, we restricted ourselves to the simplest kinds of diagrams that are more useful for automated software reuse: computer-edited 2D representations of static diagrams. The tool does not require that the images are explicitly or implicitly tagged as UML diagrams. The tool extracts graphical characteristics from each image (such as grayscale histogram, color histogram and elementary geometric forms) and uses a combination of rules to classify it. The rules are obtained with machine learning techniques (rule induction) from a sample of 19,000 web images manually classified by experts. In this work, we do not consider the textual contents of the images. Our tool reaches nearly 95% of agreement with manually classified instances, improving the effectiveness of related research works. Moreover, using a training dataset 15 times bigger, the time required to process each image and extract its graphical features (0.680 s) is seven times lower. | en |
dc.description.sponsorship | This research has received funding from the CRYSTAL project – Critical System Engineering Acceleration (European Union’s Seventh Framework Program, FP7/2007-2013, ARTEMIS Joint Undertaking grant agreement n° 332830); and from the AMASS project – Architecture-driven, Multi-concern and Seamless Assurance and Certification of Cyber-Physical Systems (H2020-ECSEL grant agreement nº 692474; Spain’s MINECO ref. PCIN-2015-262). | en |
dc.identifier.bibliographicCitation | Moreno, V.; Génova, G.; Alejandres, M.; Fraga, A. Automatic Classification of Web Images as UML Static Diagrams Using Machine Learning Techniques. Appl. Sci. 2020, 10, 2406. https://doi.org/10.3390/app10072406 | en |
dc.identifier.doi | https://doi.org/10.3390/app10072406 | |
dc.identifier.issn | 2076-3417 | |
dc.identifier.publicationfirstpage | 2406 | |
dc.identifier.publicationissue | 7 | |
dc.identifier.publicationtitle | Applied Sciences (Switzerland) | en |
dc.identifier.publicationvolume | 10 | |
dc.identifier.uri | https://hdl.handle.net/10016/34175 | |
dc.identifier.uxxi | AR/0000029188 | |
dc.language.iso | eng | en |
dc.publisher | MDPI | en |
dc.relation.projectID | info:eu-repo/grantAgreement/EC/ART-010000-2013-1 | en |
dc.relation.projectID | info:eu-repo/grantAgreement/EC/692474 | en |
dc.relation.projectID | Gobierno de España. PCIN-2015-262 | |
dc.rights | © MDPI, 2020 | es |
dc.rights | Atribución 3.0 España | |
dc.rights.accessRights | open access | en |
dc.rights.uri | http://creativecommons.org/licenses/by/3.0/es/ | |
dc.subject.eciencia | Informática | es |
dc.subject.other | uml diagram recognition | en |
dc.subject.other | image processing | en |
dc.subject.other | image classification | en |
dc.subject.other | rule induction | en |
dc.subject.other | classification tool | en |
dc.title | Automatic classification of web images as UML static diagrams using machine learning techniques | en |
dc.type | research article | * |
dc.type.hasVersion | VoR | * |
dspace.entity.type | Publication |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- automatic_AS_2020.pdf
- Size:
- 1.59 MB
- Format:
- Adobe Portable Document Format
- Description: