ModelMine: A tool to facilitate mining models from open source repositories

Sayed Mohsin Reza, Omar Badreddin, Khandoker Rahad

Research output: Chapter in Book/Report/Conference proceeding (Conference contribution)

11 Scopus citations

Abstract

Mining Software Repositories (MSR) has opened up new pathways and rich sources of data for research and practical purposes. This research discipline facilitates mining data from open source repositories and analyzing software defects, development activities, processes, patterns, and more. Contemporary mining tools are geared towards data extraction and analysis, primarily from textual artifacts, and have limitations in representation, ranking, and availability. This paper presents ModelMine, a novel mining tool that focuses on mining model-based artifacts and designs from open source repositories. ModelMine is designed particularly to mine software repositories, artifacts, and commit history to uncover information about software designs and practices in open source communities. Its features include identifying and ranking open source repositories by the extent to which model-based artifacts are present, and querying repositories to extract models and design artifacts based on customizable criteria. It also caches intermediate results phase by phase to speed up processing and enable efficient mining of data. We compare ModelMine against a state-of-the-art tool, PyDriller, in terms of performance and usability. The results show that ModelMine has the potential to become instrumental for cross-disciplinary research that combines modeling and design with repository mining and artifact extraction. URL: https://www.smreza.com/projects/modelmine/
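
The abstract mentions two mechanisms: ranking repositories by how many model-based artifacts they contain, and caching intermediate results per phase. The following is a minimal sketch of both ideas, not ModelMine's actual implementation. The file extensions, the `PhaseCache` class, the function names, and the in-memory repository listing are all assumptions made for illustration; the real tool's criteria and storage may differ.

```python
from pathlib import PurePosixPath

# Hypothetical extensions treated as model artifacts (an assumption,
# not ModelMine's documented criteria).
MODEL_EXTENSIONS = frozenset({".uml", ".xmi", ".ecore"})


class PhaseCache:
    """In-memory stand-in for caching intermediate results per mining phase."""

    def __init__(self):
        self._store = {}
        self.misses = 0  # counts how often a result had to be computed

    def get_or_compute(self, phase, key, compute):
        cache_key = (phase, key)
        if cache_key not in self._store:
            self.misses += 1
            self._store[cache_key] = compute()
        return self._store[cache_key]


def model_share(paths):
    """Return the fraction of file paths whose extension marks a model artifact."""
    paths = list(paths)
    if not paths:
        return 0.0
    hits = sum(
        1 for p in paths if PurePosixPath(p).suffix.lower() in MODEL_EXTENSIONS
    )
    return hits / len(paths)


def rank_repositories(repos, cache):
    """Sort (name, share) pairs by descending share; ties break by name."""
    scores = {
        name: cache.get_or_compute("score", name, lambda f=files: model_share(f))
        for name, files in repos.items()
    }
    return sorted(scores.items(), key=lambda item: (-item[1], item[0]))


# Illustrative input: repository name -> list of file paths (made-up data).
repos = {
    "org/a": ["design/model.uml", "src/main.py"],
    "org/b": ["README.md"],
    "org/c": ["a.ecore", "b.xmi"],
}
cache = PhaseCache()
ranking = rank_repositories(repos, cache)
ranking_again = rank_repositories(repos, cache)  # served from the cache
```

Running the ranking a second time with the same cache reuses the stored scores, which is the effect phase-by-phase caching aims for when a later phase is rerun with different criteria.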

Original language: English (US)
Title of host publication: Proceedings - 23rd ACM/IEEE International Conference on Model Driven Engineering Languages and Systems, MODELS-C 2020 - Companion Proceedings
Publisher: Association for Computing Machinery, Inc
Pages: 441-450
Number of pages: 10
ISBN (Electronic): 9781450381352
State: Published - Oct 16 2020
Event: 23rd ACM/IEEE International Conference on Model Driven Engineering Languages and Systems, MODELS-C 2020 - Virtual, Online, Canada
Duration: Oct 16 2020 to Oct 23 2020

Publication series

Name: Proceedings - 23rd ACM/IEEE International Conference on Model Driven Engineering Languages and Systems, MODELS-C 2020 - Companion Proceedings

Conference

Conference: 23rd ACM/IEEE International Conference on Model Driven Engineering Languages and Systems, MODELS-C 2020
Country/Territory: Canada
City: Virtual, Online
Period: 10/16/20 to 10/23/20

All Science Journal Classification (ASJC) codes

  • Software
  • Engineering (miscellaneous)
