TA-RE: An exchange language for mining software repositories

Sunghun Kim*, Thomas Zimmermann, Miryung Kim, Ahmed Hassan, Audris Mockus, Tudor Girba, Martin Pinzger, E. James Whitehead, Andreas Zeller

*Corresponding author for this work

Research output: Chapter in Book/Conference Proceeding/ReportConference Paper published in a bookpeer-review

16 Citations (Scopus)

Abstract

Software repositories have been getting a lot of attention from researchers in recent years. In order to analyze software repositories, it is necessary to first extract raw data from the version control and problem tracking systems. This poses two challenges: (1) extraction requires a non-trivial effort, and (2) the results depend on the heuristics used during extraction. These challenges burden researchers that are new to the community and make it difficult to benchmark software repository mining since it is almost impossible to reproduce experiments done by another team. In this paper we present the TA-RE corpus. TA-RE collects extracted data from software repositories in order to build a collection of projects that will simplify extraction process. Additionally the collection can be used for benchmarking. As the first step we propose an exchange language capable of making sharing and reusing data as simple as possible.

Original languageEnglish
Title of host publicationProceedings of the 2006 International Workshop on Mining Software Repositories, MSR '06, Co-located with the 28th International Conference on Software Engineering, ICSE 2006
Pages22-25
Number of pages4
DOIs
Publication statusPublished - 2006
Externally publishedYes
Event2006 International Workshop on Mining Software Repositories, MSR '06, Co-located with the 28th International Conference on Software Engineering, ICSE 2006 - Shanghai, China
Duration: 20 May 200628 May 2006

Publication series

NameProceedings - International Conference on Software Engineering
ISSN (Print)0270-5257

Conference

Conference2006 International Workshop on Mining Software Repositories, MSR '06, Co-located with the 28th International Conference on Software Engineering, ICSE 2006
Country/TerritoryChina
CityShanghai
Period20/05/0628/05/06

Keywords

  • analysis
  • corpus
  • prediction
  • software repository mining

Fingerprint

Dive into the research topics of 'TA-RE: An exchange language for mining software repositories'. Together they form a unique fingerprint.

Cite this