Data

Data files used to study change dynamics in software systems

Swinburne University of Technology
Rajesh Vasa (Owned by)
Viewed: [[ro.stat.viewed]] Cited: [[ro.stat.cited]] Accessed: [[ro.stat.accessed]]
ctx_ver=Z39.88-2004&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Adc&rfr_id=info%3Asid%2FANDS&rft_id=http://hdl.handle.net/1959.3/190177&rft.title=Data files used to study change dynamics in software systems&rft.identifier=http://hdl.handle.net/1959.3/190177&rft.publisher=Swinburne University of Technology&rft.description=It is a widely accepted fact that evolving software systems change and grow. However, it is less well-understood how change is distributed over time, specifically in object oriented software systems. The patterns and techniques used to measure growth permit developers to identify specific releases where significant change took place as well as to inform them of the longer term trend in the distribution profile. This knowledge assists developers in recording systemic and substantial changes to a release, as well as to provide useful information as input into a potential release retrospective. In order to manage the evolution of complex software systems effectively, it is important to identify change-prone classes as early as possible, but these analysis methods can only be applied after a mature release of the code has been developed. Specifically, developers need to know where they can expect change, the likelihood of a change, and the magnitude of these modifications in order to take proactive steps and mitigate any potential risks arising from these changes. We present a statistical analysis of change in approximately 55000 unique classes across all projects under investigation. The raw metric data (4 .txt files and 4 .log files in a .zip file measuring ~2MB in total) is provided as a comma separated values (CSV) file, and the first line of the CSV file contains the header. A detailed output of the statistical analysis undertaken is provided as log files generated directly from Stata (statistical analysis software).&rft.creator=Rajesh Vasa&rft.date=2011&rft.relation=http://hdl.handle.net/1959.3/95058&rft_subject=COMPUTER SOFTWARE AND SERVICES&rft_subject=INFORMATION AND COMMUNICATION SERVICES&rft_subject=Open source software&rft_subject=Software maintenance&rft_subject=Open Software&rft_subject=INFORMATION AND COMPUTING SCIENCES&rft_subject=COMPUTER SOFTWARE&rft_subject=Software engineering&rft_subject=PhD thesis&rft_subject=Metrics&rft_subject=Software Engineering&rft_subject=Software evolution&rft.type=dataset&rft.language=English Access the data

Access:

Other view details

Copyright © 2010 Rajesh Vasa.

The files are made available on open access with the kind permission of the author. Licence for re-use to be determined.

Full description

It is a widely accepted fact that evolving software systems change and grow. However, it is less well-understood how change is distributed over time, specifically in object oriented software systems. The patterns and techniques used to measure growth permit developers to identify specific releases where significant change took place as well as to inform them of the longer term trend in the distribution profile. This knowledge assists developers in recording systemic and substantial changes to a release, as well as to provide useful information as input into a potential release retrospective. In order to manage the evolution of complex software systems effectively, it is important to identify change-prone classes as early as possible, but these analysis methods can only be applied after a mature release of the code has been developed. Specifically, developers need to know where they can expect change, the likelihood of a change, and the magnitude of these modifications in order to take proactive steps and mitigate any potential risks arising from these changes. We present a statistical analysis of change in approximately 55000 unique classes across all projects under investigation. The raw metric data (4 .txt files and 4 .log files in a .zip file measuring ~2MB in total) is provided as a comma separated values (CSV) file, and the first line of the CSV file contains the header. A detailed output of the statistical analysis undertaken is provided as log files generated directly from Stata (statistical analysis software).

This dataset is part of a larger collection

Click to explore relationships graph

145.03889,-37.8226

145.038886,-37.822599

Identifiers