Home // eKNOW 2014, The Sixth International Conference on Information, Process, and Knowledge Management // View article


Uncovering File Relationships using Association Mining and Topic Modeling

Authors:
Namita Dave
Delmar Davis
Karen Potts
Hazeline U. Asuncion

Keywords: Association mining; Topic Modeling; Software Engineering

Abstract:
Software maintenance tasks, such as feature enhancements and bug fixes, require familiarity with the entire software system. A modification task could become very time consuming if there is no prior knowledge of the system. Association mining has been used to identify the files that frequently change together in a software repository, and this information can aid a software engineer locate relevant files for a maintenance task. However, association mining techniques are limited to the amount of project history stored in a software repository. We address this difficulty by using a technique that combines association mining with topic modeling, referred to as Frequent Pattern Growth with Latent Dirichlet Allocation (FP-LDA). Topic modeling aims to uncover file relationships by learning semantic topics from source files. We validated our technique via case studies on two open source projects. Our results indicate that topic modeling can increase the effectiveness of association mining in uncovering the file relationships.

Pages: 105 to 111

Copyright: Copyright (c) IARIA, 2014

Publication date: March 23, 2014

Published in: conference

ISSN: 2308-4375

ISBN: 978-1-61208-329-2

Location: Barcelona, Spain

Dates: from March 23, 2014 to March 27, 2014