TY - GEN
T1 - Understanding android fragmentation with topic analysis of vendor-specific bugs
AU - Han, Dan
AU - Zhang, Chenlei
AU - Fan, Xiaochao
AU - Hindle, Abram
AU - Wong, Kenny
AU - Stroulia, Eleni
PY - 2012
Y1 - 2012
N2 - The fragmentation of the Android ecosystem causes portability and compatibility issues within the entire Android platform, which increases developer workload, delays application deployment, and ultimately disappoints users. This subject is discussed in the press and in scientific publications but it has yet to be systematically examined. The Android bug reports, as submitted by Android-device users, span across operating-system versions and hardware platforms and can provide interesting evidence about the problem. In this paper, we analyze the bug reports related to two popular vendors, HTC and Motorola. First, we manually label the bug reports. Next, we use Labeled-LDA (Latent Dirichlet Allocation) on the labeled data and LDA on the original data, to infer topics. Finally, by examining the relevance of the top 18 bug topics for each vendor's bug reports over time, we classify topics as common or unique (vendor-specific). The latter category constitutes evidence of fragmentation and lack of portability. By comparing Labeled-LDA against LDA, we find that Labeled-LDA produced better, i.e., more feature oriented, topics than LDA. In this paper we find out how fragmentation is manifested within the Android project and we propose a method for tracking fragmentation using feature analysis on project repositories.
AB - The fragmentation of the Android ecosystem causes portability and compatibility issues within the entire Android platform, which increases developer workload, delays application deployment, and ultimately disappoints users. This subject is discussed in the press and in scientific publications but it has yet to be systematically examined. The Android bug reports, as submitted by Android-device users, span across operating-system versions and hardware platforms and can provide interesting evidence about the problem. In this paper, we analyze the bug reports related to two popular vendors, HTC and Motorola. First, we manually label the bug reports. Next, we use Labeled-LDA (Latent Dirichlet Allocation) on the labeled data and LDA on the original data, to infer topics. Finally, by examining the relevance of the top 18 bug topics for each vendor's bug reports over time, we classify topics as common or unique (vendor-specific). The latter category constitutes evidence of fragmentation and lack of portability. By comparing Labeled-LDA against LDA, we find that Labeled-LDA produced better, i.e., more feature oriented, topics than LDA. In this paper we find out how fragmentation is manifested within the Android project and we propose a method for tracking fragmentation using feature analysis on project repositories.
KW - Bug reports
KW - Fragmentation
KW - LDA
KW - Labeled-LDA
KW - Topic mining
UR - https://www.scopus.com/pages/publications/84872327167
U2 - 10.1109/WCRE.2012.18
DO - 10.1109/WCRE.2012.18
M3 - Conference Paper published in a book
AN - SCOPUS:84872327167
SN - 9780769548913
T3 - Proceedings - Working Conference on Reverse Engineering, WCRE
SP - 83
EP - 92
BT - Proceedings - 19th Working Conference on Reverse Engineering, WCRE 2012
T2 - 19th Working Conference on Reverse Engineering, WCRE 2012
Y2 - 15 October 2012 through 18 October 2012
ER -