SlideShare a Scribd company logo
Journal for Research | Volume 02 | Issue 06 | August 2016
ISSN: 2395-7549
All rights reserved by www.journalforresearch.org 24
Automated Bug Triage using Advanced Data
Reduction Techniques
Irin Ani John Tintu Alphonsa Thomas
PG Scholar Assistant Professor
Department of Computer Science & Engineering Department of Computer Science & Engineering
Amal Jyothi College of Engineering Kanjirappally , Kottayam,
India
Amal Jyothi College of Engineering Kanjirappally , Kottayam,
India
Abstract
Bug triage is an important step in the process of bug fixing. The goal of bug triage is to correctly assign a developer to a newly
reported bug in the system. To perform the automated bug triage, text classification techniques are applied. This will helps to
reduce the time cost in manual work. To reduce the scale and improve the quality of bug data, the proposed system addresses the
data reduction techniques, instance selection and feature selection for bug triage. The instance selection technique used here is to
identify the relevant bugs that can match the newly reported bug. The feature selection technique is used to select the relevant
data from each bug in the training set. A predictive model is proposed to identify the order in which the data reduction
techniques are applied for each newly reported bug. This step will improve the performance of the classification process. An
experimental study using Eclipse and Firefox bug data is undergone in which the proposed system shows an accuracy of 73%.
Keywords: Data mining, bug triage, classification, data reduction, instance selection, feature selection
_______________________________________________________________________________________________________
I. INTRODUCTION
Bug fixing is a significant and time consuming process in software maintenance. For a large-scale soft-ware project, the number
of daily bugs is so large. It is impossible to handle them without delaying. Software bugs are inevitable and bug fixing is an
expensive process in software development. Software companies spend over 45 percent of cost in fixing bugs. Large software
projects maintain bug repositories for the support of information collection and to manage the similar future bugs. A bug
repository has an important role in handling software bugs. Bug repository is a typical software repository for storing bug details.
It keeps the textual description about how to reproduce the bug and updates according to the current status of bug fixes.
Bug triage, an important step for bug fixing, is to assign a new bug to a relevant developer for further handling. A general
method for bug triage is to as-sign bugs manually. In practice, due to the frequent changes of software development teams, it is
di cult to identify the correct developer in manual triage. Taking Eclipse2 as an example[1], Anvik reports that an average of 37
bugs per day are submitted to the bug tracking system and 3 person-hours per day are required for the manual triage; the
empirical study[11] by Jeong et al. shows that 44% of bugs have been assigned to the wrong developer after the first assign-
ment. To solve these problems, some machine learning algorithms are employed to conduct automatic bug triage. Most of the
bug triage approaches are based on text categorization. However, these approaches suffer from two problems, namely the large-
scale and the low-quality. It is necessary to collect large-scale training sets of bugs, due to the large number of bugs, to obtain
good results for bug triage. It may cost much time to directly use the large-scale training set in the bug triage process. The quality
of the original bug re-ports is not good enough. Two typical characteristics of low-quality bugs are noise and redundancy. Noisy
bugs may mislead related developers while redundant bugs waste the limited time of bug handling.
The proposed system is related with an automatic bug triage approach, which applies text classification techniques to predict
developers for new bug reports. In this system, each bug report is considered as a document and its corresponding developer is
considered as the label of the document. Since software bug data are a kind of free-form text data (generated by developers), it is
necessary to generate well-processed bug data to facilitate the application. This system ad-dresses the problem of data reduction
for bug triage, i.e., how to reduce the bug data to save the labour cost of developers and to improve the quality of bug data to
facilitate the effective bug triage. The pro-posed system reduces the bug data according to two criteria: the scale of a data set and
the accuracy of bug triage.
Instance selection and feature selection are the two data reduction techniques implemented in the pro-posed system. While the
instance selection technique selects the relevant bugs in the training set, the feature selection technique selects the relevant data
in each bug in the training set according to the newly reported bug. This system determines the order of ap-plying the data
reduction techniques for each newly re-ported bug. The order is determined by the attributes extracted from the new bug report.
A binary classifier is trained to predict the order of reduction [12].
The following sections are about the study of the related works, about the proposed system, the experiments on the proposed
system, the conclusions and future works.
Automated Bug Triage using Advanced Data Reduction Techniques
(J4R/ Volume 02 / Issue 06 / 005)
All rights reserved by www.journalforresearch.org 25
II. RELATED WORK
For modelling the bug data Anvik et al.[1] investigates the open bug repositories and the anatomy of bug reports. They proposed
a semi-automated bug triage approach, which applies text classification techniques to predict developers for bug reports. To
investigate the quality of bug data, Zimmermann et al.[2] design questionnaires to developers and users in three open source
projects. Based on the analysis of questionnaires, they characterize what makes a good bug report and train a classifier to identify
whether the quality of a bug report should be improved.
Aggarwal et al.[5] developed a graphical paradigm for the text representation to process on the textual data. It preserves a
high-level information about the ordering and distance between the words in the document. The developer social network is
helpful to understand the developer community and the project evolution. By mapping bug priorities to developers, Xuan et
al.[4] identify the developer prioritization in open source bug repositories. The developer prioritization can distinguish
developers and assist tasks in software maintenance. To apply existing solutions to the new bug, Kim et al.[3] developed bug and
x knowledge base by analysing the history of bug xes.
Cubranic and Murphy [6] first propose the problem of automatic bug triage to reduce the cost of manual bug triage. They
apply text classification techniques to predict related developers. Zou et al.[7][12] proposed the training set reduction with both
feature selection and instance selection techniques for bug triage. Brighton et al.[8] introduce an instance selection algorithm and
Yang and Pedersen [9] made a comparative study on different feature selection methods. Janaki Meera et al.[10] proposes a new
feature selection algorithm based on chi square statistics for the Naive Bayes classifier.
III. PROPOSED SYSTEM
The proposed system comprises five main processing stages. The first step for the bug triage is the pre-processing of the bug
data. Then the system predicts or-der of data reduction. It follows the instance selection and feature selection techniques which
are applied for the data reduction process. Using the reduced training data, the classification process is applied for the newly
reported bug.
Preprocessing:
Data pre-processing is one of the most important steps in text mining activity. The system is dealing with large number of text
documents so this data pre-processing step is essential. This text pre-processing system consist of activities like extraction,
validation, stop word removal and storing data formation.
The bug details are provided as JSON and XML les by the providers. The system needs to import these les into the triaging
system. After loading the les, to process the data, both JSON and XML data are converted into java readable format. The input le
contains all the reported bugs. For training set, the bugs which are in Fixed status are only needed. Stop words are language
specific functional words which carry no information. These are removed in this stage. The valid bug data which are used for
training dataset is stored permanently in relational database. The system uses MySQL as the database to store the training set.
Instance Selection:
The system compares the new bug with training dataset, and only the relevant bugs are selected for further processing. Each
words in the bug description of the new bug is searched for match in the training set. From the training set the matching bugs are
considered as relevant bugs. Thus the dataset dimension is largely reduced and the accuracy of the classification process is
improved.
Feature Selection:
In the proposed system each attributes of the bug is considered as the features of the bug. In this module the system selects the
bugs from the training set according to the attributes of the new bug. Only the selected features in the training data set, which are
given in the new bug, are selected for the classification process.
Predicting the Reduction Order:
The system predicts for each newly recorded bug, the order in which the data reduction techniques, i.e. instance selection and
feature selection processes are applied. Two orders are there: FS->IS and IS->FS. In FS->IS, the feature selection process is first
applied and the output of the feature selection process is applied as the input for the instance selection. The output of the instance
selection process will be the reduced training set. In IS->FS, the instance selection process is applied initially and the output of
the instance selection is applied as input for the feature selection. The prediction operation is performed ac-cording to the number
of available attributes for the newly reported bug. A binary classifier is used to per-form these two different orders. Here a
decision tree based algorithm is used as the binary classifier.
Classification:
The similarity measures are significant for the clustering and classification activities. These similarity measures are tested on the
K-Nearest Neighbour classification algorithm. KNN is one of the most popular methods for single-label classification in which a
document can belong to only one category. It classifies an unseen document by comparing it to its k nearest neighbours in a
Automated Bug Triage using Advanced Data Reduction Techniques
(J4R/ Volume 02 / Issue 06 / 005)
All rights reserved by www.journalforresearch.org 26
specified training set. Given a document d, let Dk, with corresponding label set Lk, be a set containing the k most similar
documents to d. Then d is classified to class c which appears most frequently in Lk. A random choice is made when a tie occurs.
In this module from the reduced dataset, after applying the training set reduction techniques, the most similar bug is selected.
In the proposed system classification is executed in two steps. In the first step the system filters bugs from the Instance selection
list with the number of words matching. In the next step from the feature selection list the system checks for the matching
features and chooses the bugs with the number of attribute matching. The common bugs are selected from the two lists. Then the
number of matching words and attributes are summed for each selected bug and created a new list which is sorted according to
the sum of Instance and Feature matching. Then from this list the bug id with the maximum sum value is selected. If the list
contains more than one bug with same value then the developer of the most recent bug is selected as the developer.
IV. EXPERIMENTS AND RESULTS
Data Set:
The bug repositories of two large open source projects, Eclipse and Mozilla Firefox are prepared for testing the proposed system.
In total, 73,070 bug de-tails of Eclipse and 64,265 bug details of Firefox are used for experimental analysis of this system.
Data Preparation:
The details of the Eclipse bugs are provided as JSON formatted le, while the Firefox bugs are pro-vided as XML formatted le by
the providers. The system imported these les into triaging system be-fore processing the bug data.
Data Reduction and Classification:
For the instance selection process, the proposed sys-tem applies a KNN based algorithm, while the similarity measures are used
for the feature selection process. From the total dataset 6,919 data are used as test data and the remaining 1,30,416 as training
dataset for evaluation of the system. It gets an overall 73% accuracy for the proposed system. Table 1 shows the performance
details of the proposed system on Eclipse and Firefox bug dataset.
Table – 1
Analysis Data
Tools Eclipse Firefox Overall
Total No. of Bugs 73,070 64,265 1,37,335
Training Data 69,752 60,664 1,30,416
Test Data 3,318 3,601 6,919
Correctly Triaged 2,167 2,908 5,075
Accuracy Percentage 65.3 80.75 73.03
V. CONCLUSION AND FUTURE WORK
Bug triage is an expensive step of software maintenance in both labour cost and time cost. In this bug triaging system, feature
selection and instance selection is combined to reduce the scale of training bug data sets as well as to improve the data quality. A
predictive model is built to determine the order of applying the reduction techniques by extracting the attributes. This system can
provide an approach, to leverage techniques on data processing, to form reduced and high-quality bug data in software
development and maintenance.
In the future work, the developer prioritization can be implemented during the classification process. This can provide a more
appropriate class for the new bug. The training set reduction of bug triage can be applied to other tasks to improve the software
quality. Since machine learning becomes one of the powerful tools in software engineering, the training set reduction can be
useful for the work based on machine learning.
REFERENCES
[1] J. Anvik, L. Hiew, and G. C. Murphy, Who should fix this bug?, in Proc. 28th Int. Conf. Softw. Eng., May 2006, pp. 361-370.
[2] T. Zimmermann, R. Premraj, N. Bettenburg, S. Just, A. Schroter, and C. Weiss, What makes a good bug report?", IEEE Trans. Softw. Eng., vol. 36, no. 5,
pp. 618-643, Oct. 2010.
[3] S. Kim, K. Pan, E. J. Whitehead, Jr., Memories of bug fixes", in Proc. ACM SIGSOFT Int. Symp. Found. Softw. Eng., 2006, pp. 3545.
[4] J. Xuan, H. Jiang, Z. Ren, and W. Zou, Devel-oper prioritization in bug repositories", in Proc. 34th Int. Conf. Softw. Eng., 2012, pp. 25-35.
[5] C. C. Aggarwal and P. Zhao, Towards graphical models for text processing", Knowl. Inform. Syst., vol. 36, no. 1, pp. 121, 2013.
[6] D. Cubranic and G. C. Murphy, Automatic bug triage using text categorization", in Proc. 16th Int. Conf. Softw. Eng. Knowl. Eng., Jun. 2004, pp. 92-97.
[7] W. Zou, Y. Hu, J. Xuan, and H. Jiang, Towards training set reduction for bug triage", in Proc. 35th Annu. IEEE Int. Comput. Soft. Appl. Conf., Jul. 2011,
pp. 576-581.
[8] H. Brighton and C. Mellish, Advances in in-stance selection for instance-based learning algorithms", Data Mining Knowl. Discovery, vol. 6, no. 2, pp.
153-172, Apr. 2002.
[9] Y. Yang and J. Pedersen, A comparative study on feature selection in text categorization", in Proc. Int. Conf. Mach. Learn., 1997, pp. 412-420.
Automated Bug Triage using Advanced Data Reduction Techniques
(J4R/ Volume 02 / Issue 06 / 005)
All rights reserved by www.journalforresearch.org 27
[10] M.Janaki Meena, K.R.Chandran, J.Mary Brinda and P.R.Sindhu,Enhancing Feature Selection Using Statistical Data with Unigrams and Bi-grams", in Int.
Journal of Comp. Appl. (0975 - 8887) vol. 1, No. 11, 2010, pp.7-11.
[11] G. Jeong, S. Kim, and T. Zimmermann, Improv-ing bug triage with tossing graphs, Proc. Joint Meeting European Software Engineering Conf. & ACM
SIGSOFT Symp. Foundations of Software Engineering (ESEC-FSE 09), ACM, Aug. 2009, pp. 111-120.
[12] J. Xuan, H. Jiang, Y. Hu, Z. Ren, W. Zou, Z. Luo, and X. Wu, Towards E_ective Bug Triage with Software Data Reduction Techniques", in IEEE Trans.
on Knowl and Data Engg., vol. 27, No. 1, Jan 2015, pp.264-280.
Ad

More Related Content

What's hot (20)

IRJET - Neural Network based Leaf Disease Detection and Remedy Recommenda...
IRJET -  	  Neural Network based Leaf Disease Detection and Remedy Recommenda...IRJET -  	  Neural Network based Leaf Disease Detection and Remedy Recommenda...
IRJET - Neural Network based Leaf Disease Detection and Remedy Recommenda...
IRJET Journal
 
Optimization of network traffic anomaly detection using machine learning
Optimization of network traffic anomaly detection using machine learning Optimization of network traffic anomaly detection using machine learning
Optimization of network traffic anomaly detection using machine learning
IJECEIAES
 
Effectiveness of test case
Effectiveness of test caseEffectiveness of test case
Effectiveness of test case
ijseajournal
 
Dc35579583
Dc35579583Dc35579583
Dc35579583
IJERA Editor
 
IRJET- Software Bug Prediction using Machine Learning Approach
IRJET- Software Bug Prediction using Machine Learning ApproachIRJET- Software Bug Prediction using Machine Learning Approach
IRJET- Software Bug Prediction using Machine Learning Approach
IRJET Journal
 
Abstract.doc
Abstract.docAbstract.doc
Abstract.doc
butest
 
A STUDY OF FORMULATION OF SOFTWARE TEST METRICS FOR INTERNET BASED APPLICATIONS
A STUDY OF FORMULATION OF SOFTWARE TEST METRICS FOR INTERNET BASED APPLICATIONSA STUDY OF FORMULATION OF SOFTWARE TEST METRICS FOR INTERNET BASED APPLICATIONS
A STUDY OF FORMULATION OF SOFTWARE TEST METRICS FOR INTERNET BASED APPLICATIONS
ecij
 
EFFECTIVE IMPLEMENTATION OF AGILE PRACTICES – OBJECT ORIENTED METRICS TOOL TO...
EFFECTIVE IMPLEMENTATION OF AGILE PRACTICES – OBJECT ORIENTED METRICS TOOL TO...EFFECTIVE IMPLEMENTATION OF AGILE PRACTICES – OBJECT ORIENTED METRICS TOOL TO...
EFFECTIVE IMPLEMENTATION OF AGILE PRACTICES – OBJECT ORIENTED METRICS TOOL TO...
ijseajournal
 
Automated exam question set generator using utility based agent and learning ...
Automated exam question set generator using utility based agent and learning ...Automated exam question set generator using utility based agent and learning ...
Automated exam question set generator using utility based agent and learning ...
Journal Papers
 
Bd36334337
Bd36334337Bd36334337
Bd36334337
IJERA Editor
 
Software testing strategy
Software testing strategySoftware testing strategy
Software testing strategy
ijseajournal
 
USING CATEGORICAL FEATURES IN MINING BUG TRACKING SYSTEMS TO ASSIGN BUG REPORTS
USING CATEGORICAL FEATURES IN MINING BUG TRACKING SYSTEMS TO ASSIGN BUG REPORTSUSING CATEGORICAL FEATURES IN MINING BUG TRACKING SYSTEMS TO ASSIGN BUG REPORTS
USING CATEGORICAL FEATURES IN MINING BUG TRACKING SYSTEMS TO ASSIGN BUG REPORTS
ijseajournal
 
A survey of fault prediction using machine learning algorithms
A survey of fault prediction using machine learning algorithmsA survey of fault prediction using machine learning algorithms
A survey of fault prediction using machine learning algorithms
Ahmed Magdy Ezzeldin, MSc.
 
Selecting Prominent API Calls and Labeling Malicious Samples for Effective Ma...
Selecting Prominent API Calls and Labeling Malicious Samples for Effective Ma...Selecting Prominent API Calls and Labeling Malicious Samples for Effective Ma...
Selecting Prominent API Calls and Labeling Malicious Samples for Effective Ma...
IJCSIS Research Publications
 
Test case prioritization using firefly algorithm for software testing
Test case prioritization using firefly algorithm for software testingTest case prioritization using firefly algorithm for software testing
Test case prioritization using firefly algorithm for software testing
Journal Papers
 
T0 numtq0nje=
T0 numtq0nje=T0 numtq0nje=
T0 numtq0nje=
International Journal of Science and Research (IJSR)
 
IRJET- Android Malware Detection using Deep Learning
IRJET- Android Malware Detection using Deep LearningIRJET- Android Malware Detection using Deep Learning
IRJET- Android Malware Detection using Deep Learning
IRJET Journal
 
TOWARDS PREDICTING SOFTWARE DEFECTS WITH CLUSTERING TECHNIQUES
TOWARDS PREDICTING SOFTWARE DEFECTS WITH CLUSTERING TECHNIQUESTOWARDS PREDICTING SOFTWARE DEFECTS WITH CLUSTERING TECHNIQUES
TOWARDS PREDICTING SOFTWARE DEFECTS WITH CLUSTERING TECHNIQUES
ijaia
 
Test case point analysis
Test case point analysisTest case point analysis
Test case point analysis
shahin kadir
 
Towards formulating dynamic model for predicting defects in system testing us...
Towards formulating dynamic model for predicting defects in system testing us...Towards formulating dynamic model for predicting defects in system testing us...
Towards formulating dynamic model for predicting defects in system testing us...
Journal Papers
 
IRJET - Neural Network based Leaf Disease Detection and Remedy Recommenda...
IRJET -  	  Neural Network based Leaf Disease Detection and Remedy Recommenda...IRJET -  	  Neural Network based Leaf Disease Detection and Remedy Recommenda...
IRJET - Neural Network based Leaf Disease Detection and Remedy Recommenda...
IRJET Journal
 
Optimization of network traffic anomaly detection using machine learning
Optimization of network traffic anomaly detection using machine learning Optimization of network traffic anomaly detection using machine learning
Optimization of network traffic anomaly detection using machine learning
IJECEIAES
 
Effectiveness of test case
Effectiveness of test caseEffectiveness of test case
Effectiveness of test case
ijseajournal
 
IRJET- Software Bug Prediction using Machine Learning Approach
IRJET- Software Bug Prediction using Machine Learning ApproachIRJET- Software Bug Prediction using Machine Learning Approach
IRJET- Software Bug Prediction using Machine Learning Approach
IRJET Journal
 
Abstract.doc
Abstract.docAbstract.doc
Abstract.doc
butest
 
A STUDY OF FORMULATION OF SOFTWARE TEST METRICS FOR INTERNET BASED APPLICATIONS
A STUDY OF FORMULATION OF SOFTWARE TEST METRICS FOR INTERNET BASED APPLICATIONSA STUDY OF FORMULATION OF SOFTWARE TEST METRICS FOR INTERNET BASED APPLICATIONS
A STUDY OF FORMULATION OF SOFTWARE TEST METRICS FOR INTERNET BASED APPLICATIONS
ecij
 
EFFECTIVE IMPLEMENTATION OF AGILE PRACTICES – OBJECT ORIENTED METRICS TOOL TO...
EFFECTIVE IMPLEMENTATION OF AGILE PRACTICES – OBJECT ORIENTED METRICS TOOL TO...EFFECTIVE IMPLEMENTATION OF AGILE PRACTICES – OBJECT ORIENTED METRICS TOOL TO...
EFFECTIVE IMPLEMENTATION OF AGILE PRACTICES – OBJECT ORIENTED METRICS TOOL TO...
ijseajournal
 
Automated exam question set generator using utility based agent and learning ...
Automated exam question set generator using utility based agent and learning ...Automated exam question set generator using utility based agent and learning ...
Automated exam question set generator using utility based agent and learning ...
Journal Papers
 
Software testing strategy
Software testing strategySoftware testing strategy
Software testing strategy
ijseajournal
 
USING CATEGORICAL FEATURES IN MINING BUG TRACKING SYSTEMS TO ASSIGN BUG REPORTS
USING CATEGORICAL FEATURES IN MINING BUG TRACKING SYSTEMS TO ASSIGN BUG REPORTSUSING CATEGORICAL FEATURES IN MINING BUG TRACKING SYSTEMS TO ASSIGN BUG REPORTS
USING CATEGORICAL FEATURES IN MINING BUG TRACKING SYSTEMS TO ASSIGN BUG REPORTS
ijseajournal
 
A survey of fault prediction using machine learning algorithms
A survey of fault prediction using machine learning algorithmsA survey of fault prediction using machine learning algorithms
A survey of fault prediction using machine learning algorithms
Ahmed Magdy Ezzeldin, MSc.
 
Selecting Prominent API Calls and Labeling Malicious Samples for Effective Ma...
Selecting Prominent API Calls and Labeling Malicious Samples for Effective Ma...Selecting Prominent API Calls and Labeling Malicious Samples for Effective Ma...
Selecting Prominent API Calls and Labeling Malicious Samples for Effective Ma...
IJCSIS Research Publications
 
Test case prioritization using firefly algorithm for software testing
Test case prioritization using firefly algorithm for software testingTest case prioritization using firefly algorithm for software testing
Test case prioritization using firefly algorithm for software testing
Journal Papers
 
IRJET- Android Malware Detection using Deep Learning
IRJET- Android Malware Detection using Deep LearningIRJET- Android Malware Detection using Deep Learning
IRJET- Android Malware Detection using Deep Learning
IRJET Journal
 
TOWARDS PREDICTING SOFTWARE DEFECTS WITH CLUSTERING TECHNIQUES
TOWARDS PREDICTING SOFTWARE DEFECTS WITH CLUSTERING TECHNIQUESTOWARDS PREDICTING SOFTWARE DEFECTS WITH CLUSTERING TECHNIQUES
TOWARDS PREDICTING SOFTWARE DEFECTS WITH CLUSTERING TECHNIQUES
ijaia
 
Test case point analysis
Test case point analysisTest case point analysis
Test case point analysis
shahin kadir
 
Towards formulating dynamic model for predicting defects in system testing us...
Towards formulating dynamic model for predicting defects in system testing us...Towards formulating dynamic model for predicting defects in system testing us...
Towards formulating dynamic model for predicting defects in system testing us...
Journal Papers
 

Viewers also liked (14)

Why Do We Get Disturbed ?
Why Do We Get Disturbed ?Why Do We Get Disturbed ?
Why Do We Get Disturbed ?
Amit Karkare
 
Deforestación en venezuela.
Deforestación en venezuela.Deforestación en venezuela.
Deforestación en venezuela.
Rafael Bastardo Vera
 
GREENHOUSE CULTIVATION
GREENHOUSE CULTIVATIONGREENHOUSE CULTIVATION
GREENHOUSE CULTIVATION
Journal For Research
 
Hack & Beers - Seguridad en Drupal
Hack & Beers - Seguridad en DrupalHack & Beers - Seguridad en Drupal
Hack & Beers - Seguridad en Drupal
zekivazquez
 
Born To Lead - How to Lead Your Team to a Better Tomorrow
Born To Lead - How to Lead Your Team to a Better TomorrowBorn To Lead - How to Lead Your Team to a Better Tomorrow
Born To Lead - How to Lead Your Team to a Better Tomorrow
SMA Equities
 
143 Ludlow Street Lower East Side owned by Samy Mahfar of SMA Equities
143 Ludlow Street Lower East Side owned by Samy Mahfar of SMA Equities143 Ludlow Street Lower East Side owned by Samy Mahfar of SMA Equities
143 Ludlow Street Lower East Side owned by Samy Mahfar of SMA Equities
SMA Equities
 
Elementos estruturais
Elementos estruturaisElementos estruturais
Elementos estruturais
Ingrid Lorrana Ferreira Sobreira
 
Obesity
ObesityObesity
Obesity
DR . RAJESH CHOUDHURI
 
Economía
Economía Economía
Economía
Rocio Uresti
 
Phosphorus
PhosphorusPhosphorus
Phosphorus
Amit Karkare
 
10 herramientas de un técnico de sistemas
10 herramientas de un técnico de sistemas10 herramientas de un técnico de sistemas
10 herramientas de un técnico de sistemas
jhoan sebastian polo
 
Presentation3
Presentation3Presentation3
Presentation3
Vedanssh Dalmia
 
Slideserve.co.uk dok holidej 009 - zlato gvadalupe.pdf
Slideserve.co.uk dok holidej 009 - zlato gvadalupe.pdfSlideserve.co.uk dok holidej 009 - zlato gvadalupe.pdf
Slideserve.co.uk dok holidej 009 - zlato gvadalupe.pdf
zoran radovic
 
Slideserve.co.uk dok holidej 010 - strelac iz kalifornije.pdf
Slideserve.co.uk dok holidej 010 - strelac iz kalifornije.pdfSlideserve.co.uk dok holidej 010 - strelac iz kalifornije.pdf
Slideserve.co.uk dok holidej 010 - strelac iz kalifornije.pdf
zoran radovic
 
Why Do We Get Disturbed ?
Why Do We Get Disturbed ?Why Do We Get Disturbed ?
Why Do We Get Disturbed ?
Amit Karkare
 
Hack & Beers - Seguridad en Drupal
Hack & Beers - Seguridad en DrupalHack & Beers - Seguridad en Drupal
Hack & Beers - Seguridad en Drupal
zekivazquez
 
Born To Lead - How to Lead Your Team to a Better Tomorrow
Born To Lead - How to Lead Your Team to a Better TomorrowBorn To Lead - How to Lead Your Team to a Better Tomorrow
Born To Lead - How to Lead Your Team to a Better Tomorrow
SMA Equities
 
143 Ludlow Street Lower East Side owned by Samy Mahfar of SMA Equities
143 Ludlow Street Lower East Side owned by Samy Mahfar of SMA Equities143 Ludlow Street Lower East Side owned by Samy Mahfar of SMA Equities
143 Ludlow Street Lower East Side owned by Samy Mahfar of SMA Equities
SMA Equities
 
10 herramientas de un técnico de sistemas
10 herramientas de un técnico de sistemas10 herramientas de un técnico de sistemas
10 herramientas de un técnico de sistemas
jhoan sebastian polo
 
Slideserve.co.uk dok holidej 009 - zlato gvadalupe.pdf
Slideserve.co.uk dok holidej 009 - zlato gvadalupe.pdfSlideserve.co.uk dok holidej 009 - zlato gvadalupe.pdf
Slideserve.co.uk dok holidej 009 - zlato gvadalupe.pdf
zoran radovic
 
Slideserve.co.uk dok holidej 010 - strelac iz kalifornije.pdf
Slideserve.co.uk dok holidej 010 - strelac iz kalifornije.pdfSlideserve.co.uk dok holidej 010 - strelac iz kalifornije.pdf
Slideserve.co.uk dok holidej 010 - strelac iz kalifornije.pdf
zoran radovic
 
Ad

Similar to AUTOMATED BUG TRIAGE USING ADVANCED DATA REDUCTION TECHNIQUES (20)

Towards effective bug triage with software
Towards effective bug triage with softwareTowards effective bug triage with software
Towards effective bug triage with software
Nexgen Technology
 
IRJET- Data Reduction in Bug Triage using Supervised Machine Learning
IRJET- Data Reduction in Bug Triage using Supervised Machine LearningIRJET- Data Reduction in Bug Triage using Supervised Machine Learning
IRJET- Data Reduction in Bug Triage using Supervised Machine Learning
IRJET Journal
 
TOWARDS EFFECTIVE BUG TRIAGE WITH SOFTWARE DATA REDUCTION TECHNIQUES
TOWARDS EFFECTIVE BUG TRIAGE WITH SOFTWARE DATA REDUCTION TECHNIQUESTOWARDS EFFECTIVE BUG TRIAGE WITH SOFTWARE DATA REDUCTION TECHNIQUES
TOWARDS EFFECTIVE BUG TRIAGE WITH SOFTWARE DATA REDUCTION TECHNIQUES
Shakas Technologies
 
A NOVEL APPROACH TO ERROR DETECTION AND CORRECTION OF C PROGRAMS USING MACHIN...
A NOVEL APPROACH TO ERROR DETECTION AND CORRECTION OF C PROGRAMS USING MACHIN...A NOVEL APPROACH TO ERROR DETECTION AND CORRECTION OF C PROGRAMS USING MACHIN...
A NOVEL APPROACH TO ERROR DETECTION AND CORRECTION OF C PROGRAMS USING MACHIN...
IJCI JOURNAL
 
Towards Effective Bug Triage with Software Data Reduction Techniques
Towards Effective Bug Triage with Software Data Reduction TechniquesTowards Effective Bug Triage with Software Data Reduction Techniques
Towards Effective Bug Triage with Software Data Reduction Techniques
1crore projects
 
Towards effective bug triage with software data reduction techniques
Towards effective bug triage with software data reduction techniquesTowards effective bug triage with software data reduction techniques
Towards effective bug triage with software data reduction techniques
redpel dot com
 
F017652530
F017652530F017652530
F017652530
IOSR Journals
 
A Review on Software Fault Detection and Prevention Mechanism in Software Dev...
A Review on Software Fault Detection and Prevention Mechanism in Software Dev...A Review on Software Fault Detection and Prevention Mechanism in Software Dev...
A Review on Software Fault Detection and Prevention Mechanism in Software Dev...
iosrjce
 
Towards effective bug triage with software data reduction techniques
Towards effective bug triage with software data reduction techniquesTowards effective bug triage with software data reduction techniques
Towards effective bug triage with software data reduction techniques
Pvrtechnologies Nellore
 
Nature-Based Prediction Model of Bug Reports Based on Ensemble Machine Learni...
Nature-Based Prediction Model of Bug Reports Based on Ensemble Machine Learni...Nature-Based Prediction Model of Bug Reports Based on Ensemble Machine Learni...
Nature-Based Prediction Model of Bug Reports Based on Ensemble Machine Learni...
Shakas Technologies
 
IRJET- A Detailed Analysis on Windows Event Log Viewer for Faster Root Ca...
IRJET-  	  A Detailed Analysis on Windows Event Log Viewer for Faster Root Ca...IRJET-  	  A Detailed Analysis on Windows Event Log Viewer for Faster Root Ca...
IRJET- A Detailed Analysis on Windows Event Log Viewer for Faster Root Ca...
IRJET Journal
 
J034057065
J034057065J034057065
J034057065
ijceronline
 
Multi step automated refactoring for code smell
Multi step automated refactoring for code smellMulti step automated refactoring for code smell
Multi step automated refactoring for code smell
eSAT Journals
 
Multi step automated refactoring for code smell
Multi step automated refactoring for code smellMulti step automated refactoring for code smell
Multi step automated refactoring for code smell
eSAT Publishing House
 
Survey on Software Data Reduction Techniques Accomplishing Bug Triage
Survey on Software Data Reduction Techniques Accomplishing Bug TriageSurvey on Software Data Reduction Techniques Accomplishing Bug Triage
Survey on Software Data Reduction Techniques Accomplishing Bug Triage
IRJET Journal
 
A Hierarchical Feature Set optimization for effective code change based Defec...
A Hierarchical Feature Set optimization for effective code change based Defec...A Hierarchical Feature Set optimization for effective code change based Defec...
A Hierarchical Feature Set optimization for effective code change based Defec...
IOSR Journals
 
Knowledge and Data Engineering IEEE 2015 Projects
Knowledge and Data Engineering IEEE 2015 ProjectsKnowledge and Data Engineering IEEE 2015 Projects
Knowledge and Data Engineering IEEE 2015 Projects
Vijay Karan
 
Parameter Estimation of GOEL-OKUMOTO Model by Comparing ACO with MLE Method
Parameter Estimation of GOEL-OKUMOTO Model by Comparing ACO with MLE MethodParameter Estimation of GOEL-OKUMOTO Model by Comparing ACO with MLE Method
Parameter Estimation of GOEL-OKUMOTO Model by Comparing ACO with MLE Method
IRJET Journal
 
A Novel Approach to Improve Software Defect Prediction Accuracy Using Machine...
A Novel Approach to Improve Software Defect Prediction Accuracy Using Machine...A Novel Approach to Improve Software Defect Prediction Accuracy Using Machine...
A Novel Approach to Improve Software Defect Prediction Accuracy Using Machine...
Shakas Technologies
 
IRJET- Automated Test Case Generation using Data Mining
IRJET- Automated Test Case Generation using Data MiningIRJET- Automated Test Case Generation using Data Mining
IRJET- Automated Test Case Generation using Data Mining
IRJET Journal
 
Towards effective bug triage with software
Towards effective bug triage with softwareTowards effective bug triage with software
Towards effective bug triage with software
Nexgen Technology
 
IRJET- Data Reduction in Bug Triage using Supervised Machine Learning
IRJET- Data Reduction in Bug Triage using Supervised Machine LearningIRJET- Data Reduction in Bug Triage using Supervised Machine Learning
IRJET- Data Reduction in Bug Triage using Supervised Machine Learning
IRJET Journal
 
TOWARDS EFFECTIVE BUG TRIAGE WITH SOFTWARE DATA REDUCTION TECHNIQUES
TOWARDS EFFECTIVE BUG TRIAGE WITH SOFTWARE DATA REDUCTION TECHNIQUESTOWARDS EFFECTIVE BUG TRIAGE WITH SOFTWARE DATA REDUCTION TECHNIQUES
TOWARDS EFFECTIVE BUG TRIAGE WITH SOFTWARE DATA REDUCTION TECHNIQUES
Shakas Technologies
 
A NOVEL APPROACH TO ERROR DETECTION AND CORRECTION OF C PROGRAMS USING MACHIN...
A NOVEL APPROACH TO ERROR DETECTION AND CORRECTION OF C PROGRAMS USING MACHIN...A NOVEL APPROACH TO ERROR DETECTION AND CORRECTION OF C PROGRAMS USING MACHIN...
A NOVEL APPROACH TO ERROR DETECTION AND CORRECTION OF C PROGRAMS USING MACHIN...
IJCI JOURNAL
 
Towards Effective Bug Triage with Software Data Reduction Techniques
Towards Effective Bug Triage with Software Data Reduction TechniquesTowards Effective Bug Triage with Software Data Reduction Techniques
Towards Effective Bug Triage with Software Data Reduction Techniques
1crore projects
 
Towards effective bug triage with software data reduction techniques
Towards effective bug triage with software data reduction techniquesTowards effective bug triage with software data reduction techniques
Towards effective bug triage with software data reduction techniques
redpel dot com
 
A Review on Software Fault Detection and Prevention Mechanism in Software Dev...
A Review on Software Fault Detection and Prevention Mechanism in Software Dev...A Review on Software Fault Detection and Prevention Mechanism in Software Dev...
A Review on Software Fault Detection and Prevention Mechanism in Software Dev...
iosrjce
 
Towards effective bug triage with software data reduction techniques
Towards effective bug triage with software data reduction techniquesTowards effective bug triage with software data reduction techniques
Towards effective bug triage with software data reduction techniques
Pvrtechnologies Nellore
 
Nature-Based Prediction Model of Bug Reports Based on Ensemble Machine Learni...
Nature-Based Prediction Model of Bug Reports Based on Ensemble Machine Learni...Nature-Based Prediction Model of Bug Reports Based on Ensemble Machine Learni...
Nature-Based Prediction Model of Bug Reports Based on Ensemble Machine Learni...
Shakas Technologies
 
IRJET- A Detailed Analysis on Windows Event Log Viewer for Faster Root Ca...
IRJET-  	  A Detailed Analysis on Windows Event Log Viewer for Faster Root Ca...IRJET-  	  A Detailed Analysis on Windows Event Log Viewer for Faster Root Ca...
IRJET- A Detailed Analysis on Windows Event Log Viewer for Faster Root Ca...
IRJET Journal
 
Multi step automated refactoring for code smell
Multi step automated refactoring for code smellMulti step automated refactoring for code smell
Multi step automated refactoring for code smell
eSAT Journals
 
Multi step automated refactoring for code smell
Multi step automated refactoring for code smellMulti step automated refactoring for code smell
Multi step automated refactoring for code smell
eSAT Publishing House
 
Survey on Software Data Reduction Techniques Accomplishing Bug Triage
Survey on Software Data Reduction Techniques Accomplishing Bug TriageSurvey on Software Data Reduction Techniques Accomplishing Bug Triage
Survey on Software Data Reduction Techniques Accomplishing Bug Triage
IRJET Journal
 
A Hierarchical Feature Set optimization for effective code change based Defec...
A Hierarchical Feature Set optimization for effective code change based Defec...A Hierarchical Feature Set optimization for effective code change based Defec...
A Hierarchical Feature Set optimization for effective code change based Defec...
IOSR Journals
 
Knowledge and Data Engineering IEEE 2015 Projects
Knowledge and Data Engineering IEEE 2015 ProjectsKnowledge and Data Engineering IEEE 2015 Projects
Knowledge and Data Engineering IEEE 2015 Projects
Vijay Karan
 
Parameter Estimation of GOEL-OKUMOTO Model by Comparing ACO with MLE Method
Parameter Estimation of GOEL-OKUMOTO Model by Comparing ACO with MLE MethodParameter Estimation of GOEL-OKUMOTO Model by Comparing ACO with MLE Method
Parameter Estimation of GOEL-OKUMOTO Model by Comparing ACO with MLE Method
IRJET Journal
 
A Novel Approach to Improve Software Defect Prediction Accuracy Using Machine...
A Novel Approach to Improve Software Defect Prediction Accuracy Using Machine...A Novel Approach to Improve Software Defect Prediction Accuracy Using Machine...
A Novel Approach to Improve Software Defect Prediction Accuracy Using Machine...
Shakas Technologies
 
IRJET- Automated Test Case Generation using Data Mining
IRJET- Automated Test Case Generation using Data MiningIRJET- Automated Test Case Generation using Data Mining
IRJET- Automated Test Case Generation using Data Mining
IRJET Journal
 
Ad

More from Journal For Research (20)

Design and Analysis of Hydraulic Actuator in a Typical Aerospace vehicle | J4...
Design and Analysis of Hydraulic Actuator in a Typical Aerospace vehicle | J4...Design and Analysis of Hydraulic Actuator in a Typical Aerospace vehicle | J4...
Design and Analysis of Hydraulic Actuator in a Typical Aerospace vehicle | J4...
Journal For Research
 
Experimental Verification and Validation of Stress Distribution of Composite ...
Experimental Verification and Validation of Stress Distribution of Composite ...Experimental Verification and Validation of Stress Distribution of Composite ...
Experimental Verification and Validation of Stress Distribution of Composite ...
Journal For Research
 
Image Binarization for the uses of Preprocessing to Detect Brain Abnormality ...
Image Binarization for the uses of Preprocessing to Detect Brain Abnormality ...Image Binarization for the uses of Preprocessing to Detect Brain Abnormality ...
Image Binarization for the uses of Preprocessing to Detect Brain Abnormality ...
Journal For Research
 
A Research Paper on BFO and PSO Based Movie Recommendation System | J4RV4I1016
A Research Paper on BFO and PSO Based Movie Recommendation System | J4RV4I1016A Research Paper on BFO and PSO Based Movie Recommendation System | J4RV4I1016
A Research Paper on BFO and PSO Based Movie Recommendation System | J4RV4I1016
Journal For Research
 
IoT based Digital Agriculture Monitoring System and Their Impact on Optimal U...
IoT based Digital Agriculture Monitoring System and Their Impact on Optimal U...IoT based Digital Agriculture Monitoring System and Their Impact on Optimal U...
IoT based Digital Agriculture Monitoring System and Their Impact on Optimal U...
Journal For Research
 
A REVIEW PAPER ON BFO AND PSO BASED MOVIE RECOMMENDATION SYSTEM | J4RV4I1015
A REVIEW PAPER ON BFO AND PSO BASED MOVIE RECOMMENDATION SYSTEM | J4RV4I1015A REVIEW PAPER ON BFO AND PSO BASED MOVIE RECOMMENDATION SYSTEM | J4RV4I1015
A REVIEW PAPER ON BFO AND PSO BASED MOVIE RECOMMENDATION SYSTEM | J4RV4I1015
Journal For Research
 
HCI BASED APPLICATION FOR PLAYING COMPUTER GAMES | J4RV4I1014
HCI BASED APPLICATION FOR PLAYING COMPUTER GAMES | J4RV4I1014HCI BASED APPLICATION FOR PLAYING COMPUTER GAMES | J4RV4I1014
HCI BASED APPLICATION FOR PLAYING COMPUTER GAMES | J4RV4I1014
Journal For Research
 
A REVIEW ON DESIGN OF PUBLIC TRANSPORTATION SYSTEM IN CHANDRAPUR CITY | J4RV4...
A REVIEW ON DESIGN OF PUBLIC TRANSPORTATION SYSTEM IN CHANDRAPUR CITY | J4RV4...A REVIEW ON DESIGN OF PUBLIC TRANSPORTATION SYSTEM IN CHANDRAPUR CITY | J4RV4...
A REVIEW ON DESIGN OF PUBLIC TRANSPORTATION SYSTEM IN CHANDRAPUR CITY | J4RV4...
Journal For Research
 
A REVIEW ON LIFTING AND ASSEMBLY OF ROTARY KILN TYRE WITH SHELL BY FLEXIBLE G...
A REVIEW ON LIFTING AND ASSEMBLY OF ROTARY KILN TYRE WITH SHELL BY FLEXIBLE G...A REVIEW ON LIFTING AND ASSEMBLY OF ROTARY KILN TYRE WITH SHELL BY FLEXIBLE G...
A REVIEW ON LIFTING AND ASSEMBLY OF ROTARY KILN TYRE WITH SHELL BY FLEXIBLE G...
Journal For Research
 
LABORATORY STUDY OF STRONG, MODERATE AND WEAK SANDSTONES | J4RV4I1012
LABORATORY STUDY OF STRONG, MODERATE AND WEAK SANDSTONES | J4RV4I1012LABORATORY STUDY OF STRONG, MODERATE AND WEAK SANDSTONES | J4RV4I1012
LABORATORY STUDY OF STRONG, MODERATE AND WEAK SANDSTONES | J4RV4I1012
Journal For Research
 
DESIGN ANALYSIS AND FABRICATION OF MANUAL RICE TRANSPLANTING MACHINE | J4RV4I...
DESIGN ANALYSIS AND FABRICATION OF MANUAL RICE TRANSPLANTING MACHINE | J4RV4I...DESIGN ANALYSIS AND FABRICATION OF MANUAL RICE TRANSPLANTING MACHINE | J4RV4I...
DESIGN ANALYSIS AND FABRICATION OF MANUAL RICE TRANSPLANTING MACHINE | J4RV4I...
Journal For Research
 
AN OVERVIEW: DAKNET TECHNOLOGY - BROADBAND AD-HOC CONNECTIVITY | J4RV4I1009
AN OVERVIEW: DAKNET TECHNOLOGY - BROADBAND AD-HOC CONNECTIVITY | J4RV4I1009AN OVERVIEW: DAKNET TECHNOLOGY - BROADBAND AD-HOC CONNECTIVITY | J4RV4I1009
AN OVERVIEW: DAKNET TECHNOLOGY - BROADBAND AD-HOC CONNECTIVITY | J4RV4I1009
Journal For Research
 
LINE FOLLOWER ROBOT | J4RV4I1010
LINE FOLLOWER ROBOT | J4RV4I1010LINE FOLLOWER ROBOT | J4RV4I1010
LINE FOLLOWER ROBOT | J4RV4I1010
Journal For Research
 
CHATBOT FOR COLLEGE RELATED QUERIES | J4RV4I1008
CHATBOT FOR COLLEGE RELATED QUERIES | J4RV4I1008CHATBOT FOR COLLEGE RELATED QUERIES | J4RV4I1008
CHATBOT FOR COLLEGE RELATED QUERIES | J4RV4I1008
Journal For Research
 
AN INTEGRATED APPROACH TO REDUCE INTRA CITY TRAFFIC AT COIMBATORE | J4RV4I1002
AN INTEGRATED APPROACH TO REDUCE INTRA CITY TRAFFIC AT COIMBATORE | J4RV4I1002AN INTEGRATED APPROACH TO REDUCE INTRA CITY TRAFFIC AT COIMBATORE | J4RV4I1002
AN INTEGRATED APPROACH TO REDUCE INTRA CITY TRAFFIC AT COIMBATORE | J4RV4I1002
Journal For Research
 
A REVIEW STUDY ON GAS-SOLID CYCLONE SEPARATOR USING LAPPLE MODEL | J4RV4I1001
A REVIEW STUDY ON GAS-SOLID CYCLONE SEPARATOR USING LAPPLE MODEL | J4RV4I1001A REVIEW STUDY ON GAS-SOLID CYCLONE SEPARATOR USING LAPPLE MODEL | J4RV4I1001
A REVIEW STUDY ON GAS-SOLID CYCLONE SEPARATOR USING LAPPLE MODEL | J4RV4I1001
Journal For Research
 
IMAGE SEGMENTATION USING FCM ALGORITM | J4RV3I12021
IMAGE SEGMENTATION USING FCM ALGORITM | J4RV3I12021IMAGE SEGMENTATION USING FCM ALGORITM | J4RV3I12021
IMAGE SEGMENTATION USING FCM ALGORITM | J4RV3I12021
Journal For Research
 
USE OF GALVANIZED STEELS FOR AUTOMOTIVE BODY- CAR SURVEY RESULTS AT COASTAL A...
USE OF GALVANIZED STEELS FOR AUTOMOTIVE BODY- CAR SURVEY RESULTS AT COASTAL A...USE OF GALVANIZED STEELS FOR AUTOMOTIVE BODY- CAR SURVEY RESULTS AT COASTAL A...
USE OF GALVANIZED STEELS FOR AUTOMOTIVE BODY- CAR SURVEY RESULTS AT COASTAL A...
Journal For Research
 
UNMANNED AERIAL VEHICLE FOR REMITTANCE | J4RV3I12023
UNMANNED AERIAL VEHICLE FOR REMITTANCE | J4RV3I12023UNMANNED AERIAL VEHICLE FOR REMITTANCE | J4RV3I12023
UNMANNED AERIAL VEHICLE FOR REMITTANCE | J4RV3I12023
Journal For Research
 
SURVEY ON A MODERN MEDICARE SYSTEM USING INTERNET OF THINGS | J4RV3I12024
SURVEY ON A MODERN MEDICARE SYSTEM USING INTERNET OF THINGS | J4RV3I12024SURVEY ON A MODERN MEDICARE SYSTEM USING INTERNET OF THINGS | J4RV3I12024
SURVEY ON A MODERN MEDICARE SYSTEM USING INTERNET OF THINGS | J4RV3I12024
Journal For Research
 
Design and Analysis of Hydraulic Actuator in a Typical Aerospace vehicle | J4...
Design and Analysis of Hydraulic Actuator in a Typical Aerospace vehicle | J4...Design and Analysis of Hydraulic Actuator in a Typical Aerospace vehicle | J4...
Design and Analysis of Hydraulic Actuator in a Typical Aerospace vehicle | J4...
Journal For Research
 
Experimental Verification and Validation of Stress Distribution of Composite ...
Experimental Verification and Validation of Stress Distribution of Composite ...Experimental Verification and Validation of Stress Distribution of Composite ...
Experimental Verification and Validation of Stress Distribution of Composite ...
Journal For Research
 
Image Binarization for the uses of Preprocessing to Detect Brain Abnormality ...
Image Binarization for the uses of Preprocessing to Detect Brain Abnormality ...Image Binarization for the uses of Preprocessing to Detect Brain Abnormality ...
Image Binarization for the uses of Preprocessing to Detect Brain Abnormality ...
Journal For Research
 
A Research Paper on BFO and PSO Based Movie Recommendation System | J4RV4I1016
A Research Paper on BFO and PSO Based Movie Recommendation System | J4RV4I1016A Research Paper on BFO and PSO Based Movie Recommendation System | J4RV4I1016
A Research Paper on BFO and PSO Based Movie Recommendation System | J4RV4I1016
Journal For Research
 
IoT based Digital Agriculture Monitoring System and Their Impact on Optimal U...
IoT based Digital Agriculture Monitoring System and Their Impact on Optimal U...IoT based Digital Agriculture Monitoring System and Their Impact on Optimal U...
IoT based Digital Agriculture Monitoring System and Their Impact on Optimal U...
Journal For Research
 
A REVIEW PAPER ON BFO AND PSO BASED MOVIE RECOMMENDATION SYSTEM | J4RV4I1015
A REVIEW PAPER ON BFO AND PSO BASED MOVIE RECOMMENDATION SYSTEM | J4RV4I1015A REVIEW PAPER ON BFO AND PSO BASED MOVIE RECOMMENDATION SYSTEM | J4RV4I1015
A REVIEW PAPER ON BFO AND PSO BASED MOVIE RECOMMENDATION SYSTEM | J4RV4I1015
Journal For Research
 
HCI BASED APPLICATION FOR PLAYING COMPUTER GAMES | J4RV4I1014
HCI BASED APPLICATION FOR PLAYING COMPUTER GAMES | J4RV4I1014HCI BASED APPLICATION FOR PLAYING COMPUTER GAMES | J4RV4I1014
HCI BASED APPLICATION FOR PLAYING COMPUTER GAMES | J4RV4I1014
Journal For Research
 
A REVIEW ON DESIGN OF PUBLIC TRANSPORTATION SYSTEM IN CHANDRAPUR CITY | J4RV4...
A REVIEW ON DESIGN OF PUBLIC TRANSPORTATION SYSTEM IN CHANDRAPUR CITY | J4RV4...A REVIEW ON DESIGN OF PUBLIC TRANSPORTATION SYSTEM IN CHANDRAPUR CITY | J4RV4...
A REVIEW ON DESIGN OF PUBLIC TRANSPORTATION SYSTEM IN CHANDRAPUR CITY | J4RV4...
Journal For Research
 
A REVIEW ON LIFTING AND ASSEMBLY OF ROTARY KILN TYRE WITH SHELL BY FLEXIBLE G...
A REVIEW ON LIFTING AND ASSEMBLY OF ROTARY KILN TYRE WITH SHELL BY FLEXIBLE G...A REVIEW ON LIFTING AND ASSEMBLY OF ROTARY KILN TYRE WITH SHELL BY FLEXIBLE G...
A REVIEW ON LIFTING AND ASSEMBLY OF ROTARY KILN TYRE WITH SHELL BY FLEXIBLE G...
Journal For Research
 
LABORATORY STUDY OF STRONG, MODERATE AND WEAK SANDSTONES | J4RV4I1012
LABORATORY STUDY OF STRONG, MODERATE AND WEAK SANDSTONES | J4RV4I1012LABORATORY STUDY OF STRONG, MODERATE AND WEAK SANDSTONES | J4RV4I1012
LABORATORY STUDY OF STRONG, MODERATE AND WEAK SANDSTONES | J4RV4I1012
Journal For Research
 
DESIGN ANALYSIS AND FABRICATION OF MANUAL RICE TRANSPLANTING MACHINE | J4RV4I...
DESIGN ANALYSIS AND FABRICATION OF MANUAL RICE TRANSPLANTING MACHINE | J4RV4I...DESIGN ANALYSIS AND FABRICATION OF MANUAL RICE TRANSPLANTING MACHINE | J4RV4I...
DESIGN ANALYSIS AND FABRICATION OF MANUAL RICE TRANSPLANTING MACHINE | J4RV4I...
Journal For Research
 
AN OVERVIEW: DAKNET TECHNOLOGY - BROADBAND AD-HOC CONNECTIVITY | J4RV4I1009
AN OVERVIEW: DAKNET TECHNOLOGY - BROADBAND AD-HOC CONNECTIVITY | J4RV4I1009AN OVERVIEW: DAKNET TECHNOLOGY - BROADBAND AD-HOC CONNECTIVITY | J4RV4I1009
AN OVERVIEW: DAKNET TECHNOLOGY - BROADBAND AD-HOC CONNECTIVITY | J4RV4I1009
Journal For Research
 
CHATBOT FOR COLLEGE RELATED QUERIES | J4RV4I1008
CHATBOT FOR COLLEGE RELATED QUERIES | J4RV4I1008CHATBOT FOR COLLEGE RELATED QUERIES | J4RV4I1008
CHATBOT FOR COLLEGE RELATED QUERIES | J4RV4I1008
Journal For Research
 
AN INTEGRATED APPROACH TO REDUCE INTRA CITY TRAFFIC AT COIMBATORE | J4RV4I1002
AN INTEGRATED APPROACH TO REDUCE INTRA CITY TRAFFIC AT COIMBATORE | J4RV4I1002AN INTEGRATED APPROACH TO REDUCE INTRA CITY TRAFFIC AT COIMBATORE | J4RV4I1002
AN INTEGRATED APPROACH TO REDUCE INTRA CITY TRAFFIC AT COIMBATORE | J4RV4I1002
Journal For Research
 
A REVIEW STUDY ON GAS-SOLID CYCLONE SEPARATOR USING LAPPLE MODEL | J4RV4I1001
A REVIEW STUDY ON GAS-SOLID CYCLONE SEPARATOR USING LAPPLE MODEL | J4RV4I1001A REVIEW STUDY ON GAS-SOLID CYCLONE SEPARATOR USING LAPPLE MODEL | J4RV4I1001
A REVIEW STUDY ON GAS-SOLID CYCLONE SEPARATOR USING LAPPLE MODEL | J4RV4I1001
Journal For Research
 
IMAGE SEGMENTATION USING FCM ALGORITM | J4RV3I12021
IMAGE SEGMENTATION USING FCM ALGORITM | J4RV3I12021IMAGE SEGMENTATION USING FCM ALGORITM | J4RV3I12021
IMAGE SEGMENTATION USING FCM ALGORITM | J4RV3I12021
Journal For Research
 
USE OF GALVANIZED STEELS FOR AUTOMOTIVE BODY- CAR SURVEY RESULTS AT COASTAL A...
USE OF GALVANIZED STEELS FOR AUTOMOTIVE BODY- CAR SURVEY RESULTS AT COASTAL A...USE OF GALVANIZED STEELS FOR AUTOMOTIVE BODY- CAR SURVEY RESULTS AT COASTAL A...
USE OF GALVANIZED STEELS FOR AUTOMOTIVE BODY- CAR SURVEY RESULTS AT COASTAL A...
Journal For Research
 
UNMANNED AERIAL VEHICLE FOR REMITTANCE | J4RV3I12023
UNMANNED AERIAL VEHICLE FOR REMITTANCE | J4RV3I12023UNMANNED AERIAL VEHICLE FOR REMITTANCE | J4RV3I12023
UNMANNED AERIAL VEHICLE FOR REMITTANCE | J4RV3I12023
Journal For Research
 
SURVEY ON A MODERN MEDICARE SYSTEM USING INTERNET OF THINGS | J4RV3I12024
SURVEY ON A MODERN MEDICARE SYSTEM USING INTERNET OF THINGS | J4RV3I12024SURVEY ON A MODERN MEDICARE SYSTEM USING INTERNET OF THINGS | J4RV3I12024
SURVEY ON A MODERN MEDICARE SYSTEM USING INTERNET OF THINGS | J4RV3I12024
Journal For Research
 

Recently uploaded (20)

PHYSIOLOGY MCQS By DR. NASIR MUSTAFA (PHYSIOLOGY)
PHYSIOLOGY MCQS By DR. NASIR MUSTAFA (PHYSIOLOGY)PHYSIOLOGY MCQS By DR. NASIR MUSTAFA (PHYSIOLOGY)
PHYSIOLOGY MCQS By DR. NASIR MUSTAFA (PHYSIOLOGY)
Dr. Nasir Mustafa
 
How to Configure Public Holidays & Mandatory Days in Odoo 18
How to Configure Public Holidays & Mandatory Days in Odoo 18How to Configure Public Holidays & Mandatory Days in Odoo 18
How to Configure Public Holidays & Mandatory Days in Odoo 18
Celine George
 
What is the Philosophy of Statistics? (and how I was drawn to it)
What is the Philosophy of Statistics? (and how I was drawn to it)What is the Philosophy of Statistics? (and how I was drawn to it)
What is the Philosophy of Statistics? (and how I was drawn to it)
jemille6
 
*"The Segmented Blueprint: Unlocking Insect Body Architecture"*.pptx
*"The Segmented Blueprint: Unlocking Insect Body Architecture"*.pptx*"The Segmented Blueprint: Unlocking Insect Body Architecture"*.pptx
*"The Segmented Blueprint: Unlocking Insect Body Architecture"*.pptx
Arshad Shaikh
 
How to Clean Your Contacts Using the Deduplication Menu in Odoo 18
How to Clean Your Contacts Using the Deduplication Menu in Odoo 18How to Clean Your Contacts Using the Deduplication Menu in Odoo 18
How to Clean Your Contacts Using the Deduplication Menu in Odoo 18
Celine George
 
How to Manage Upselling in Odoo 18 Sales
How to Manage Upselling in Odoo 18 SalesHow to Manage Upselling in Odoo 18 Sales
How to Manage Upselling in Odoo 18 Sales
Celine George
 
LDMMIA Reiki News Ed3 Vol1 For Team and Guests
LDMMIA Reiki News Ed3 Vol1 For Team and GuestsLDMMIA Reiki News Ed3 Vol1 For Team and Guests
LDMMIA Reiki News Ed3 Vol1 For Team and Guests
LDM Mia eStudios
 
Cultivation Practice of Turmeric in Nepal.pptx
Cultivation Practice of Turmeric in Nepal.pptxCultivation Practice of Turmeric in Nepal.pptx
Cultivation Practice of Turmeric in Nepal.pptx
UmeshTimilsina1
 
*"Sensing the World: Insect Sensory Systems"*
*"Sensing the World: Insect Sensory Systems"**"Sensing the World: Insect Sensory Systems"*
*"Sensing the World: Insect Sensory Systems"*
Arshad Shaikh
 
spinal cord disorders (Myelopathies and radiculoapthies)
spinal cord disorders (Myelopathies and radiculoapthies)spinal cord disorders (Myelopathies and radiculoapthies)
spinal cord disorders (Myelopathies and radiculoapthies)
Mohamed Rizk Khodair
 
Botany Assignment Help Guide - Academic Excellence
Botany Assignment Help Guide - Academic ExcellenceBotany Assignment Help Guide - Academic Excellence
Botany Assignment Help Guide - Academic Excellence
online college homework help
 
All About the 990 Unlocking Its Mysteries and Its Power.pdf
All About the 990 Unlocking Its Mysteries and Its Power.pdfAll About the 990 Unlocking Its Mysteries and Its Power.pdf
All About the 990 Unlocking Its Mysteries and Its Power.pdf
TechSoup
 
Final Evaluation.docx...........................
Final Evaluation.docx...........................Final Evaluation.docx...........................
Final Evaluation.docx...........................
l1bbyburrell
 
Myopathies (muscle disorders) for undergraduate
Myopathies (muscle disorders) for undergraduateMyopathies (muscle disorders) for undergraduate
Myopathies (muscle disorders) for undergraduate
Mohamed Rizk Khodair
 
ANTI-VIRAL DRUGS unit 3 Pharmacology 3.pptx
ANTI-VIRAL DRUGS unit 3 Pharmacology 3.pptxANTI-VIRAL DRUGS unit 3 Pharmacology 3.pptx
ANTI-VIRAL DRUGS unit 3 Pharmacology 3.pptx
Mayuri Chavan
 
Origin of Brahmi script: A breaking down of various theories
Origin of Brahmi script: A breaking down of various theoriesOrigin of Brahmi script: A breaking down of various theories
Origin of Brahmi script: A breaking down of various theories
PrachiSontakke5
 
Myasthenia gravis (Neuromuscular disorder)
Myasthenia gravis (Neuromuscular disorder)Myasthenia gravis (Neuromuscular disorder)
Myasthenia gravis (Neuromuscular disorder)
Mohamed Rizk Khodair
 
How to Share Accounts Between Companies in Odoo 18
How to Share Accounts Between Companies in Odoo 18How to Share Accounts Between Companies in Odoo 18
How to Share Accounts Between Companies in Odoo 18
Celine George
 
E-Filing_of_Income_Tax.pptx and concept of form 26AS
E-Filing_of_Income_Tax.pptx and concept of form 26ASE-Filing_of_Income_Tax.pptx and concept of form 26AS
E-Filing_of_Income_Tax.pptx and concept of form 26AS
Abinash Palangdar
 
PHYSIOLOGY MCQS By DR. NASIR MUSTAFA (PHYSIOLOGY)
PHYSIOLOGY MCQS By DR. NASIR MUSTAFA (PHYSIOLOGY)PHYSIOLOGY MCQS By DR. NASIR MUSTAFA (PHYSIOLOGY)
PHYSIOLOGY MCQS By DR. NASIR MUSTAFA (PHYSIOLOGY)
Dr. Nasir Mustafa
 
How to Configure Public Holidays & Mandatory Days in Odoo 18
How to Configure Public Holidays & Mandatory Days in Odoo 18How to Configure Public Holidays & Mandatory Days in Odoo 18
How to Configure Public Holidays & Mandatory Days in Odoo 18
Celine George
 
What is the Philosophy of Statistics? (and how I was drawn to it)
What is the Philosophy of Statistics? (and how I was drawn to it)What is the Philosophy of Statistics? (and how I was drawn to it)
What is the Philosophy of Statistics? (and how I was drawn to it)
jemille6
 
*"The Segmented Blueprint: Unlocking Insect Body Architecture"*.pptx
*"The Segmented Blueprint: Unlocking Insect Body Architecture"*.pptx*"The Segmented Blueprint: Unlocking Insect Body Architecture"*.pptx
*"The Segmented Blueprint: Unlocking Insect Body Architecture"*.pptx
Arshad Shaikh
 
How to Clean Your Contacts Using the Deduplication Menu in Odoo 18
How to Clean Your Contacts Using the Deduplication Menu in Odoo 18How to Clean Your Contacts Using the Deduplication Menu in Odoo 18
How to Clean Your Contacts Using the Deduplication Menu in Odoo 18
Celine George
 
How to Manage Upselling in Odoo 18 Sales
How to Manage Upselling in Odoo 18 SalesHow to Manage Upselling in Odoo 18 Sales
How to Manage Upselling in Odoo 18 Sales
Celine George
 
LDMMIA Reiki News Ed3 Vol1 For Team and Guests
LDMMIA Reiki News Ed3 Vol1 For Team and GuestsLDMMIA Reiki News Ed3 Vol1 For Team and Guests
LDMMIA Reiki News Ed3 Vol1 For Team and Guests
LDM Mia eStudios
 
Cultivation Practice of Turmeric in Nepal.pptx
Cultivation Practice of Turmeric in Nepal.pptxCultivation Practice of Turmeric in Nepal.pptx
Cultivation Practice of Turmeric in Nepal.pptx
UmeshTimilsina1
 
*"Sensing the World: Insect Sensory Systems"*
*"Sensing the World: Insect Sensory Systems"**"Sensing the World: Insect Sensory Systems"*
*"Sensing the World: Insect Sensory Systems"*
Arshad Shaikh
 
spinal cord disorders (Myelopathies and radiculoapthies)
spinal cord disorders (Myelopathies and radiculoapthies)spinal cord disorders (Myelopathies and radiculoapthies)
spinal cord disorders (Myelopathies and radiculoapthies)
Mohamed Rizk Khodair
 
Botany Assignment Help Guide - Academic Excellence
Botany Assignment Help Guide - Academic ExcellenceBotany Assignment Help Guide - Academic Excellence
Botany Assignment Help Guide - Academic Excellence
online college homework help
 
All About the 990 Unlocking Its Mysteries and Its Power.pdf
All About the 990 Unlocking Its Mysteries and Its Power.pdfAll About the 990 Unlocking Its Mysteries and Its Power.pdf
All About the 990 Unlocking Its Mysteries and Its Power.pdf
TechSoup
 
Final Evaluation.docx...........................
Final Evaluation.docx...........................Final Evaluation.docx...........................
Final Evaluation.docx...........................
l1bbyburrell
 
Myopathies (muscle disorders) for undergraduate
Myopathies (muscle disorders) for undergraduateMyopathies (muscle disorders) for undergraduate
Myopathies (muscle disorders) for undergraduate
Mohamed Rizk Khodair
 
ANTI-VIRAL DRUGS unit 3 Pharmacology 3.pptx
ANTI-VIRAL DRUGS unit 3 Pharmacology 3.pptxANTI-VIRAL DRUGS unit 3 Pharmacology 3.pptx
ANTI-VIRAL DRUGS unit 3 Pharmacology 3.pptx
Mayuri Chavan
 
Origin of Brahmi script: A breaking down of various theories
Origin of Brahmi script: A breaking down of various theoriesOrigin of Brahmi script: A breaking down of various theories
Origin of Brahmi script: A breaking down of various theories
PrachiSontakke5
 
Myasthenia gravis (Neuromuscular disorder)
Myasthenia gravis (Neuromuscular disorder)Myasthenia gravis (Neuromuscular disorder)
Myasthenia gravis (Neuromuscular disorder)
Mohamed Rizk Khodair
 
How to Share Accounts Between Companies in Odoo 18
How to Share Accounts Between Companies in Odoo 18How to Share Accounts Between Companies in Odoo 18
How to Share Accounts Between Companies in Odoo 18
Celine George
 
E-Filing_of_Income_Tax.pptx and concept of form 26AS
E-Filing_of_Income_Tax.pptx and concept of form 26ASE-Filing_of_Income_Tax.pptx and concept of form 26AS
E-Filing_of_Income_Tax.pptx and concept of form 26AS
Abinash Palangdar
 

AUTOMATED BUG TRIAGE USING ADVANCED DATA REDUCTION TECHNIQUES

  • 1. Journal for Research | Volume 02 | Issue 06 | August 2016 ISSN: 2395-7549 All rights reserved by www.journalforresearch.org 24 Automated Bug Triage using Advanced Data Reduction Techniques Irin Ani John Tintu Alphonsa Thomas PG Scholar Assistant Professor Department of Computer Science & Engineering Department of Computer Science & Engineering Amal Jyothi College of Engineering Kanjirappally , Kottayam, India Amal Jyothi College of Engineering Kanjirappally , Kottayam, India Abstract Bug triage is an important step in the process of bug fixing. The goal of bug triage is to correctly assign a developer to a newly reported bug in the system. To perform the automated bug triage, text classification techniques are applied. This will helps to reduce the time cost in manual work. To reduce the scale and improve the quality of bug data, the proposed system addresses the data reduction techniques, instance selection and feature selection for bug triage. The instance selection technique used here is to identify the relevant bugs that can match the newly reported bug. The feature selection technique is used to select the relevant data from each bug in the training set. A predictive model is proposed to identify the order in which the data reduction techniques are applied for each newly reported bug. This step will improve the performance of the classification process. An experimental study using Eclipse and Firefox bug data is undergone in which the proposed system shows an accuracy of 73%. Keywords: Data mining, bug triage, classification, data reduction, instance selection, feature selection _______________________________________________________________________________________________________ I. INTRODUCTION Bug fixing is a significant and time consuming process in software maintenance. For a large-scale soft-ware project, the number of daily bugs is so large. It is impossible to handle them without delaying. Software bugs are inevitable and bug fixing is an expensive process in software development. Software companies spend over 45 percent of cost in fixing bugs. Large software projects maintain bug repositories for the support of information collection and to manage the similar future bugs. A bug repository has an important role in handling software bugs. Bug repository is a typical software repository for storing bug details. It keeps the textual description about how to reproduce the bug and updates according to the current status of bug fixes. Bug triage, an important step for bug fixing, is to assign a new bug to a relevant developer for further handling. A general method for bug triage is to as-sign bugs manually. In practice, due to the frequent changes of software development teams, it is di cult to identify the correct developer in manual triage. Taking Eclipse2 as an example[1], Anvik reports that an average of 37 bugs per day are submitted to the bug tracking system and 3 person-hours per day are required for the manual triage; the empirical study[11] by Jeong et al. shows that 44% of bugs have been assigned to the wrong developer after the first assign- ment. To solve these problems, some machine learning algorithms are employed to conduct automatic bug triage. Most of the bug triage approaches are based on text categorization. However, these approaches suffer from two problems, namely the large- scale and the low-quality. It is necessary to collect large-scale training sets of bugs, due to the large number of bugs, to obtain good results for bug triage. It may cost much time to directly use the large-scale training set in the bug triage process. The quality of the original bug re-ports is not good enough. Two typical characteristics of low-quality bugs are noise and redundancy. Noisy bugs may mislead related developers while redundant bugs waste the limited time of bug handling. The proposed system is related with an automatic bug triage approach, which applies text classification techniques to predict developers for new bug reports. In this system, each bug report is considered as a document and its corresponding developer is considered as the label of the document. Since software bug data are a kind of free-form text data (generated by developers), it is necessary to generate well-processed bug data to facilitate the application. This system ad-dresses the problem of data reduction for bug triage, i.e., how to reduce the bug data to save the labour cost of developers and to improve the quality of bug data to facilitate the effective bug triage. The pro-posed system reduces the bug data according to two criteria: the scale of a data set and the accuracy of bug triage. Instance selection and feature selection are the two data reduction techniques implemented in the pro-posed system. While the instance selection technique selects the relevant bugs in the training set, the feature selection technique selects the relevant data in each bug in the training set according to the newly reported bug. This system determines the order of ap-plying the data reduction techniques for each newly re-ported bug. The order is determined by the attributes extracted from the new bug report. A binary classifier is trained to predict the order of reduction [12]. The following sections are about the study of the related works, about the proposed system, the experiments on the proposed system, the conclusions and future works.
  • 2. Automated Bug Triage using Advanced Data Reduction Techniques (J4R/ Volume 02 / Issue 06 / 005) All rights reserved by www.journalforresearch.org 25 II. RELATED WORK For modelling the bug data Anvik et al.[1] investigates the open bug repositories and the anatomy of bug reports. They proposed a semi-automated bug triage approach, which applies text classification techniques to predict developers for bug reports. To investigate the quality of bug data, Zimmermann et al.[2] design questionnaires to developers and users in three open source projects. Based on the analysis of questionnaires, they characterize what makes a good bug report and train a classifier to identify whether the quality of a bug report should be improved. Aggarwal et al.[5] developed a graphical paradigm for the text representation to process on the textual data. It preserves a high-level information about the ordering and distance between the words in the document. The developer social network is helpful to understand the developer community and the project evolution. By mapping bug priorities to developers, Xuan et al.[4] identify the developer prioritization in open source bug repositories. The developer prioritization can distinguish developers and assist tasks in software maintenance. To apply existing solutions to the new bug, Kim et al.[3] developed bug and x knowledge base by analysing the history of bug xes. Cubranic and Murphy [6] first propose the problem of automatic bug triage to reduce the cost of manual bug triage. They apply text classification techniques to predict related developers. Zou et al.[7][12] proposed the training set reduction with both feature selection and instance selection techniques for bug triage. Brighton et al.[8] introduce an instance selection algorithm and Yang and Pedersen [9] made a comparative study on different feature selection methods. Janaki Meera et al.[10] proposes a new feature selection algorithm based on chi square statistics for the Naive Bayes classifier. III. PROPOSED SYSTEM The proposed system comprises five main processing stages. The first step for the bug triage is the pre-processing of the bug data. Then the system predicts or-der of data reduction. It follows the instance selection and feature selection techniques which are applied for the data reduction process. Using the reduced training data, the classification process is applied for the newly reported bug. Preprocessing: Data pre-processing is one of the most important steps in text mining activity. The system is dealing with large number of text documents so this data pre-processing step is essential. This text pre-processing system consist of activities like extraction, validation, stop word removal and storing data formation. The bug details are provided as JSON and XML les by the providers. The system needs to import these les into the triaging system. After loading the les, to process the data, both JSON and XML data are converted into java readable format. The input le contains all the reported bugs. For training set, the bugs which are in Fixed status are only needed. Stop words are language specific functional words which carry no information. These are removed in this stage. The valid bug data which are used for training dataset is stored permanently in relational database. The system uses MySQL as the database to store the training set. Instance Selection: The system compares the new bug with training dataset, and only the relevant bugs are selected for further processing. Each words in the bug description of the new bug is searched for match in the training set. From the training set the matching bugs are considered as relevant bugs. Thus the dataset dimension is largely reduced and the accuracy of the classification process is improved. Feature Selection: In the proposed system each attributes of the bug is considered as the features of the bug. In this module the system selects the bugs from the training set according to the attributes of the new bug. Only the selected features in the training data set, which are given in the new bug, are selected for the classification process. Predicting the Reduction Order: The system predicts for each newly recorded bug, the order in which the data reduction techniques, i.e. instance selection and feature selection processes are applied. Two orders are there: FS->IS and IS->FS. In FS->IS, the feature selection process is first applied and the output of the feature selection process is applied as the input for the instance selection. The output of the instance selection process will be the reduced training set. In IS->FS, the instance selection process is applied initially and the output of the instance selection is applied as input for the feature selection. The prediction operation is performed ac-cording to the number of available attributes for the newly reported bug. A binary classifier is used to per-form these two different orders. Here a decision tree based algorithm is used as the binary classifier. Classification: The similarity measures are significant for the clustering and classification activities. These similarity measures are tested on the K-Nearest Neighbour classification algorithm. KNN is one of the most popular methods for single-label classification in which a document can belong to only one category. It classifies an unseen document by comparing it to its k nearest neighbours in a
  • 3. Automated Bug Triage using Advanced Data Reduction Techniques (J4R/ Volume 02 / Issue 06 / 005) All rights reserved by www.journalforresearch.org 26 specified training set. Given a document d, let Dk, with corresponding label set Lk, be a set containing the k most similar documents to d. Then d is classified to class c which appears most frequently in Lk. A random choice is made when a tie occurs. In this module from the reduced dataset, after applying the training set reduction techniques, the most similar bug is selected. In the proposed system classification is executed in two steps. In the first step the system filters bugs from the Instance selection list with the number of words matching. In the next step from the feature selection list the system checks for the matching features and chooses the bugs with the number of attribute matching. The common bugs are selected from the two lists. Then the number of matching words and attributes are summed for each selected bug and created a new list which is sorted according to the sum of Instance and Feature matching. Then from this list the bug id with the maximum sum value is selected. If the list contains more than one bug with same value then the developer of the most recent bug is selected as the developer. IV. EXPERIMENTS AND RESULTS Data Set: The bug repositories of two large open source projects, Eclipse and Mozilla Firefox are prepared for testing the proposed system. In total, 73,070 bug de-tails of Eclipse and 64,265 bug details of Firefox are used for experimental analysis of this system. Data Preparation: The details of the Eclipse bugs are provided as JSON formatted le, while the Firefox bugs are pro-vided as XML formatted le by the providers. The system imported these les into triaging system be-fore processing the bug data. Data Reduction and Classification: For the instance selection process, the proposed sys-tem applies a KNN based algorithm, while the similarity measures are used for the feature selection process. From the total dataset 6,919 data are used as test data and the remaining 1,30,416 as training dataset for evaluation of the system. It gets an overall 73% accuracy for the proposed system. Table 1 shows the performance details of the proposed system on Eclipse and Firefox bug dataset. Table – 1 Analysis Data Tools Eclipse Firefox Overall Total No. of Bugs 73,070 64,265 1,37,335 Training Data 69,752 60,664 1,30,416 Test Data 3,318 3,601 6,919 Correctly Triaged 2,167 2,908 5,075 Accuracy Percentage 65.3 80.75 73.03 V. CONCLUSION AND FUTURE WORK Bug triage is an expensive step of software maintenance in both labour cost and time cost. In this bug triaging system, feature selection and instance selection is combined to reduce the scale of training bug data sets as well as to improve the data quality. A predictive model is built to determine the order of applying the reduction techniques by extracting the attributes. This system can provide an approach, to leverage techniques on data processing, to form reduced and high-quality bug data in software development and maintenance. In the future work, the developer prioritization can be implemented during the classification process. This can provide a more appropriate class for the new bug. The training set reduction of bug triage can be applied to other tasks to improve the software quality. Since machine learning becomes one of the powerful tools in software engineering, the training set reduction can be useful for the work based on machine learning. REFERENCES [1] J. Anvik, L. Hiew, and G. C. Murphy, Who should fix this bug?, in Proc. 28th Int. Conf. Softw. Eng., May 2006, pp. 361-370. [2] T. Zimmermann, R. Premraj, N. Bettenburg, S. Just, A. Schroter, and C. Weiss, What makes a good bug report?", IEEE Trans. Softw. Eng., vol. 36, no. 5, pp. 618-643, Oct. 2010. [3] S. Kim, K. Pan, E. J. Whitehead, Jr., Memories of bug fixes", in Proc. ACM SIGSOFT Int. Symp. Found. Softw. Eng., 2006, pp. 3545. [4] J. Xuan, H. Jiang, Z. Ren, and W. Zou, Devel-oper prioritization in bug repositories", in Proc. 34th Int. Conf. Softw. Eng., 2012, pp. 25-35. [5] C. C. Aggarwal and P. Zhao, Towards graphical models for text processing", Knowl. Inform. Syst., vol. 36, no. 1, pp. 121, 2013. [6] D. Cubranic and G. C. Murphy, Automatic bug triage using text categorization", in Proc. 16th Int. Conf. Softw. Eng. Knowl. Eng., Jun. 2004, pp. 92-97. [7] W. Zou, Y. Hu, J. Xuan, and H. Jiang, Towards training set reduction for bug triage", in Proc. 35th Annu. IEEE Int. Comput. Soft. Appl. Conf., Jul. 2011, pp. 576-581. [8] H. Brighton and C. Mellish, Advances in in-stance selection for instance-based learning algorithms", Data Mining Knowl. Discovery, vol. 6, no. 2, pp. 153-172, Apr. 2002. [9] Y. Yang and J. Pedersen, A comparative study on feature selection in text categorization", in Proc. Int. Conf. Mach. Learn., 1997, pp. 412-420.
  • 4. Automated Bug Triage using Advanced Data Reduction Techniques (J4R/ Volume 02 / Issue 06 / 005) All rights reserved by www.journalforresearch.org 27 [10] M.Janaki Meena, K.R.Chandran, J.Mary Brinda and P.R.Sindhu,Enhancing Feature Selection Using Statistical Data with Unigrams and Bi-grams", in Int. Journal of Comp. Appl. (0975 - 8887) vol. 1, No. 11, 2010, pp.7-11. [11] G. Jeong, S. Kim, and T. Zimmermann, Improv-ing bug triage with tossing graphs, Proc. Joint Meeting European Software Engineering Conf. & ACM SIGSOFT Symp. Foundations of Software Engineering (ESEC-FSE 09), ACM, Aug. 2009, pp. 111-120. [12] J. Xuan, H. Jiang, Y. Hu, Z. Ren, W. Zou, Z. Luo, and X. Wu, Towards E_ective Bug Triage with Software Data Reduction Techniques", in IEEE Trans. on Knowl and Data Engg., vol. 27, No. 1, Jan 2015, pp.264-280.
  翻译: