Software defect prediction is related to the testing area of software industry. Several methods have been developed for the prediction of bugs in software source codes. The objective of this study is to find the inconsistency of performance between imbalances and balance data set and to find the distinction of performance between single classifier and aggregate classifier (voting). In this investigation, eight publicly available data sets have collected, also seven algorithms and hard voting are used for finding precision, recall and F-1 score to predict software defect. In these collected data, two sets are almost balanced. For this investigation, these balanced data sets have converted into imbalanced sets as average non-defective and defective ratio of the other 6 data sets. The experiment result shows that performance of the two balanced data sets is lower than other six sets. After conversion of two data sets, the performance has increased as like as other six data sets. Another observation is the performance metric that shows the results of precision, recall and F1-score for voting are 0.92, 0.84 and 0.87 respectively, which are better than other single classifier. This study has been able to shows that- imbalance of non-defective and defective classes have a big impact on software defect prediction and the voting is the best performer among the classifiers