Advanced
Please use this identifier to cite or link to this item: https://digital.lib.ueh.edu.vn/handle/UEH/71169
Full metadata record
DC FieldValueLanguage
dc.contributor.advisorAssoc. Prof. Dr. Le Xuan Truongen_US
dc.contributor.advisorDr. Ta Quoc Baoen_US
dc.contributor.authorBui Thi Thien Myen_US
dc.date.accessioned2024-06-20T07:50:11Z-
dc.date.available2024-06-20T07:50:11Z-
dc.date.issued2024-
dc.identifier.otherBarcode: 1000016996-
dc.identifier.urihttps://opac.ueh.edu.vn/record=b1036923~S1-
dc.identifier.urihttps://digital.lib.ueh.edu.vn/handle/UEH/71169-
dc.description.abstractIn classification, imbalanced data occurs when there is a great difference in the quantities of classes of the training data set. This problem frequently arises in various fields, for example, credit scoring and medical diagnosis. With imbalanced data, predictive modeling for real-world applications has posed a challenge because most machine learning algorithms are designed for balanced data sets. Therefore, addressing imbalanced data has attracted much attention from researchers and practitioners. In this dissertation, we propose solutions for imbalanced classification. Furthermore, these solutions are applied to a credit scoring case study. The solutions are derived from three papers published in the scientific journals. The first paper presents an interpretable decision tree ensemble model for imbalanced credit scoring data sets. The second paper introduces a novel technique for addressing imbalanced data, particularly in the cases of overlapping and noisy samples. The final paper proposes a modification of Logistic regression focusing on the optimization F-measure, a popular metric in imbalanced classification. These classifiers have been trained on a range of public and private data sets with highly imbalanced status and overlapping classes. The primary results demonstrate that the proposed works outperform both traditional and some recent models.en_US
dc.format.medium123 p.en_US
dc.language.isoEnglishen_US
dc.publisherUniversity of Economics Ho Chi Minh Cityen_US
dc.subjectImbalanced dataen_US
dc.titleImbalanced data in classification: a case study of credit scoringen_US
dc.typeDissertationsen_US
ueh.specialityStatistics = Thống kêen_US
item.fulltextFull texts-
item.languageiso639-1English-
item.openairetypeDissertations-
item.grantfulltextreserved-
item.openairecristypehttp://purl.org/coar/resource_type/c_18cf-
item.cerifentitytypePublications-
Appears in Collections:DISSERTATIONS
Files in This Item:

File

Description

Size

Format

Show simple item record

Google ScholarTM

Check


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.