Context

Credit risks refer to the risks of loss on a debt that occurs when the borrower fails to repay the principal and related interest amounts of a loan back to the lender on due dates.

When a bank receives a loan application, based on the applicant’s profile the bank has to make a decision for its approval or rejection. There are two types of risks associated with this decision:

If the applicant has good credit risk, i.e. is likely to repay the loan, then rejecting the loan results in a loss to the bank
If the applicant has bad credit risk, i.e. is unlikely to repay the loan, then approving the loan results in a loss to the bank

It may be assumed that the second risk is a greater risk, as the bank (or any other institution lending the money to an untrustworthy party) had a higher chance of not being paid back the borrowed amount.

So it's on the part of the bank or other lending authority to evaluate the risks associated with lending money to a customer.

Problem Statement

Imagine a bank in your locality. The bank has realized that applying data science methodologies can help them focus their resources efficiently, make smarter decisions on credit risk calculations, and improve performance.

Earlier they used to check the credit risk of the loan applicants manually by analyzing their bank-related data, which used to take months of time. But this time they want a smart data scientist who can automate this process.

Objective

You are required to build a machine learning model that helps you predict the credit risk of the loan applicants.

Evaluation Criteria

Submissions are evaluated using Accuracy Score.

How do we do it?

Once you generate and submit the target variable predictions on the test dataset, your submissions will be compared with the true values of the target variable.

The True or Actual values of the target variable are hidden on the DPhi platform so that we can evaluate your model's performance on unseen data. Finally, an accuracy score for your model will be generated and displayed

Timeline

Start Date: 9th October 2020, 21:00 hours IST / 17:30 hours CET (please locate your time here)

End Date: 12th October 2020, 21:00 hours IST / 17:30 hours CET (please locate your time here)

Problem Setters: Nisrin Dhoondia, Manish KC

Data Sprint #9: Credit Risk

Challenge Starts

Registration Ends

Challenge Ends