Predicting Credit Risk by using PySpark ML and Docker Part-1

Inan ateş
Jan 8 · 8 min read

Analyzing a dataset about Credit risk

Credit risk can be explained as the possibility of a loss because of a borrower’s failure to repay a loan or meet contractual obligations. Basically, it means the risk that a lender may not receive the owed principal and interest. The higher risk implies the higher cost, that makes this topic important for many people. In this article, we will analyze a dataset about the loan status of applicants and make predictions for new applications via different machine learning algorithms.

To keep reading this story, create a free account.

Already have an account? Sign in

Thanks to Kiarash Irandoust

ITNEXT

ITNEXT is a platform for IT developers & software engineers to share knowledge, connect, collaborate, learn and experience next-gen technologies.

Welcome to a place where words matter. On Medium, smart voices and original ideas take center stage - with no ads in sight. Watch
Follow all the topics you care about, and we’ll deliver the best stories for you to your homepage and inbox. Explore
Get unlimited access to the best stories on Medium — and support writers while you’re at it. Just $5/month. Upgrade