Back to Projects GitHub Repository

Car Loan Price Estimation

Purpose and Situation: This project is based on a case study for an auto loan organization, e-car. We reviewed and examined the business model and past data to determine if e-cars could potentially generate higher profits through auto car loans. According to the case study, e-car has encountered difficulties in obtaining loan acceptance from potential customers. Upon reviewing the case data, it became apparent that e-cars were offering considerably higher interest rates to customers deemed risky, placing them in a higher risk tier. We believe that e-car may have been overpricing its loans, significantly exceeding market rates for car loans. In today's competitive business landscape, e-cars must competitively price their offerings while also mitigating risk and minimizing losses.

Upon brief data exploration, it became evident that the acceptance rate of e-car loans was unusually low. Out of the 208,077 loans offered, only 45,785 were accepted, resulting in a 22% acceptance rate. We propose that e-car's objective should be to achieve an acceptance rate of over 50%. Several areas for improvement have been identified:

We have decided to address this issue using both value-based and cost-based pricing strategies. Value-based pricing entails quoting fair prices and providing seamless service from start to finish. Cost-based pricing will enable us to determine the best pricing strategy, ensuring that we neither miss opportunities by overpricing nor undercut potential profits by underpricing the loans.

Task:

After exploring the data further, we were able to gain more insight into the possible areas of improvement. With so many factors to consider, e-car needs to identify what are the important factors in decision making while customers are shopping for car loans.

Out of all these factors, rate and loan terms are something that e-car can control directly and by changing the parameters we were able to get insight, predict and estimate possible revenue for e-car. After estimating and predicting the acceptance of each loan we can attempt to optimize the pricing by using parameters from our preferred model. In seeking to validate our hypothesis that the "Interest rate has a causal effect on the decision made by customers," we propose employing causal inference and estimating the effect of manipulating the rate (APR).

We are proposing to use the methods below to predict loan outcomes and optimize the pricing better to maximize returns. Models we attempted are below.

Action:

Cleaning Data set and Training Causal Model: The dataset contains 208,077 data sets. After cleaning the data and observing trends (see the code submitted for data cleaning and observed insights), we divided the dataset into three equal parts (n= 69,363 per group). Partition 1 was utilized to train the IVSLS model using the factors we mentioned above. Partition 3 and partition 2 were used as treatment and control sets respectively. We chose to consider tier, Car Type, Amount, Competition Rate and IV variable [Rate ~ Season] to train our model. The model yielded an R² value of 0.81 and Accuracy was 88.8%. Once the model was trained, we moved on to our experiment section. The code base will be submitted with the package.

Experiment: Based on the Nomis solution case suggestion, we decided to segment our data further into 8 (using the median as the breaking point) Tiers, then treat each loan in the tier with varying rates. The baseline acceptance rate was 26.10% and 23.3% for our experimental (treatment) and control groups respectively. The samples were assigned randomly, and we decided to not manipulate the data further to avoid biases. We decided to start by matching the competitor’s rate (general market rate) from tier 3- 8 initially and vary the rate to optimize it better. We found that matching the competitor's rate yielded ~75% acceptance rate and thus we had room to quote a bit higher to minimize risk and still meet our goal of >50% acceptance rate. Finally, we decided to match the market rate for new cars but add two value +2% for the other cars (refinance and used). The final acceptance rate was 66%.

Net Profit Optimization:

Per the project instructions, we calculated profit in one segment, segment 4, for demonstration. We simulated treatment as mentioned above by segmenting the tiers into eight segments, setting the price at the market rate for new cars, and price for used and refinance was set at competitor rate +2 and normalized term 45-70 months (most accepted loans were in that range). Regarding term, we suggest that customers will be given term options in that range, with more options for customers in lower tier. For this experiment’s purpose, we just normalized the initial value. Following that calculated expected interest earning, Total cost, expected loss, probability of default (Risk) and the final net profit. See Appendix for formulas used.

Results:

The findings of our analysis revealed a substantial increase in the overall acceptance rate from 22% to 66%, consequently leading to a notable improvement in the total profit margin. The logistic regression model resulted in ~32% increase with the same treatment. Our strategy primarily targeted securing acceptance of quotes, thereby amplifying our net profitability. Focusing on a specific tier, namely Segment 4 (n=7704), initially yielded a profit of $9.5 million, exclusive of any projected losses or forfeitures. Following adjustments to the interest rates and loan durations, Segment 4 witnessed a surge in profitability to $41.1 million, prior to factoring in expected losses. Accounting for expected losses, the net profit estimate settled at $12.7 million.

In summary, our consultancy recommends e-car to vigilantly monitor prevailing market rates and strive to align their loan offerings accordingly. Additionally, optimizing the duration of loan terms and refining segmentation strategies could further boost the acceptance rate. Despite the inherent risks of default that come with lowering rate price, our analysis demonstrates a significant increase in net profit as acceptance rates improve with even when factoring the probability of default. Prioritizing the acquisition of potential customers not only elevates acceptance rates but also augments revenue per accepted quotation.

To conclude, it is critical to acknowledge that the dataset utilized comprises one year of sales records. Access to a more extensive historical data set could provide deeper insights. Therefore, continuous evaluation, prediction, and estimation are essential for the efficacy of marketing and pricing models. Furthermore, it is crucial to replicate this experiment multiple times to ascertain its validity and applicability.

Appendixes

1. Profit and Risk Formulas used.