Three people review a document on a laptop.

Working Paper

Constructing Applicants from Loan-Level Data: A Case Study of Mortgage Applications

February 2025

WP 25-05 – We develop an algorithm to detect loan applicants who submit multiple applications in a loan-level dataset. We estimate that in our data our method identifies applicants that submit multiple mortgage applications with 93 percent precision.

We develop a clustering-based algorithm to detect loan applicants who submit multiple applications (“cross-applicants”) in a loan-level dataset without personal identifiers. A key innovation of our approach is a novel evaluation method that does not require labeled training data, allowing us to optimize the tuning parameters of our machine learning algorithm. By applying this methodology to Home Mortgage Disclosure Act data, we create a unique dataset that consolidates mortgage applications to the individual applicant level across the United States. Our preferred specification identifies cross-applicants with 93 percent precision.

View the Full Working Paper

Measuring Fairness in the U.S. Mortgage Market

February 2025

WP 25-04 – How fair or unfair is the U.S. mortgage market? We show that the answer crucially depends on one's definition of fairness. By contrasting six competing definitions, we offer a comprehensive view of fairness in this market.

Mortgage Markets

Working Paper

Paying Too Much? Borrower Sophistication and Overpayment in the U.S. Mortgage Market

Neil Bhutta,
Andreas Fuster &
Aurel Hizmo

June 2024

WP 24-11 – Comparing mortgage rates that borrowers obtain to rates that lenders could offer for the same loan, the authors find that many homeowners significantly overpay for their mortgage, with overpayment varying across borrower types and with market interest rates.