Resources > Articles

How to Improve Your Models with Alternative Data Sources [+ 2 Examples]

Post Author
  • Pragmatic Institute is the transformational partner for today’s businesses, providing immediate impact through actionable and practical training for product, design and data teams. Our courses are taught by industry experts with decades of hands-on experience, and include a complete ecosystem of training, resources and community. This focus on dynamic instruction and continued learning has delivered impactful education to over 200,000 alumni worldwide over the last 30 years.

Alternative Data Sources

By Andres Gonzalez Casabianca

Picture this: You’ve been working hard on a project at work. You’ve run several algorithms, tuned the necessary hyperparameters, performed cross-validation and exhausted the checks required to ensure you’re not overfitting.

 

Yet, the performance metric isn’t where you would like it to be; or worse, isn’t where the business needs it to be. You take a hard look at your data science pipeline and don’t see any room for improvement.

 

What do you do? Go back to the source; specifically, go to an alternative source.

 

FinTechs working in the credit space differentiate themselves by their ability to muster alternative data sources and put them through their analytics pipeline. These companies aim to predict a person’s default probability, i.e. how likely they won’t pay their loan.

 

However, to get a competitive advantage from the established household names (e.g., Transunion, Equifax), they need to find uncharted information, clean it and finally, use it as input in their models.

 

Lenddo uses alternative data for credit scoring

 

Back in 2011, when social media was ramping up and people were creating their digital footprints, Jeff Stewart and Richard Eldridge founded Lenddo.

 

This fast-growing FinTech gathers data from social networks with the user’s authorization and analyzes over 12 thousand variables to create a score that represents the likelihood of default.

 

For example, Lenddo looks at how and with whom social media users interact, and the quality of their connections. Without getting too deep into the role of privacy in data science and the cleaning preprocessing, garnering this information is an excellent example of alternate data sources, the importance of data cleaning and optimization outside of the traditional parameters.

 

Branch is another startup that is thinking outside the box.

 

It operates mainly in Sub-Saharan Africa, focusing on financially underserved -and unserved- populations using alternative data sources to predict the likelihood of default.

 

Branch uses mobile data, ranging from cellphone battery charging patterns to SMS frequency and length, all gathered with the user’s consent. Branch cleans, crunches, and puts the information through its data science pipeline, transforming it into a credit score. This way, Branch has more input information for the machine learning algorithms and superior results against its competitors.

 

Here’s what Branch and Lenddo have in common.

Both FinTech companies mentioned above are built around financial prediction and data science, starting their pipelines by looking at unique, unmapped, and uncharted data.

 

However, in the era of Big Data, these data sets come with their own challenges, so a mix of technical knowledge and business understanding is key: data scientists must see the numbers and the colors.

 

Common problems that arise are:

  1. Over and under-representation
  2. Selection bias
  3. Uncleanable values
  4. Unwieldy data

 

These are the hidden costs of using alternative sources. Therefore, spend enough time understanding where the data is coming from and what that information is (beyond the numbers). If data scientists put garbage in, they will get garbage out. Companies need to adjust the pipeline for these biases to avoid erroneous and unactionable conclusions.

 

Next time you feel like you have hit a plateau, take a few steps back and ask yourself: What alternative source can I add to the pipeline? Whether it is from digital interactions, online preferences or other innovative source, alternative sources will help you improve the performance metric and will set you apart from the competition.

 

We saw how Lenddo and Branch use social networks and mobile patterns respectively to enhance their models and produce a novel credit score.

 

It does not matter what industry you work on, nor what type of challenge you are tackling, when the performance metric is off-target, go back and look for alternative data sources: there is always new and untapped data. Get creative, account for inherent biases in new data sets, and incorporate explainable metrics to evaluate your models.

 

Data Science for Business Leaders

 

Learn more about data strategies for your business in Pragmatic’s course: Data Science for Business Leaders  This course does a deep look into how  business leaders and data practitioners contribute at each stage of data projects to drive results that have a real impact on your organization

 

If you’re ready to gain powerful business insights with data, register today. 

 

 

Author

  • Pragmatic Institute is the transformational partner for today’s businesses, providing immediate impact through actionable and practical training for product, design and data teams. Our courses are taught by industry experts with decades of hands-on experience, and include a complete ecosystem of training, resources and community. This focus on dynamic instruction and continued learning has delivered impactful education to over 200,000 alumni worldwide over the last 30 years.

Author:

Other Resources in this Series

Most Recent

Woman Celebrating Great Customer Experience
Article

Create Data-Driven Emotional Connections with Customers to Drive Revenue

By leveraging data, businesses can create emotional connections with their customers that drive revenue and build loyalty. In this article, we'll explore some real-world examples of how data is being used to create emotional connections with customers. 
Category: Data Science
Analyzing electronic document
Article

The Future of Finance: How Data Analytics is Unlocking New Opportunities

According to a recent PwC report, “About 60% of respondents to the PwC and ACCA research believe self-service reporting and automation will free up business partner time, taking transactional and compliance responsibilities off their hands
Two hands engaged in a handshake
Article

How Data Partnerships Can Help Your Company Scale

In today’s business landscape, the role of data has become increasingly important. Companies are constantly seeking ways to collect, analyze, and utilize data to gain a competitive edge.  One way that companies are leveraging data
Category: Data Science
Working in Silos Can Be Harmful
Article

Breaking the Cycle: How to Deal with Data Hoarding in Your Organization

Data hoarding, also known as information hoarding, is a common issue that plagues many organizations. It occurs when team leads or other individuals in positions of power or knowledge withhold important information from their colleagues
Category: Data Science
Woman Examining Data Trends That Will Dominate 2023
Article

Looking Ahead: Data Trends That Will Dominate 2023

Data analytics has revolutionized business decision-making, unleashing a storm of new opportunities impacting every part of the business, from identifying customers to minimizing churn.
Category: Data Science

OTHER ArticleS

Woman Celebrating Great Customer Experience
Article

Create Data-Driven Emotional Connections with Customers to Drive Revenue

By leveraging data, businesses can create emotional connections with their customers that drive revenue and build loyalty. In this article, we'll explore some real-world examples of how data is being used to create emotional connections with customers. 
Category: Data Science
Analyzing electronic document
Article

The Future of Finance: How Data Analytics is Unlocking New Opportunities

According to a recent PwC report, “About 60% of respondents to the PwC and ACCA research believe self-service reporting and automation will free up business partner time, taking transactional and compliance responsibilities off their hands

Sign up to stay up to date on the latest industry best practices.

Sign up to received invites to upcoming webinars, updates on our recent podcast episodes and the latest on industry best practices.

Training on Your Schedule

Fill out the form today and our sales team will help you schedule your private Pragmatic training today.

Subscribe

Subscribe

Training on Your Schedule

Fill out the form today and our sales team will help you schedule your private Pragmatic training today.