A Quantitative Approach to Private Equity Fund Selection

A Quantitative Approach to Private Equity Fund Selection

Key Points

  • Drawing on decades of private equity market experience, we have developed a machine-driven scoring tool to substantiate our human decision-making in the fund selection process.
  • The scores produced can be turned into investment decisions through an economic cost/benefit analysis to create a framework that can be calibrated according to business needs.
  • The tool should allow the investment team to screen a larger pool of potential opportunities and focus more on value-creating activities.


Private equity (PE) has long been immune to the radical, technology-driven changes that other asset classes have experienced over the last decade, due to limited transparency, a lack of consistent data and longer investment horizons. In fact, the classical PE investment selection approach, which is heavily dependent on human judgement, has hardly changed since the 1980s.

Until now, this approach has allowed investors to obtain returns that have consistently exceeded other asset classes, despite its time-consuming nature and associated costs. However, as the PE industry reaches maturity and faces growing scrutiny from investors in terms of fees and returns, it is becoming increasingly important for investors to make accurate, high conviction decisions in a fast and efficient manner. For this reason, at Unigestion, we believe that it is essential to integrate new technologies into the investment decision-making process, to be at the forefront of innovation in the industry and to anticipate disruptive changes.

Artificial intelligence is reshaping the financial world by changing the way investments are analysed and selected. Ample research has been carried out in more traditional fields of finance, like public equities, to assess if machine learning can enhance the predictive power of existing models. The work of Gu et al. [2019] on forecasting public equity risk premia goes in this direction and has paved the way for further research to be carried out for more illiquid asset classes, such as private equity.

Complementing Human Judgement with Machine-Powered Analysis

At Unigestion, we are able to integrate the due diligence experience that we have gained over the last few decades in determining which fund characteristics are the most important drivers of superior risk-adjusted returns into a machine-driven analysis of a wide range of features. Drawing on our experience, we have developed a machine learning-based scoring tool to predict which funds will be successful, based on the information available to prospective investors during fundraising. By combining PE fund performance data from Preqin with other sources, we select the most relevant criteria, including the investment strategy, team composition, market conditions, strategy execution and performance track record.

The model has been set up to follow the binary classification approach, which determines the probability that a fund’s performance will exceed a pre-defined hurdle rate. Thus, the output of each algorithm will be a score between 0 and 1, representing such a probability. In this way, the model output can be easily turned into an investment recommendation.

We have calibrated the model through a wide selection of machine learning algorithms to obtain the best out-of-sample performance. As a metric to assess performance, we use the area under the ROC curve (AUC), which represents the probability that a randomly chosen successful fund (IRR above the hurdle rate) is attributed a higher score than a randomly chosen unsuccessful fund. In these terms, an AUC of 0.5 is equivalent to flipping a coin. Thus, the closer the AUC is to 1, the better the discriminating power of the model.

Our research has shown some promising model-based predictive power for fund performance and we believe this could be enhanced further in future by incorporating proprietary performance data.

Figure 1: Out-of-Sample AUC for Various Machine Learning Models

Figure 1 Out-of-Sample AUC for Various Machine Learning Models
Source: Unigestion, based on Prequin data as at 31 December 2018

For each algorithm, we calibrate the model twice, once using as target the funds that ultimately have performed better than the hurdle rate, and once with those that ultimately underperformed. The final score is calculated as the average of the two models. This makes the model more robust and allows us to give different weights to the analysed features in the prediction of successful and unsuccessful funds.

Figure 1 shows the performance in terms of AUC of the various machine learning algorithms1 considered and compares it with the simpler linear OLS classifier. All the algorithms show a forecasting power significantly superior to flipping a coin (AUC larger than 0.5), but surprisingly, at the current stage, non-linearities captured by these machine learning models do not explain a significantly larger portion of the PE fund performance. This might be due to the insufficient granularity of commercial datasets, which could be partially mitigated in future implementations by incorporating our proprietary data, covering more than 20 years of PE fund performance.

Figure 2 shows the ROC curve for the random forest classifier, the best-performing algorithm, and compares it to the straight line, which corresponds to flipping a coin.

Figure 2: ROC Curve of Random Forest Classifier

Figure 2 ROC Curve of Random Forest Classifier
Source: Unigestion, based on Prequin data as at 31 December 2018

The performance of the classification models considered so far can also be assessed through the confusion matrix, a table that shows the proportion of instances that were correctly/incorrectly classified for both successful and unsuccessful funds. Figure 3 shows the confusion matrix for the out-of-sample performance of the random forest classifier, the algorithm that performed best in terms of AUC.

Figure 3: Normalised Confusion Matrix for the Random Forest Classifier and Benefit/Cost Matrix

Figure 3: Normalised Confusion Matrix for the Random Forest Classifier and Benefit/Cost Matrix
Source: Unigestion, based on Prequin data as at 31 December 2018

The scores produced by the algorithms considered so far can be turned into investment decisions through an economic cost/benefit analysis, based on the combination of the confusion matrix and a cost-benefit matrix. The latter is a four-entry matrix that associates a cost or a benefit to each of the four possible outcomes of the prediction problem. Interpreting the entries of the confusion matrix as probabilities, we can compute the expected value of the total gains/losses, given a certain choice of the threshold on the score, above which the fund is considered successful. In this way, we obtain an expected value framework that can be calibrated according to business needs. This will be subject to further research.

Integrating an initial machine-powered screening of potential investment opportunities increases the investment team’s processing power and should enable them to ‘turn over more stones’ in the dealflow.

Enhancing Returns through Collaborative Intelligence

Integrating an initial machine-powered screening of potential investment opportunities increases the investment team’s processing power and should enable them to ‘turn over more stones’ in the dealflow. Analysing a larger pool of investment opportunities increases the probability of identifying and investing in ‘hidden gems’, which can have a positive impact on overall portfolio returns.

Even just focusing on the deselection of funds with a low probability of achieving the hurdle should enhance overall portfolio returns by improving the average performance. It should also boost the investment team’s efficiency by allowing them to focus on more value-creating activities, while also lowering abort costs (i.e. sunk costs where a potential investment is dropped at a later stage in the due diligence process).

In particular, quantitative scoring has relevance for the later funds raised by established managers. First and second funds of emerging managers will still require a heavy dose of experienced human judgement. As well as lacking a track record, emerging managers need to be chosen on more intangible evidence such as specific skills, team chemistry and well-balanced incentives.

In addition to the machine learning-based scoring tool, a cost-benefit analysis could be very practical in the investment decision-making process. If the current risk assessment during the investment evaluation process is complemented by actual value attribution, the entire process can benefit from significant efficiency gains. The investment team can better allocate time across various projects and work streams, and the investment committees will have more tangible grounds for more consistent and transparent investment decision-making.

What Next: A Systematic Investment Strategy for Private Equity?

Screening and initial evaluation of opportunities is just the beginning. The power of AI technology can be applied to the entire investment cycle, from deal origination to value creation. This will disrupt the conservative private equity industry – indeed, the first attempts have been already observed in the market.

As data science resources and tools are becoming more available and increasingly powerful, we are likely to see the emergence of new business models and strategies in the industry. We believe factor-based systematic strategies will become an integral part of PE investing.

The choice of the right factors and access to sufficient volumes of historical data are crucial for the identification and testing of data-driven models. Unigestion is well-positioned to advance its quantitative research and continue its journey towards a digitalised PE world by leveraging its 20+ years of extensive PE experience and data accumulated over this time.


1Athey, Susan and Imbens, Guido W. Machine Learning Methods that Economists Should Know About (August 2019). Annual Review of Economics, Vol. 11, pp. 685-725, 2019. Available at SSRN: https://ssrn.com/abstract=3445877

Shihao Gu, Bryan Kelly, and Dacheng Xiu. Empirical Asset Pricing via Machine Learning. Chicago Booth Research Paper No. 18-04; 31st Australasian Finance and Banking Conference 2018; Yale ICF Working Paper No. 2018-09, 23, 2018. Available at SSRN: https://ssrn.com/abstract=3159577

Steven N. Kaplan and Antoinette Schoar. Private Equity Performance: Returns, Persistence and Capital Flows. The Journal of Finance, 60(4):1791-1823, 2005. Available at: https://onlinelibrary.wiley.com/doi/full/10.1111/j.1540-6261.2005.00780.x


Find out more about how we use collaborative intelligence at Unigestion

Read more…

Important Information

This document is provided to you on a confidential basis and must not be distributed, published, reproduced or disclosed, in whole or part, to any other person.

The information and data presented in this document may discuss general market activity or industry trends but is not intended to be relied upon as a forecast, research or investment advice. It is not a financial promotion and represents no offer, solicitation or recommendation of any kind, to invest in the strategies or in the investment vehicles it refers to. Some of the investment strategies described or alluded to herein may be construed as high risk and not readily realisable investments, which may experience substantial and sudden losses including total loss of investment.

The investment views, economic and market opinions or analysis expressed in this document present Unigestion’s judgement as at the date of publication without regard to the date on which you may access the information. There is no guarantee that these views and opinions expressed will be correct nor do they purport to be a complete description of the securities, markets and developments referred to in it. All information provided here is subject to change without notice. To the extent that this report contains statements about the future, such statements are forward-looking and subject to a number of risks and uncertainties, including, but not limited to, the impact of competitive products, market acceptance risks and other risks.

Data and graphical information herein are for information only and may have been derived from third party sources. Although we believe that the information obtained from public and third party sources to be reliable, we have not independently verified it and we therefore cannot guarantee its accuracy or completeness. As a result, no representation or warranty, expressed or implied, is or will be made by Unigestion in this respect and no responsibility or liability is or will be accepted. Unless otherwise stated, source is Unigestion.

Past performance is not a guide to future performance. All investments contain risks, including total loss for the investor.

Unigestion SA is authorised and regulated by the Swiss Financial Market Supervisory Authority (FINMA). Unigestion (UK) Ltd. is authorised and regulated by the UK Financial Conduct Authority (FCA) and is registered with the Securities and Exchange Commission (SEC). Unigestion Asset Management (France) S.A. is authorised and regulated by the French “Autorité des Marchés Financiers” (AMF). Unigestion Asset Management (Canada) Inc., with offices in Toronto and Montreal, is registered as a portfolio manager and/or exempt market dealer in nine provinces across Canada and also as an investment fund manager in Ontario and Quebec. Its principal regulator is the Ontario Securities Commission (OSC). Unigestion Asia Pte Limited is authorised and regulated by the Monetary Authority of Singapore (MAS). Unigestion Asset Management (Copenhagen) is co-regulated by the “Autorité des Marchés Financiers” (AMF) and the “ Danish Financial Supervisory Authority” (DFSA). Unigestion Asset Management (Düsseldorf) SA is co-regulated by the “Autorité des Marchés Financiers” (AMF) and the “Bundesanstalt für Finanzdienstleistungsaufsicht” (BAFIN).