What does a quant need from alternative data?

What does a quant need from alternative data?

Quants like data, in the same way I like burgers. Quants like to consume data to come up with ways of forecasting the market. The most important dataset is usually market data. However, there are all sorts of other interesting datasets that can be used to help forecast those market prices. In recent years, the area of alternative data has exploded. There are many datasets now available from alternative data vendors. Obviously a quant wants a dataset which can make them money! But before that stage, the question is, if you're a quant, which datasets are worth investigating for alpha? We need a way of pruning down our massive list of alternative datasets before we go anywhere near a Python script! There simply isn't enough time to investigate every dataset for alpha thoroughly (we can do fairly standardised tests such as finding the correlation with price action etc. but these aren't necessarily going to tell us if there's a specific profitable trading rule we can apply). For alternative data vendors, what do they need to do, to make their datasets usable by quants? Hence, there are many questions we need to ask about a dataset first, we even want to investigate whether a dataset can provide us any insight into forecasting markets. Below we give a list of questions a quant might want to ask about a dataset, that they might want to use to find alpha.

Historical data for a dataset

One of the most important is the length of the dataset. If we want our dataset to forecast prices on a daily basis, and we only have a month of data, it isn't really going to be that useful. One month is simply not sufficient to gauge the robustness of a dataset. There is a balance however. Ideally, we'd like several years of data to do a historical backtest. There is a balance however, because, if the dataset does turn out to be valuable, do we really want to wait 10 years till there's enough historical data to use it? This is tricky!

Timeliness and point-in-time

When is the data available? If it takes many weeks on end, with a very large lag, it's likely to be more difficult to trade off it. Furthermore, is the dataset properly timestamped, so we know the time that it is released to users, not just when it is collected. This is crucial for trading....

To read the rest of the article on the Cuemacro website, please click here

Marvin Schneider

Strategy Consultant | Valuation Expert | Author | Speaker

6y

I agree Johan Vanderlugt that alternative datasets are also relevant to the non-quant population. We have developed the first alternative dataset of integrated metrics focused on long-term wealth creation achieved in ways that enhance the wellbeing of the wider community. Surely this is relevant to all institutional investors and fiduciaries that invest on behalf of pensioners and retirees, let alone fuel sustainable wealth creation in the real economy!

Jerckns Cruz

Superintendente Geral, Produtos de Tesouraria

6y

Saeed Amen very well writen, thanks for the article! In the name of the stratsphera community, can i kindly ask your permission to repost a translated version in our blog? All credita given and link to the original post. Many thanks!

Johan Vanderlugt

Researcher | Nature Strategy & Finance | Sustainable Investing

6y

Also relevant for the non-quant population out there!

Matthew Nelson

CEO, Founder, Technology Speaker and Data Management Specialist

6y

I have a unique data set which reflects risk emerging in company ecosystems (eg supply chains, parent companies, customers, etc). I’ve quantifies severities of the language identified and would like to speak to traders about alpha in my RISC Scores. Would love to discuss further!

Like
Reply

Mattia Giammarusto Quants like data in the same way I like burgers 😂

To view or add a comment, sign in

More articles by Saeed Amen

  • Trip notes from USA

    Over the past week and half I’ve been travelling in America, starting with Chicago, then Omaha and finally New York…

  • Berkshire Hathaway meeting 2025

    Go to a fancy London steakhouse, and there will be a chance that you will get served steak from Nebraska. I’ve been to…

    2 Comments
  • The dollar, yields and inflation

    Harold Wilson once said a week is a long time in politics. He might have had the foresight to be referring to…

    2 Comments
  • Neudata 2025 London conference

    I recently attended the Neudata conference on alternative data in London. I had last gone quite a few years ago, and I…

  • Learning from running financial models live

    Let's say you are the world's best burger chef (we all have ambitions, right). You'd be serving up all manner of…

    1 Comment
  • DeepSeek, objectives and constraints

    When a new burger joint opens up, there's often a buzz. Everyone (well, at least me) wants to try the new burger.

    1 Comment
  • Hundreds of quant papers from #QuantLinkADay in 2024

    I tweet a lot (from @saeedamenfx and at BlueSky at @saeedamenfx.bsky.

  • What we’ve learnt from reading thousands of Fed communications

    We recently had the last FOMC decision of 2024. Market l participants reacted to the hawkish tone including Powell’s…

    1 Comment
  • Takeaways from QuantMinds 2024 in London

    Over the past years, the quant industry has changed substantially. My first visit to Global Derivatives was just over a…

    12 Comments
  • Takeaways from Web Summit 2024

    Think of Lisbon and no doubt it’ll conjure images of explorers setting sail in centuries past across the ocean, the…

    1 Comment

Insights from the community

Others also viewed

Explore topics