Into the data mine's depths

Reading Time:3 minutes

Published: 12:00am, 23 Oct 2011

So you've lost money in the stock market lately? Join the club. Many people I know have lost between 10 per cent and 30 per cent of their portfolio since April. We did our research, tracked stock price movements, spotted trends, perused available information to try to predict which way the markets were heading. To no avail.

In the past six months, we've been buffeted by some of the greatest volatility in global financial history. During the week of August 4-11, for instance, the S&P 500 index recorded an average daily movement of 4.25 per cent (compared to only slightly above 0.5 per cent since 1950).

A key contributor to this extreme volatility is the dominance of computer-driven high-frequency trading that makes up more than 65 per cent of transactions in the stock market today. And key to such trading is something we all do: data mining. But while small investors like you and me do a primitive sort of data mining by Googling price-to-earning ratios, high-frequency traders apply sophisticated mathematical techniques to sift through colossal amounts of information. Buying and selling billions of dollars of stock in nanoseconds, such traders drive the markets, leaving ordinary investors at the mercy of the tsunami they generate.

Data mining is the wave of the future in many other fields too. As its name implies, data mining entails extracting hidden predictive information from massive databases to identify patterns and detect relationships. This is useful to governments, corporations, just about anyone. For example, if a telecoms company can map relationships between the age, sex, income, hobbies and education levels of a large population of mobile-phone users, it can use that to predict the potential size of a new market.

Since antiquity humans have made connections between natural phenomena, such as the changing of the seasons and the movement of planets, and activities in their daily lives, such as planting and harvesting of crops or picking auspicious dates for important events. Observing phenomena, interpreting information and establishing relationships and patterns were early attempts at what we call data mining today. Now, with powerful computers and ultra-high-capacity storage devices, immense amounts of complex information are collated and analysed, and highly sophisticated applications developed. Whenever you use an internet search engine, you are invoking its data-mining capabilities. For example, Googling my name, Tom Yam, will produce over 3.2 million results, albeit mostly spicy soup recipes and Thai restaurants. That is a lot of information mined from just two words.

Imagine, then, the capability of a vast complex of computer farms unceasingly monitoring and analysing all significant global events for high-frequency traders. This goes well beyond traditional analysis of stock price movements.