Skip to content

http://www.market-topology.com/

4 messages · Spencer Graves, Yuri Volchik, Krishna Kumar

#
Hi,

was wondering what algorithm they are using and if it is available in R.
Using my own judgment it looks like minimum spanning tree algorithm and in R
it's available in the packages igraph and vegan, is it correct?

Thanks
#
The text on 'www.market-topology.com' includes the following:  
"The engine developed by Market Topology SPRL emphasizes the relevant 
correlations which connect the equities. ... In addition, a procedure 
selects the highest coefficients for placing them in a connected graph 
which is a tree." 

      This suggests they are doing some sort of cluster analysis.  
RSiteSearch('cluster analysis', 'fun') just produced 314 hits for me.  I 
rarely use cluster analysis, but I would guess that the most popular 
methods would include hclust{stats}, agnes{cluster}, and Mclust{mclust}. 

      Note, however, that many cluster analysis methods are similar to 
reading tea leaves (http://en.wikipedia.org/wiki/Tasseography) in the 
sense that they will find relationships, independent of whether there 
are relationships to find.  Mclust is model based, which suggests to me 
that it may be less subject to "false positives" than other methods.  
However, you should not take my word for this;  perhaps someone with 
more experience with these methods will enlighten us. 

      A major problem is that you are looking for relationships among 
5,000 or so financial time series, possibly with fewer than 5,000 
observations.  To obtain a full rank estimate of a correlation matrix of 
that size, you need at least that many observations -- and much of the 
subtle structure on which this analysis depends would typically be 
poorly estimated.  Moreover, it would take roughly 20 years of daily log 
returns, for example, just to get a full rank estimate.  Just computing 
that matrix assumes that the relationships are all constant over that 
period of time. 

      It may still be useful, but one would need to understand its 
limitations. 

      Hope this helps. 
      Spencer
Yuri Volchik wrote:
25 days later
#
Hi Spencer,
thanks for your answer, i did look at those packages.
given your very informative post (i understand your point about possible
changes in correlation structure etc) how they can do what they do :) ,their
classification makes sense to me.

I presume one can use intraday prices, say with 5 min intervals, thus in one
year would have around 20000 observations so it should be enough from a
computational viewpoint.

If there is any good link (book) on the subj, would appreciate it.

Thanks
#
Yuri Volchik wrote:
There is a reference to MST in the book by Stanley and Mantega,
http://www.amazon.com/exec/obidos/ASIN/0521620082/kriskumar-20

There is also a couple of papers on the web co-authored  by some folks 
with Mantega. Also you will find this fx reference useful
http://arxiv.org/abs/physics/0503014

Cheers
Krishna