0
$\begingroup$

I am currently working on a sentiment study, which means I want to investigate the influence of sentiment on stock prices and trading volume. Regarding stock prices is relatively simple, I calculate the return, expressed as a percentage. Since it is an intraday data set I calculate this return for a period over 5 or 10 minutes for example. Since the return is expressed as a percentage, I can compare very well across different companies.
It gets more complicated with the trading volume, as this is expressed in an absolute number (for example 14238 stocks per minute). Since companies issue a different number of shares it is not possible to compare them. Even if you look at just one company, the number of shares traded per minute varies greatly. Therefore, I would like to ask how to transform the trading volume so that I can 1.) detect abnormal intraday movements well and 2.) also compare well between several companies. My data set looks like this:

 TIMESTAMP HIGH LOW OPEN CLOSE COUNT VOLUME <dttm> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl> 1 2021-11-08 07:51:00 1970 1942 1942 1952. 9 14238 2 2021-11-08 07:52:00 1970 1946. 1970 1946. 11 33902 3 2021-11-08 07:53:00 1946. 1946. 1946. 1946. 7 25574 4 2021-11-08 07:54:00 1945 1935. 1943 1935. 4 15546 5 2021-11-08 07:55:00 1933. 1920 1933. 1920 4 18044 6 2021-11-08 07:56:00 1916 1916 1916 1916 6 26496 7 2021-11-08 07:57:00 1916 1907 1907 1916 12 69053 8 2021-11-08 07:58:00 1916 1903. 1916 1903. 16 111587 9 2021-11-08 07:59:00 1911. 1903. 1903. 1911. 23 189042 10 2021-11-08 08:00:00 1910 1903. 1910 1909. 98 1228126 

Many thanks in advance!

(And can someone tell me what the variable COUNT means?)

$\endgroup$
5
  • $\begingroup$ percentage relative to total amount of shares in circulation? or volume value in the currency you prefer ? $\endgroup$ Commented Nov 18, 2021 at 10:43
  • $\begingroup$ Possibly COUNT is the number of trades which took place during the 1 minute interval. Then the ratio VOLUME/COUNT would be the average size of those trades. (Just my guess). What market is this BTW? $\endgroup$ Commented Nov 18, 2021 at 16:33
  • $\begingroup$ What do you mean with BTW? @noob2 $\endgroup$ Commented Nov 18, 2021 at 17:49
  • $\begingroup$ BTW = "by the way". I am just curious where this data comes from. $\endgroup$ Commented Nov 18, 2021 at 19:58
  • $\begingroup$ It is intraday stock data and comes from Reuters Refinitiv. $\endgroup$ Commented Nov 18, 2021 at 20:20

1 Answer 1

3
$\begingroup$

Intraday volumes are known to have a seasonality:

  • at the start of the day you encounter more volume
  • similarly at the end of the day.

In Europe there is a burst in volume 1h before the issue of US macro news: enter image description here

This seasonality is not only present for volume, but for all liquidity-related variables:

enter image description here

(screenshots from Market Microstructure in Practice, 2nd Edition, by L and Laruelle)

You first question is about how to take into account the relative volumes from one stock (or instrument) to another:

  • the usual practice is to renormalise by the "recent traded volume" $\bar V$, for instance a moving average of one to four months
  • this renormalisation has to be done in the log-space, since volumes are a positive variable (ie $V\mapsto \log(V/\bar V)$

Few remarks

  • it is also known that the intraday seasonality if made of two effect:
    • one "daily volume" effect
    • one "intraday proportion" effect
  • hence the natural way to model the volume during a "slice" $V(d, t)$ from $t-\Delta t$ to $t$ on day $d$ is to write it as $V(d, t) = V(d) \times \rho(t)$ where $V(d)$ is the volume of this day (yes it is not a causal --aka adapted-- formula) and $\rho(t)$ is the fraction of the daily volume from $t-\Delta t$ to $t$

You should hence probably have two estimators (for renormalisation proposes)

  1. one for the daily volume
  2. one for the "usual fraction of the day" from $t-\Delta t$ to $t$

Since daily volumes are, like daily volatility, subject to clustering, you may try a "GRACH-like model" for the daily volume. In fact you can use a VARlike model to jointly capture the 4 variables of interest (see Endogeneous Dynamics of Intraday Liquidity, by Bińkowski and L)

Now let me ask a question: is this seasonality more due to optimal trading practices or to reaction to news?

  • probably not all is due to news since it seems very consistent from one day to another
  • nevertheless the peak on European volume has to be seen as an influence of US macro news...

So how to renormalise, in the context of a study on News, without mixing endogenous and heterogenous information, is an open question...

$\endgroup$

Start asking to get answers

Find the answer to your question by asking.

Ask question

Explore related questions

See similar questions with these tags.