What cross validation technique would you use on time series data set? Is it k-fold or LOOCV?
by Diamond (53,882 points) | 15 views

1 Answer


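I would use neither of the two methods as they stand, because a time series is ordered chronologically and that order has to be respected when splitting.

In time series problems, standard k-fold is not ideal because there might be some pattern in year 4 or 5 which is not present in year 3. Resampling the data set separates these trends, and we might end up training on later years and validating on past years, which is incorrect. LOOCV has the same problem, since every held-out point except the last is predicted by a model that has already seen later observations. Instead, we can use a forward chaining strategy with 5 folds as shown below: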

  • fold 1 : training [1], test [2]
  • fold 2 : training [1 2], test [3]
  • fold 3 : training [1 2 3], test [4]
  • fold 4 : training [1 2 3 4], test [5]
  • fold 5 : training [1 2 3 4 5], test [6]

where 1, 2, 3, 4, 5 and 6 each represent a “year”.
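
The folds listed above can be generated with a few lines of Python. This is only a minimal sketch: the helper name `forward_chaining_splits` and the `years` list are made up for illustration, and each "year" is treated as a single block of data.

```python
from typing import Iterator, List, Sequence, Tuple


def forward_chaining_splits(
    periods: Sequence[int],
) -> Iterator[Tuple[List[int], int]]:
    """Yield (training periods, test period) pairs in chronological order.

    Hypothetical helper: every fold trains on all periods before the
    test period, so the model never sees the future.
    """
    for i in range(1, len(periods)):
        yield list(periods[:i]), periods[i]


# Assumed example data: six consecutive years, already sorted by time.
years = [1, 2, 3, 4, 5, 6]
folds = list(forward_chaining_splits(years))

for k, (train, test) in enumerate(folds, start=1):
    print(f"fold {k}: training {train}, test [{test}]")
```

If you work with scikit-learn, `TimeSeriesSplit` in `sklearn.model_selection` produces expanding-window splits of the same kind on row indices, so you may not need to write the helper yourself.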

by Wooden (3,542 points)
