Could anyone help me out with proper import settings for an excel file with a following kind of structure (for time series analysis):
label1 varName label2 random text label3 random text2 2015-01-01 01:00 85 2015-01-01 02:00 26 2015-01-01 03:00 15 2015-01-01 04:00 13 2015-01-01 05:00 22 2015-01-01 06:00 21 2015-01-01 07:00 13 2015-01-01 08:00 22 2015-01-01 09:00 20 2015-01-01 10:00 31 2015-01-01 11:00 36 2015-01-01 12:00 33 2015-01-01 13:00 33 2015-01-01 14:00 33 label and varName are rows to keep. Rows 2 and 3 should be deleted.
Database starts at Jan 1st 2015 1am and ends at 31dec 2015 11pm. For most days I have a value for each hour. There are some NAs inside values but nrow=8760
I'm still learning how to do time series in R, but I'd imagine that it would be easier to handle summaries (say day by day means) if R would split %Y-%m-%d %H:%M column into two separate ones.
Simple import with RStudio default readxl library fails as rows 2 and 3 get imported and date is translated into a funny format: 42005.041666666664
Normally I would deal with this by hand (in excel) and import a clean txt to R. Problem is I need to process 61 similar files (for different years and different variables). I'm sure there is a way to automate this task, but after 6hours of searching, testing and reading I'm basically in the same spot as this morning.
I'd appreciate any kind of hint or help. Thank you

read_excel(path = file_name, col_names = c('label1','varname'), col_types = c('date', 'numeric'), skip = 3). If the files don't have the same column names, then you can get the columns in each file and then skip 3 rows.xlsx. The only problem with this package is that it relies on a working installation of therJavaandxlsxjarspackages, which are really hard to set up properly. But if you have it installed, then there is very much you can do with itsread.xlsxfunction.