start index at 1 for Pandas DataFrame [duplicate]

Question

I need the index to start at 1 rather than 0 when writing a Pandas DataFrame to CSV.

Here's an example:

In [1]: import pandas as pd
In [2]: result = pd.DataFrame({'Count': [83, 19, 20]})
In [3]: result.to_csv('result.csv', index_label='Event_id')

Which produces the following output:

In [4]: !cat result.csv
Event_id,Count
0,83
1,19
2,20

But my desired output is this:

In [5]: !cat result2.csv
Event_id,Count
1,83
2,19
3,20

I realize that this could be done by adding a sequence of integers shifted by 1 as a column to my data frame, but I'm new to Pandas and I'm wondering if a cleaner way exists.

alko · Accepted Answer · 2013-11-23 21:57:00Z

191

The index is an object, and the default index starts at 0:

>>> result.index
Int64Index([0, 1, 2], dtype=int64)

You can shift this index by 1 with

>>> result.index += 1
>>> result.index
Int64Index([1, 2, 3], dtype=int64)
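To tie the index shift back to the CSV output asked for in the question, here is a minimal sketch. It writes to an in-memory `io.StringIO` buffer instead of `result.csv` (an assumption made only so the snippet leaves no file behind); passing a filename works the same way.

```python
import io

import pandas as pd

result = pd.DataFrame({'Count': [83, 19, 20]})

# Shift the default 0-based index so that it starts at 1.
result.index += 1

# Write to an in-memory buffer; a path such as 'result.csv' could be used instead.
buffer = io.StringIO()
result.to_csv(buffer, index_label='Event_id')

csv_text = buffer.getvalue()
print(csv_text)
```

The printed text has the header `Event_id,Count` followed by rows labelled 1, 2 and 3.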


2 Comments

yourstruly Over a year ago

Somehow this changes the index name, so the proper order when naming the index is: df.index += 1; df.index.name = 'name'

Matt_Haythornthwaite Over a year ago

Caution when using this in an IPython kernel (such as Jupyter): don't run the cell containing this code more than once. It adds one to the index every time it runs, which will not produce the desired result.
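One way to sidestep the re-run problem described in the comment above is to assign a fresh index rather than increment the existing one. This is only a sketch: the helper name `reset_to_one_based` is hypothetical, and it assumes you want to discard whatever index (and index name) the frame already has.

```python
import pandas as pd

df = pd.DataFrame({'Count': [83, 19, 20]})


def reset_to_one_based(frame):
    """Replace the index with 1..len(frame); calling this repeatedly is harmless."""
    # Hypothetical helper: it overwrites the existing index instead of shifting it.
    frame.index = pd.RangeIndex(start=1, stop=len(frame) + 1)
    return frame


# Simulate running the same notebook cell twice.
reset_to_one_based(df)
reset_to_one_based(df)

print(list(df.index))
```

Because the index is rebuilt from the row count each time, running the cell again leaves the labels at 1, 2, 3 instead of shifting them further.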

Troll · Accepted Answer · 2022-09-13 02:32:49Z

41

Just set the index before writing to CSV.

df.index = np.arange(1, len(df) + 1)

And then write it normally.


answered Nov 23, 2013 at 21:54

TomAugspurger


3 Comments

Dung Over a year ago

where np is imported like so: import numpy as np

santhosh_dj Over a year ago

An efficient way: df.index = range(1, df.shape[0] + 1)

Loc Quan Over a year ago

even more efficient: df.index = pd.RangeIndex(1, len(df.index) + 1). My benchmark shows pd.RangeIndex() is 30% faster than range(), and 185% faster than np.arange() (index size ~100 000). And len(df.index) is most efficient for finding row count.
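The three variants mentioned in this thread (`np.arange`, `range`, `pd.RangeIndex`) produce the same labels. The sketch below only checks that equivalence and does not attempt to reproduce the timing figures quoted in the comment above; the variable names are illustrative.

```python
import numpy as np
import pandas as pd

df = pd.DataFrame({'Count': [83, 19, 20]})
n = len(df.index)

# Three ways to build the same 1-based labels.
from_arange = pd.Index(np.arange(1, n + 1))   # backed by a materialized NumPy array
from_range = pd.Index(range(1, n + 1))        # built from a Python range
from_rangeindex = pd.RangeIndex(1, n + 1)     # stores only start, stop and step

# Any of them can be assigned to the frame.
df.index = from_rangeindex

print(list(from_arange), list(from_range), list(from_rangeindex))
```

All three hold the labels 1, 2, 3; the difference lies in how the index is stored, not in the result written to CSV.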

Community · Accepted Answer · 2017-05-23 12:34:14Z

source: In Python pandas, start row index from 1 instead of zero without creating additional column

Working example:

import pandas as pdas

# input_file is the path to your CSV file
dframe = pdas.read_csv(input_file)
dframe.index = dframe.index + 1

What's the difference with the top one and three years later?

mosc9575 · Accepted Answer · 2022-05-19 05:41:22Z

In my opinion, best practice is to set the index with a RangeIndex:

import pandas as pd

result = pd.DataFrame(
    {'Count': [83, 19, 20]},
    index=pd.RangeIndex(start=1, stop=4, name='index')
)

>>> result
       Count
index
1         83
2         19
3         20

I prefer this, because you can define the range and a possible step and a name for the index in one line.
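As a sketch of the "range, step and name in one line" point: if the index is named `Event_id` (the header the question wants), `to_csv` can be called without `index_label`, since pandas uses the index name for the header when none is given. The `step=10` frame is purely illustrative, and the `io.StringIO` buffer stands in for a file.

```python
import io

import pandas as pd

# Name the index up front so to_csv needs no index_label argument.
result = pd.DataFrame(
    {'Count': [83, 19, 20]},
    index=pd.RangeIndex(start=1, stop=4, name='Event_id'),
)

buffer = io.StringIO()
result.to_csv(buffer)  # the index name becomes the header of the first column
lines = buffer.getvalue().splitlines()
print(lines)

# Illustrative only: the same constructor with a step other than 1.
stepped = pd.DataFrame(
    {'Count': [83, 19, 20]},
    index=pd.RangeIndex(start=10, stop=40, step=10, name='Event_id'),
)
print(list(stepped.index))
```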

Liu Yu · Accepted Answer · 2018-04-28 07:06:41Z

9

This worked for me

 df.index = np.arange(1, len(df)+1)


Imran · Accepted Answer · 2017-08-25 14:06:53Z

8

Another way in one line:

df.shift()[1:]


2 Comments

Armali Over a year ago

This drops the last row.

Prashant G Over a year ago

What a scary answer!
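A small check of what the comments describe, using the frame from the question: `shift()` moves the values down one row while the labels stay put, so slicing off the first row loses the last value and leaves only two rows.

```python
import pandas as pd

df = pd.DataFrame({'Count': [83, 19, 20]})

# shift() moves values down by one row; the index labels 0, 1, 2 are unchanged.
shifted = df.shift()[1:]

print(shifted)
print(len(df), len(shifted))
```

The result has labels 1 and 2 holding 83 and 19 (as floats, because `shift()` introduces a NaN), and the value 20 is gone, so this is not a safe way to renumber rows.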

Utku · Accepted Answer · 2018-11-23 11:00:06Z

You can use this one:

import pandas as pd

result = pd.DataFrame({'Count': [83, 19, 20]})
result.index += 1
print(result)

Or this one, which uses the NumPy library:

import pandas as pd
import numpy as np

result = pd.DataFrame({'Count': [83, 19, 20]})
result.index = np.arange(1, len(result)+1)
print(result)

np.arange creates a NumPy array of the integers in the half-open interval [1, len(result)+1), that is, 1 through len(result), which you then assign to result.index.

Matt_Haythornthwaite · Accepted Answer · 2023-02-20 11:29:39Z

Following on from TomAugspurger's answer, we could use a list comprehension rather than np.arange(), which removes the need to import numpy. You can use the following instead:

df.index = [i+1 for i in range(len(df))]

ivanleoncz · Accepted Answer · 2019-01-29 16:44:08Z

Building on the original answer, here are my two cents:

If I'm not mistaken, starting from version 0.23, the default index object is of type RangeIndex.

From the official doc:

RangeIndex is a memory-saving special case of Int64Index limited to representing monotonic ranges. Using RangeIndex may in some instances improve computing speed.

For a huge index range this makes sense: the index is represented by its start, stop and step rather than being materialized all at once, which saves memory.

Therefore, an example (using Series, but it applies to DataFrame also):

>>> import pandas as pd
>>>
>>> countries = ['China', 'India', 'USA']
>>> ds = pd.Series(countries)
>>>
>>> type(ds.index)
<class 'pandas.core.indexes.range.RangeIndex'>
>>> ds.index
RangeIndex(start=0, stop=3, step=1)
>>>
>>> ds.index += 1
>>>
>>> ds.index
RangeIndex(start=1, stop=4, step=1)
>>>
>>> ds
1    China
2    India
3      USA
dtype: object

As you can see, incrementing the index object changes its start and stop parameters.

Flair · Accepted Answer · 2021-11-10 08:29:26Z

0

This adds a column that accomplishes what you want

df.insert(0,"Column Name", np.arange(1,len(df)+1))
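If you go the extra-column route, the original index has to be suppressed when writing, otherwise the CSV ends up with both the 0-based index and the new column. A minimal sketch, assuming the column is called `Event_id` as in the question and writing to an in-memory buffer instead of a file:

```python
import io

import numpy as np
import pandas as pd

df = pd.DataFrame({'Count': [83, 19, 20]})

# Insert a 1-based counter as the first column.
df.insert(0, 'Event_id', np.arange(1, len(df) + 1))

# index=False keeps the original 0-based index out of the CSV.
buffer = io.StringIO()
df.to_csv(buffer, index=False)
lines = buffer.getvalue().splitlines()
print(lines)
```

This yields the same file contents as shifting the index, at the cost of carrying an extra column in the frame.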


answered Nov 9, 2021 at 23:00

Jen


prashantwitty · Accepted Answer · 2022-03-23 11:11:19Z

0

Add ".shift()[1:]" while creating a data frame

data = pd.read_csv(r"C:\Users\user\path\data.csv").shift()[1:]


1 Comment

kiradotee Over a year ago

I lost the last row by doing that
