PERF/DOC: Option to .info() and .memory_usage() to provide for deep introspection of memory consumption #11595 #11596

jreback · 2015-11-13T18:23:01Z

closes #11595

jreback · 2015-11-13T18:23:13Z

cc @mrocklin

mrocklin · 2015-11-13T19:40:12Z

Does this descend into categories and the index?

jreback · 2015-11-13T20:03:09Z

wondering if you were going to ask that....it DOES do the index. not the categories, but I can fix this ......

jreback · 2015-11-13T20:32:55Z

Now includes embedded usage for Index & Categorical

In [5]: df = DataFrame({'A' : ['foo']*1000}) In [6]: df['B'] = df['A'].astype('category') In [8]: df.info() <class 'pandas.core.frame.DataFrame'> Int64Index: 1000 entries, 0 to 999 Data columns (total 2 columns): A 1000 non-null object B 1000 non-null category dtypes: category(1), object(1) memory usage: 16.6+ KB In [9]: df.info(deep=True) <class 'pandas.core.frame.DataFrame'> Int64Index: 1000 entries, 0 to 999 Data columns (total 2 columns): A 1000 non-null object B 1000 non-null category dtypes: category(1), object(1) memory usage: 55.7 KB In [11]: df.memory_usage() Out[11]: A 8000 B 1008 dtype: int64 In [12]: df.memory_usage(deep=True) Out[12]: A 48000 B 1048 dtype: int64

jreback · 2015-11-13T20:34:21Z

And providing on Series as well

In [6]: df['A'].memory_usage() Out[6]: 8000 In [7]: df['A'].memory_usage(index=True) Out[7]: 16000 In [8]: df['A'].memory_usage(index=True,deep=True) Out[8]: 56000

mrocklin · 2015-11-13T20:37:01Z

BTW, I'm glad that memory_usage_of_objects is usable on numpy arrays as well. I may end up using that outside of pandas.

jreback · 2015-11-13T20:39:31Z

right dask.array could certainly introspect here as well

max-sixty · 2015-11-13T20:46:37Z

I'll wait until this is merged before adding __getsize__.
Is there a reason index is False by default? I'd have thought that would be 'part of the package'.

jreback · 2015-11-13T20:50:02Z

@MaximilianR I don't recall the discussion, but I think we should change the default. Note that this is just for a direct call to memory_usage and not for .info where it is included.

why don't you post an issue and we'll change in 0.18 (as its a small API change).

…ntrospection of memory consumption, pandas-dev#11595

PERF/DOC: Option to .info() and .memory_usage() to provide for deep introspection of memory consumption #11595

jreback added Output-Formatting __repr__ of pandas objects, to_string API Design labels Nov 13, 2015

jreback added this to the 0.17.1 milestone Nov 13, 2015

jreback force-pushed the memory branch from a1d3fd1 to 0025b1c Compare November 13, 2015 20:32

max-sixty mentioned this pull request Nov 13, 2015

Change .memory_usage() index default to True #11597

Closed

jreback force-pushed the memory branch from 0025b1c to 7d0f11a Compare November 13, 2015 20:59

PERF/DOC: Option to .info() and .memory_usage() to provide for deep i…

89cad6b

…ntrospection of memory consumption, pandas-dev#11595

jreback force-pushed the memory branch from 7d0f11a to 89cad6b Compare November 13, 2015 21:34

jreback added a commit that referenced this pull request Nov 13, 2015

Merge pull request #11596 from jreback/memory

ddd0372

PERF/DOC: Option to .info() and .memory_usage() to provide for deep introspection of memory consumption #11595

jreback merged commit ddd0372 into pandas-dev:master Nov 13, 2015

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

PERF/DOC: Option to .info() and .memory_usage() to provide for deep introspection of memory consumption #11595 #11596

PERF/DOC: Option to .info() and .memory_usage() to provide for deep introspection of memory consumption #11595 #11596

Uh oh!

jreback commented Nov 13, 2015

jreback commented Nov 13, 2015

mrocklin commented Nov 13, 2015

jreback commented Nov 13, 2015

jreback commented Nov 13, 2015

jreback commented Nov 13, 2015

mrocklin commented Nov 13, 2015

jreback commented Nov 13, 2015

max-sixty commented Nov 13, 2015

jreback commented Nov 13, 2015

Labels

3 participants

Uh oh!

PERF/DOC: Option to .info() and .memory_usage() to provide for deep introspection of memory consumption #11595 #11596

PERF/DOC: Option to .info() and .memory_usage() to provide for deep introspection of memory consumption #11595 #11596

Uh oh!

Conversation

jreback commented Nov 13, 2015

jreback commented Nov 13, 2015

mrocklin commented Nov 13, 2015

jreback commented Nov 13, 2015

jreback commented Nov 13, 2015

jreback commented Nov 13, 2015

mrocklin commented Nov 13, 2015

jreback commented Nov 13, 2015

max-sixty commented Nov 13, 2015

jreback commented Nov 13, 2015

Labels

3 participants