Python's equivalent for R's dput() function

Question

Is there any function in python similar to dput() function in R?

Perhaps the pickle module?

BrenBarn
– BrenBarn

2014-03-15 02:47:21 +00:00
Commented Mar 15, 2014 at 2:47 — BrenBarn
– BrenBarn, Commented Mar 15, 2014 at 2:47
stackoverflow.com/a/41189949/850781

sds
– sds

2018-07-30 19:04:57 +00:00
Commented Jul 30, 2018 at 19:04 — sds
– sds, Commented Jul 30, 2018 at 19:04

PatrickT · Accepted Answer · 2024-09-03 17:46:20Z

31

for a pandas.DataFrame, print(df.to_dict()), as shown here and detailed in the manual.

And back again with df = pandas.DataFrame.from_dict(data_as_dict)

The default output style is 'orient=dict', but if you prefer 'orient=list', then:

print(df.to_dict('list'))

edited Sep 3, 2024 at 17:46

answered Apr 20, 2018 at 5:12

PatrickT

10.6k9 gold badges83 silver badges117 bronze badges

Sign up to request clarification or add additional context in comments.

3 Comments

Andrew Brēza Over a year ago

Great answer! This is exactly what I was looking for.

Álvaro A. Gutiérrez-Vargas Over a year ago

@PatrickT, do you know how to do that with other objects such as dictionaries?

PatrickT Over a year ago

If I understand your question, print(d) will do that. You can also output the keys and the values separately with d.keys() and d.values(). Maybe your question is more involved? Look at this perhaps: stackoverflow.com/questions/3229419/…

Christian Aichinger · Accepted Answer · 2018-05-12 01:29:49Z

There are several options for serializing Python objects to files:

json.dump() stores the data in JSON format. It is very read- and editable, but can only store lists, dicts, strings, numbers, booleans, so no compound objects. You need to import json before to make the json module available.
pickle.dump() can store most objects.

Less common:

The shelve module stores multiple Python objects in a DBM database, mostly acting like a persistent dict.
marshal.dump(): Not sure when you'd ever need that.

As this is a beginner's question would you please clarify if it requires to import json or something similar. Also I tried it on a pandas.DataFrame and got dump() missing 1 required positional argument: 'fp' ...
Could you illustrate with an example, @ChristianAichinger? I agree with what @PatrickT said since I am getting the same error dump() missing 1 required positional argument: 'fp'

JonasV · Accepted Answer · 2021-04-19 07:23:09Z

How no one has mentioned repr() yet is a mystery to me. repr() does almost exactly what R's dput() does. Here's a few examples:

>>> a = np.arange(10) >>> repr(a) 'array([0, 1, 2, 3, 4, 5, 6, 7, 8, 9])' >>> d = dict(x=1, y=2) >>> repr(d) "{'x': 1, 'y': 2}" >>> b = range(10) >>> repr(b) 'range(0, 10)'

It is still inferior to dput because it does not keep the data type of the columns :/

KenHBS · Accepted Answer · 2018-08-14 08:25:33Z

This answer focuses on json.dump() and json.dumps() and how to use them with numpy arrays. If you try, Python will hit you with an error saying that ndarrays are not JSON serializable:

import numpy as np import json a = np.array([[1, 2, 3], [4, 5, 6], [7, 8, 9]]) json.dumps(a)

TypeError: Object of type 'ndarray' is not JSON serializable

You can avoid this by translating it to a list first. See below for two working examples:

`json.dumps()`

json.dumps() seems to be the closest to R's dput() since it allows you to copy-paste the result straight from the console:

json.dumps(a.tolist()) # '[[1, 2, 3], [4, 5, 6], [7, 8, 9]]'

`json.dump()`

json.dump() is not the same as dput() but it's still very useful. json.dump() will encode your object to a json file.

# Encode: savehere = open('file_location.json', 'w') json.dump(a.tolist(), savehere)

which you can then decode elsewhere:

# Decode: b = open('file_location.json', 'r').read() # b is '[[1, 2, 3], [4, 5, 6], [7, 8, 9]]' c = json.loads(b)

Then you can transform it back a numpy array again:

c = np.array(c)

More information

on avoiding the 'not serializable' error see:

numpy array is not json serializable
how to make classes json serializable (kind of unrelated, but very interesting)

Thanks, what would be the correct parameter to obtain matrices formatted with one line per row in the matrix, as the standard numpy.array output? I tried to pass the indent and separators parameters to json.dumps without success.

Hack-R · Accepted Answer · 2017-07-27 13:55:40Z

0

IMO, json.dumps() (note the s) is even better since it returns a string, as opposed to json.dump() which requires you to write to a file.

edited Jul 27, 2017 at 13:55

Hack-R

23.5k15 gold badges82 silver badges138 bronze badges

answered Dec 11, 2016 at 9:47

Jas

83413 silver badges23 bronze badges

1 Comment

Hack-R Over a year ago

Could you provide more details on how to use this?

Collectives™ on Stack Overflow

Python's equivalent for R's dput() function

5 Answers 5

3 Comments

2 Comments

2 Comments

`json.dumps()`

`json.dump()`

More information

1 Comment

1 Comment

Linked

Hot Network Questions

Collectives™ on Stack Overflow

5 Answers 5

3 Comments

2 Comments

2 Comments

json.dumps()

json.dump()

More information

1 Comment

1 Comment

Linked

Related

`json.dumps()`

`json.dump()`