How to normalize a JSON file into a Pandas dataframe

Question

I have a JSON file named stocks.json that looks as follows (note the lack of square brackets in the source file):

{"MSFT": {"exchange": "Nasdaq", "price": 275.79}, "FB": {"exchange": "Nasdaq", "price": 320.22}, "TSLA": {"exchange": "Nasdaq", "price": 990.83}, "GE": {"exchange": "Nasdaq", "price": 83.20}}

I would like to transform this data into a Pandas dataframe that looks as follows:

symbol exchange price MSFT Nasdaq 275.79 FB Nasdaq 320.22 TSLA Nasdaq 990.83 GE NYSE 83.20

My attempt is:

import pandas as pd stock_data = pd.read_json('stocks.json', lines=True) stock_data_normalized = pd.json_normalize(stock_data)

Unfortunately, I get the following when calling stock_data_normalized:

0 1 2 3

Any assistance would be most appreciated. Thanks!

user17242583 · Accepted Answer · 2021-12-23 22:22:15Z

3

You can just using the pd.DataFrame() constructor, and then transpose and reset the index:

df = pd.DataFrame(d).T.reset_index().rename({'index': 'symbol'}, axis=1)

Output:

>>> df symbol exchange price 0 MSFT Nasdaq 275.79 1 FB Nasdaq 320.22 2 TSLA Nasdaq 990.83 3 GE Nasdaq 83.2

answered Dec 23, 2021 at 22:22

user17242583

Sign up to request clarification or add additional context in comments.

8 Comments

equanimity Over a year ago

This solution results in a column named "0" with elements that look like: {"exchange": "Nasdaq", "price": 275.79}. How would I "unpack" that dictionary into separate columns?

user17242583 Over a year ago

Really? What data are you exactly using? It must be different from what you're showing. Also, what's your pandas version?

equanimity Over a year ago

Yes, the structure of the data in the source file is EXACTLY the same as what I provided. The Pandas version is 1.3.4. Did you apply your solution on the structure of the data I provided?

user17242583 Over a year ago

Yes, I copied all your json, put d = before it, and ran it, like this: d = {"MSFT": {"exchange": "Nasdaq", "price": ...

user17242583 Over a year ago

Maybe your data is nested in another object or array or something?

|

Collectives™ on Stack Overflow

How to normalize a JSON file into a Pandas dataframe

1 Answer 1

8 Comments

Hot Network Questions

Collectives™ on Stack Overflow

1 Answer 1

8 Comments

Related