select rows in sql with latest date for each ID repeated multiple times [duplicate]

Question

I have a table where each ID is repeated 3 times. there is a date in front of each id in each row.
I want to select entire row for each ID where date is latest. There are total 370 columns in this table i want all columns to get selected when i select that row.

Sample -

ID Name Date Marks .. .. .. 1 XY 4/3/2017 27 1 fv 4/3/2014 98 1 jk 4/3/2016 09 2 RF 4/12/2015 87 2 kk 4/3/2009 56 2 PP 4/3/2011 76 3 ee 4/3/2001 12 3 ppp 4/3/2003 09 3 lll 4/3/2011 23

The Answer should be

ID Name Date Marks .. .. .. 1 XY 4/3/2017 27 2 RF 4/12/2015 87 3 lll 4/3/2011 23

I am attempting as below -

select distinct ID,*,max(date) as maxdate from table

Also i am trying this in Hive . so not sure if some sql functions dont work in Hive

Thanks

similar question has been answered here- stackoverflow.com/questions/13523049/… — Rahul Sharma
– Rahul Sharma, Commented Jul 29, 2017 at 1:34

ecarlin · Accepted Answer · 2017-07-28 22:23:04Z

94

This question has been asked before. Please see this question.

Using the accepted answer and adapting it to your problem you get:

SELECT tt.* FROM myTable tt INNER JOIN (SELECT ID, MAX(Date) AS MaxDateTime FROM myTable GROUP BY ID) groupedtt ON tt.ID = groupedtt.ID AND tt.Date = groupedtt.MaxDateTime

edited Jul 28, 2017 at 22:23

answered Jul 28, 2017 at 20:28

ecarlin

1,2339 silver badges8 bronze badges

Sign up to request clarification or add additional context in comments.

3 Comments

Earthshaker Over a year ago

looks like in the last line it should be AND tt.Date = groupedtt.MaxDateTime .... Thanks

ecarlin Over a year ago

Sure thing. If you liked the answer please upvote and select as the accepted answer.

Harat Over a year ago

what if date comes from user and same or prevoius dates come???

Oto Shavadze · Accepted Answer · 2017-07-28 20:22:20Z

One way is:

select table.* from table join ( select ID, max(Date) as max_dt from table group by ID ) t on table.ID= t.ID and table.Date = t.max_dt

Note that if you have multiple equally higher dates for same ID, then you will get all those rows in result

yes ..exactly thats' how it is happening ... then i think i can just remove the duplicate rows from entire database ... then those multiple rows will go away and for each id there will be only one row.

Strawberry · Accepted Answer · 2017-07-28 20:50:26Z

7

You can do this with a Correlated Subquery (That is a subquery wherein you reference a field in the main query). In this case:

SELECT * FROM yourtable t1 WHERE date = (SELECT max(date) from yourtable WHERE id = t1.id)

Here we give the yourtable table an alias of t1 and then use that alias in the subquery grabbing the max(date) from the same table yourtable for that id.

edited Jul 28, 2017 at 20:50

Strawberry

34k14 gold badges43 silver badges57 bronze badges

answered Jul 28, 2017 at 20:23

JNevill

50.6k4 gold badges46 silver badges72 bronze badges

5 Comments

Yaroslav Dukal Over a year ago

this is a very slow way to do that.

JNevill Over a year ago

@MaksimKniazev I disagree. MySQL's execution plan will be similar to the explicit INNER JOIN on an aggregated subquery in this case.

saran3h Over a year ago

Takes forever when the data is huge.

john k Over a year ago

@YaroslavDukal as opposed to a JOIN?

Yaroslav Dukal Over a year ago

yes, join is more optimal

RichGoldMD · Accepted Answer · 2017-07-28 20:57:29Z

5

You can use a join to do this

SELECT t1.* from myTable t1 LEFT OUTER JOIN myTable t2 on t2.ID=t1.ID AND t2.`Date` > t1.`Date` WHERE t2.`Date` IS NULL;

Only rows which have the latest date for each ID with have a NULL join to t2.

edited Jul 28, 2017 at 20:57

answered Jul 28, 2017 at 20:23

RichGoldMD

1,2821 gold badge10 silver badges18 bronze badges

5 Comments

Strawberry Over a year ago

Old school. Nice.

Earthshaker Over a year ago

This is giving me error as Both left and right aliases encountered in JOIN 'date'

RichGoldMD Over a year ago

Is that the error mysql is reporting? I don't know if its the problem but Date is a reserver word so I've updated my answer with backticks.

RichGoldMD Over a year ago

I think that's a HIVE error - My code is intended for MySql

Lenard Bartha Over a year ago

LEFT OUTER JOIN is synonym to LEFT JOIN, its clearer that way I think to use the simpler terms...

Andrew · Accepted Answer · 2017-07-28 20:23:16Z

Here's one way. The inner query gets the max date for each id. Then you can join that back to your main table to get the rows that match.

select * from <your table> inner join (select id, max(<date col> as max_date) m where yourtable.id = m.id and yourtable.datecolumn = m.max_date)

milton · Accepted Answer · 2017-07-28 20:22:42Z

-2

Have you tried the following:

SELECT ID, COUNT(*), max(date) FROM table GROUP BY ID;

answered Jul 28, 2017 at 20:22

milton

571 bronze badge

1 Comment

JNevill Over a year ago

This will not return all of the fields in the table for the record with that id that has the max date.

Collectives™ on Stack Overflow

select rows in sql with latest date for each ID repeated multiple times [duplicate]

6 Answers 6

3 Comments

1 Comment

5 Comments

5 Comments

Comments

1 Comment

Linked

Hot Network Questions

Collectives™ on Stack Overflow

6 Answers 6

3 Comments

1 Comment

5 Comments

5 Comments

Comments

1 Comment

Linked

Related