SQL: How to properly check if a record exists

Question

While reading some SQL Tuning-related documentation, I found this:

SELECT COUNT(*) :

Counts the number of rows.
Often is improperly used to verify the existence of a record.

Is SELECT COUNT(*) really that bad?

What's the proper way to verify the existence of a record?

Martin Schapendonk · Accepted Answer · 2019-09-26 08:49:45Z

399

It's better to use either of the following:

-- Method 1. SELECT 1 FROM table_name WHERE unique_key = value; -- Method 2. SELECT COUNT(1) FROM table_name WHERE unique_key = value;

The first alternative should give you no result or one result, the second count should be zero or one.

How old is the documentation you're using? Although you've read good advice, most query optimizers in recent RDBMS's optimize SELECT COUNT(*) anyway, so while there is a difference in theory (and older databases), you shouldn't notice any difference in practice.

edited Sep 26, 2019 at 8:49

answered Nov 23, 2010 at 8:23

Martin Schapendonk

13.6k3 gold badges22 silver badges26 bronze badges

Sign up to request clarification or add additional context in comments.

5 Comments

Martin Schapendonk Over a year ago

I will clarify that I intended "unique key" with the "key = value" clause but other than that I'm still behind my answer.

Martin Ba Over a year ago

OK. With that premise indeed the query would return just one or zero record. BUT: The question does not limit to a unique column. Also: The 2nd query count(1) is equivalent to count(*) from a practical POV.

Martin Schapendonk Over a year ago

The question says "what's the proper way to verify the existence of A record". I interpreted that as singular, as in: 1 record. The difference between count(*) and count(1) is already covered by my answer. I prefer count(1) because it does not rely on a specific RDBMS implementation.

McSinyx Over a year ago

If unique_key is indexed, is it guaranteed that the statement will run strictly under linear time?

Martin Schapendonk Over a year ago

@McSinyx in theory that is correct, although it totally depends on the specific RDBMS implementation that you are running this on.

Pavel Morshenyuk · Accepted Answer · 2016-08-11 07:24:43Z

I would prefer not use Count function at all:

IF [NOT] EXISTS ( SELECT 1 FROM MyTable WHERE ... ) <do smth>

For example if you want to check if user exists before inserting it into the database the query can look like this:

IF NOT EXISTS ( SELECT 1 FROM Users WHERE FirstName = 'John' AND LastName = 'Smith' ) BEGIN INSERT INTO Users (FirstName, LastName) VALUES ('John', 'Smith') END

Generaly we use it (the verify) when want do something, then your answer is more complete.
I would expect the query planner to do a better job optimizing EXISTS than the other methods; it should know it doesn't have to scan the whole table.

Cătălin Pitiș · Accepted Answer · 2010-11-23 08:20:18Z

39

You can use:

SELECT 1 FROM MyTable WHERE <MyCondition>

If there is no record matching the condition, the resulted recordset is empty.

answered Nov 23, 2010 at 8:20

Cătălin Pitiș

14.4k2 gold badges41 silver badges63 bronze badges

4 Comments

Jacob Over a year ago

Did you mean TOP 1? -> (SELECT TOP 1 FROM MyTable WHERE <MyCondition>)

Cătălin Pitiș Over a year ago

No, I meant exactly "1"

eFloh Over a year ago

to enable the query optimizer to even knwo that you won't read/need the remaining datasets, you should state SELECT TOP 1 1 FROM... WHERE... (or use the appropriate query hints for your RDBS)

AquaAlex Over a year ago

The Exists operator itself tries to retrieve just the absolute minimum of information, so the addition of TOP 1 does nothing except add 5 characters to the query size. - sqlservercentral.com/blogs/sqlinthewild/2011/04/05/…

AAEM · Accepted Answer · 2018-03-06 03:02:34Z

27

You can use:

SELECT 1 FROM MyTable WHERE... LIMIT 1

Use select 1 to prevent the checking of unnecessary fields.

Use LIMIT 1 to prevent the checking of unnecessary rows.

edited Mar 6, 2018 at 3:02

AAEM

1,8602 gold badges20 silver badges26 bronze badges

answered Jan 14, 2015 at 9:54

user3059943

2713 silver badges3 bronze badges

2 Comments

Leo Gurdian Over a year ago

Good point but Limit works on MySQL and PostgreSQL, top works on SQL Server, you should note it on your answer

Gray Programmerz Over a year ago

unnecessary with respected to ?

oski · Accepted Answer · 2013-04-08 17:13:23Z

24

SELECT COUNT(1) FROM MyTable WHERE ...

will loop thru all the records. This is the reason it is bad to use for record existence.

I would use

SELECT TOP 1 * FROM MyTable WHERE ...

After finding 1 record, it will terminate the loop.

answered Apr 8, 2013 at 17:13

oski

2712 silver badges3 bronze badges

4 Comments

Eirik H Over a year ago

In case of SELECT TOP 1 will it actually terminate after finding one or does it continue to find all to be able to say which one is TOP?

Eirik H Over a year ago

PS: To be sure I always IF EXISTS (SELECT TOP 1 1 FROM ... WHERE ..)

eFloh Over a year ago

the Star operator will force the DBMS to access the clustered index instead of just the index(es) that will be needed for your join condition. so it's better to use a constant valua as result, i.e. select top 1 1 .... That will return 1 or DB-Null, depending on the condition is a match or not.

isxaker Over a year ago

it's nice. I like the first one.

JesseW · Accepted Answer · 2010-11-23 08:22:14Z

17

The other answers are quite good, but it would also be useful to add LIMIT 1 (or the equivalent, to prevent the checking of unnecessary rows.

answered Nov 23, 2010 at 8:22

JesseW

1,27511 silver badges19 bronze badges

4 Comments

Martin Schapendonk Over a year ago

If any "check for existence" query returns more than one row, I think it is more useful to double check your WHERE clause instead of LIMIT-ing the number of results.

Shantanu Gupta Over a year ago

I think Limit is used in Oracle and not in SQL Server

JesseW Over a year ago

I'm considering the case where they can legitimately be multiple rows -- where the question is: "Is there (one or more) rows that satisfy this condition?" In that case, you don't want to look at all of them, just one.

JesseW Over a year ago

@Shantanu -- I know, that's why I linked to the (very through) en.wikipedia article explaining the other forms.

Winston Smith · Accepted Answer · 2010-11-23 09:58:33Z

14

You can use:

SELECT COUNT(1) FROM MyTable WHERE ...

or

WHERE [NOT] EXISTS ( SELECT 1 FROM MyTable WHERE ... )

This will be more efficient than SELECT * since you're simply selecting the value 1 for each row, rather than all the fields.

There's also a subtle difference between COUNT(*) and COUNT(column name):

COUNT(*) will count all rows, including nulls
COUNT(column name) will only count non null occurrences of column name

edited Nov 23, 2010 at 9:58

answered Nov 23, 2010 at 8:18

Winston Smith

22k10 gold badges63 silver badges75 bronze badges

6 Comments

paxdiablo Over a year ago

You're making the mistaken assumption that a DBMS will somehow check all those columns. The performance difference between count(1) and count(*) will be different only in the most brain-dead DBMS.

paxdiablo Over a year ago

No, I'm saying that you are actually relying on implementation details when you state it'll be more efficient. If you really want to ensure you get the best performance, you should profile it for the specific implementation using representative data, or just forget about it totally. Anything else is potentially misleading, and could change drastically when moving (for example) from DB2 to MySQL.

paxdiablo Over a year ago

I want to make it clear I'm not dissing your answer. It is useful. The only bit I take issue with is the efficiency claim since we've done evaluations in DB2/z and found there's no real difference between count(*) and count(1). Whether that's the case for other DBMS', I can't say.

Winston Smith Over a year ago

"Anything else is potentially misleading, and could change drastically when moving (for example) from DB2 to MySQL" You're much more likely to get bitten by performance degradation of SELECT COUNT(*) when moving DBMS than an implementation difference in SELECT 1 or COUNT(1). I'm a firm believer in writing the code which most clearly expresses exactly what you want to achieve, rather than relying on optimizers or compilers to default to your desired behaviour.

James Anderson Over a year ago

Misleading statement "COUNT(*)" means 'count the rows' full stop. It does not require access to any particular column. And in most cases will not even require access to the row itself as a count any unique index is sufficient.

|

Pranav · Accepted Answer · 2019-07-19 19:36:17Z

3

Other option:

SELECT CASE WHEN EXISTS ( SELECT 1 FROM [MyTable] AS [MyRecord]) THEN CAST(1 AS BIT) ELSE CAST(0 AS BIT) END

answered Jul 19, 2019 at 19:36

Pranav

1555 bronze badges

2 Comments

Sumit Over a year ago

what is the purpose of CAST(1 AS BIT) ? Why can't I just write THEN 1 ELSE 0 ?

Pranav Over a year ago

You defiantly can return 1 or 0. It's all about what type of result you want in the end. I wanted to return boolean instead of numeric value.

Pranav · Accepted Answer · 2021-10-12 14:06:44Z

3

I'm using this way:

IF (EXISTS (SELECT TOP 1 FROM Users WHERE FirstName = 'John'), 1, 0) AS DoesJohnExist

edited Oct 12, 2021 at 14:06

Pranav

1555 bronze badges

answered May 23, 2018 at 10:04

DiPix

6,08318 gold badges76 silver badges123 bronze badges

Collectives™ on Stack Overflow

SQL: How to properly check if a record exists

9 Answers 9

5 Comments

3 Comments

4 Comments

2 Comments

4 Comments

4 Comments

6 Comments

2 Comments

Comments

Linked

Hot Network Questions

Collectives™ on Stack Overflow

9 Answers 9

5 Comments

3 Comments

4 Comments

2 Comments

4 Comments

4 Comments

6 Comments

2 Comments

Comments

Linked

Related