Difference between SELECT COUNT(*) and SELECT true finding specific row in a billion rows table

Question

I have a table with maybe 3-5 billions rows. I need to check if a specific value is in that table, which is the fastest way?

SELECT COUNT(*) AS total FROM schema.table WHERE row = 'pattern'; -- Must return 1 or 0

vs

SELECT true AS is_in_table FROM schema.table WHERE row = 'pattern' -- Must return true or no one row at all

Which is the best way to get the 'fastest' result using the appropriate column indexing?

Possible duplicate of Sql: How to properly check if a record exists — clinomaniac
– clinomaniac, Commented Dec 19, 2017 at 0:42

Gordon Linoff · Accepted Answer · 2017-12-19 00:43:00Z

The fastest way is to put an index on schema.table(row).

Then you can execute:

SELECT true AS is_in_table FROM schema.table WHERE row = 'pattern' LIMIT 1;

For this formulation, the LIMIT is important, unless you have explicitly declared row as unique (and even then I'm not 100% sure that MySQL will keep this in mind for the query).

The COUNT(*) will need to look for every value that might match, before returning a row. If the column is declared unique, then the performance should be similar between the two versions.

Collectives™ on Stack Overflow

Difference between SELECT COUNT(*) and SELECT true finding specific row in a billion rows table

1 Answer 1

Comments

Linked

Hot Network Questions

Collectives™ on Stack Overflow

1 Answer 1

Comments

Linked

Related