Database; One or two unique fields?

Question

I will use a field named ID; an integer primary key. Is it any point to also specify a second unique field, to be able to reconstruct the tables if some ID fields/relationships are messed up, or is that not a worry?

I'm using auto-increment integer, with Sql-CE database. Single user.

John Saunders · Accepted Answer · 2011-01-01 22:10:52Z

If the semantics of your table require it, then add however many unique keys as you may need. For instance, if in your problem domain all "Employee" entities are uniquely identified by Name, then Name should have a unique index or constraint on it. If there can be two employees with the same name, then, of course, there should be no uniqueness constraint on that column.

I have a client now who needs to keep information per-school, per-fiscal year. They need a unique constraint (or index) on the school and year columns, combined. Sometimes these two are also the primary key, but even when they're not primary, they are unique.

marc_s · Accepted Answer · 2011-01-01 22:03:54Z

1

I don't really understand your motivation to add a second unique ID field - what for?? What's the benefit you see from that??

What you need is an ID that is unique, stable, and able to clearly identify each individual row and can serve as your primary key - an INT IDENTITY in SQL Server does that perfectly well.

I don't see any point or gain in adding a second unique ID to each row...

answered Jan 1, 2011 at 22:03

marc_s

760k186 gold badges1.4k silver badges1.5k bronze badges

2 Comments

bretddog Over a year ago

Well, if the relation was lost due to the database messing up the primary keys of the parent table, then it's no way to reconstruct the relationship, if no other field has a field corresponding to a parent table field. So the data is then useless. That's all my motivation. But I ask out of inexperience. If you tell me; no, nobody ever does this, then fine, I can just forget the idea :)

bretddog Over a year ago

For example, if I log some data for each day, and also I have a set of events on this day stored in another table. I would have a Day table, and an Event table. Day would have fields: ID, Date, SomeParameter. Event would have fields: ID, DayID, Note1, Note2. In this case I do not save the date in the event table. If I did, it would be duplicate data I guess. But that also means only relation that exist is the foreign key DayID.

duffymo · Accepted Answer · 2011-01-01 22:06:35Z

Every table has one or more candidate keys. Pick one to be the primary key; add UNIQUE constraints on the others to enforce semantic correctness. It has nothing to do with being able to reconstruct the table if something is "messed up".

You should also have indexes on columns that appear in WHERE clauses to make access as fast as possible. Those are application/use case specific.

An example of what I mean would be an address table. You might have a surrogate primary key. You'd also have indexes on different combinations of zip, city, province, county, etc. to make SELECTs efficient. You might also want to have the street1/stree2/city/province/zip combination to be UNIQUE. Depends on your business problem.

nvogel · Accepted Answer · 2011-01-02 04:07:31Z

A table should have as many keys as it needs to endure data integrity. If "ID" in this instance means a surrogate key then the answer is yes, you generally ought to have a natural key to ensure uniqueness in a properly normalized database - otherwise you could be duplicating meaningful business data. If you are identifying surrogate keys before natural keys then I think you are taking a flawed approach to design.

Good criteria for choosing keys are: familiarity, simplicity and stability. Add surrogate keys to your model only when and where you have good reason to, having regard the possible disadvantages and additional complexity.

Walter Mitty · Accepted Answer · 2011-01-02 14:17:12Z

Redundancy for the sake of backup can be a worthwhile design.

However, most people do not implement it by loading redundant columns into the schema tables, as your question suggests. There is a lot of overhead in doing it that way, and there are a lot of failure modes where the duplicate ID gets screwed up at the same time that the main ID gets screwed up.

You might be better off designing history tables that keep track of the history of IDs over some limited timespan. A lot of people design history tables as a staging area for off line storage, with only recent history retained in the database. This kind of history can address other problems in addition to the one you are interested in.

Collectives™ on Stack Overflow

Database; One or two unique fields?

5 Answers 5

Comments

2 Comments

Comments

Comments

Comments

Hot Network Questions

Collectives™ on Stack Overflow

5 Answers 5

Comments

2 Comments

Comments

Comments

Comments

Related