Making a database identifier unique per user?

Cyno@programming.dev · 3 months ago

Making a database identifier unique per user?

originalucifer@moist.catsweat.com · 3 months ago

i think youre missing a table.

you have categories, you have users, and then you have an ‘assignment’ table that contains a user key and a category key… maybe its own auto incrementing identifier

pe1uca@lemmy.pe1uca.dev · 3 months ago

Plus I’d suggest having a slug so the user doesn’t have to memorize a meaningless number, instead a similar sounding string.

Instead of having 12345, something like category-1 for “Category 1”.
Specially for sharing with a URL, it’s more meaningful to share " domain.tld/search/categories/cat-1" than any other form of id (I’m annoyed with lemmy for not having a slug for posts, it feels so shady to share anything haha)

Cyno@programming.dev · 3 months ago

This is something I’ve been considering too, since the name is in this case unique per user I can just use it for everything in frontend rather than the ID. It’s not always a good solution though so I was wondering how would I solve it with IDs alone

pe1uca@lemmy.pe1uca.dev · 3 months ago

You shouldn’t use the name as a replacement for the ID, you need to use a slug.
The name should be stored as the user sets it, and the slug is autogenerated by your code removing any problematic character, so usually it only contains letters, numbers, and dashes, which makes it perfect to be a substitute for the numeric ID.
There should be libraries to handle this for you.

And ID is just something to identify a resource, so your ID in this case would be the slug.
I have a use case where the ID is generated by two fields, adapting it to your case would be something like /users/{user}/categories/{category}
So, whatever you define to be a unique way of working with an entity will be the identifier (ID) of that entity.

Cyno@programming.dev · 3 months ago

Good point, that sounds nicer than just encoding the name for sure, thanks

Cyno@programming.dev · 3 months ago

I have a join table between Category and other entities that can be categorized in this way, but I dont think I need one between User and Category? Different users can’t share the same category so it’s a 1-n relationship, not n-n.

Even if I did though I still have the same issue since I have to figure out how to autoincrement it, only now in the join table rather than the Category table.

originalucifer@moist.catsweat.com · 3 months ago

if cats are unique to a person, you would just use your auto-created id for the category table. sounds like you need to separate your internal IDs from your external (human readable IDs)…

if you need something human-readable, you would concat an additional field in the category table with a ‘category display id’ or somethin

lemmy_in@lemm.ee · 3 months ago

You could also just use a random non-numeric primary key. For example you could generate a string of 8 random characters + numbers. That would give you well over 2 billion possible IDs.

SzethFriendOfNimi@lemmy.world · 3 months ago

This is the kind of stuff that UUID’s are good for. When you want to generate non conflicting unique id’s. Even across systems and over time.

lemmy_in@lemm.ee · 3 months ago

I personally feel UUIDs are overused unless you happen to be running truly distributed systems that are all independently generating IDs.

In this case where the ID is also going to be in the URL, you’ve just added 32 characters to the URL that don’t need to be there. Since OP is apparently concerned with the look and feel of the URLs, I thought that UUIDs wouldn’t be the best option.

Cyno@programming.dev · 3 months ago

In the context of this small app im writing category is unique by name already so I can just use that if I wanted to go the string route, but agreed - yours is probably the standard way, youtube/reddit do it like that after all.

I’m still wondering about the technical implementation of it - where would you generate the string? Manually in backend before each save, probably using a locking mechanism to prevent accidentally creating 2 identical IDs at the same time? I’d have to do a db hit to make sure it doesn’t exist already every time, right? Maybe I just try to insert and see if it crashes due to the uniqueness index? Maybe I use a store procedure in the database to get a unique ID? Do I just hash the timestamp or sth like that?

Whether I generate a number or a string, feels like I always open it up to many issues.

lemmy_in@lemm.ee · 3 months ago

For small apps, generating it in the backend, trying to insert it, and then catching the exception should be totally fine. The odds of collision are quite small.

CrolishGrandma@lemmy.world · 3 months ago

Your description of the problem makes sense. If you think improving the user experience in this way is worthwhile, then go for it. Just remember, most users don’t pay attention to the address bar.

For implementation, I suggest creating a new table, something like ‘tb_user_category_index,’ with userId as a foreign key and an integer that increments each time it’s used. Automating this process in the database would be ideal: set it up once and let it run on its own, keeping things simple.

However, don’t confuse this cosmetic index with a primary key. The primary key should only serve as a unique identifier for a record and hold no other information.

Cyno@programming.dev · 3 months ago

By automating it you mean something a store procedure that returns the ID and increments the count at the same time or is there a more sophisticated way of doing it?

dbx12@programming.dev · 3 months ago

I think you shouldn’t bother. First, as another user already said, the normal user does not look much at the address bar. Second, how will you work with deletions? Say user created 4 categories 1-4 and deletes category 2. Now you either have a hole or you “reindex” the categories to 1-3, which is probably bad since it breaks any bookmarks the user created.

Cyno@programming.dev · 3 months ago

Deletions would work the same way as with a regular autoincrementing ID, it just always goes up. All it matters is that it doesn’t expose how many other IDs are in the DB

Muad'Dibber@lemmygrad.ml · 3 months ago

This is basic database 101 stuff, I’d recommend watching some videos on SQL. It will go through what a bridge table is.

breadsmasher@lemmy.world · 3 months ago

User(Id, name etc) 1 — * UserCategory(UserId, CategoryId) * — 1 Category(Id, name etc)

BehindTheBarrier@programming.dev · edit-2 3 months ago

Ultimately, I don’t see any need for users to care about ids. I’d just make a user ID column in the category table and use that for selecting categories. Rather think about a display name that is either known or made by the user.

The only problem with increasing numbers is if you don’t properly limit access to creator account and a user can just edit the url bar to get others categories. It’s still a problem with other unique stuff, but less easy to crack when it’s not sequential.

If you really want it, just replace ID in the url with the category name, if that is a simple string without any special characters. Actually useful for users visually, avoids numbers, users only see what they care about, still no bridging table. And the query is still a simple where user_id = x and category_name = y. This actually means non-unique names, but you are always filtering on users, so you can instead use a constraint or unique composite index on user_id and category name.

Personally i would stick with IDs because they are simple, and don’t change so they are not prone to renaming, special character issues, and whatever else string handing deals with. And it’s probably slower.