lookaside: Dehardcode some assumptions #170

bochecha · 2014-10-28T10:59:01Z

The current code assumes that:

1. the message contains a 'md5sum' key, which is going to go away when
   we move to a different hash algorithm,
2. it knows the path to the file on the lookaside cache, which is going
   to change very soon.

The solution this change implements is to simply take the path to the
uploaded source file entirely from the message.

However, the lookaside cache doesn't emit messages like this yet, so the
current code is kept as a fallback.

https://fedorahosted.org/rel-eng/ticket/5846

puiterwijk · 2014-10-28T11:40:30Z

fedmsg_meta_fedora_infrastructure/scm.py

+            try:
+                path = msg['msg']['path']
+
+            except KeyError:


I really don't like the idea of try-then-fail.
How about just checking "'path' in msg['msg']"?

puiterwijk · 2014-10-28T11:45:03Z

I'm not sure I like sending messages with only the complete path embedded: what if people want to get the filename, the hashsum, etc.? I'd rather avoid having to tell them "just parse the path" if we have the information available when sending the message.

I think a better way might be to replace the md5sum key with something like
{
hash: 'XXXXX',
type: 'md5',
}

This way, we could switch the hash algorithm without any trouble, and we would still get the information everywhere.
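
As a rough illustration of how a consumer could read such a payload, here is a minimal sketch. It assumes the `hash` and `type` keys sit at the top level of the payload next to the existing `md5sum` key; that placement and the `get_checksum` helper are hypothetical, not something the emitter is known to do.

```python
def get_checksum(payload):
    """Return a (hash_type, checksum) pair from a message payload."""
    if 'hash' in payload and 'type' in payload:
        # New-style payload (assumed shape): the algorithm is named explicitly.
        return payload['type'], payload['hash']
    # Old-style payload: only the 'md5sum' key is available.
    return 'md5', payload['md5sum']


# Placeholder values, not real checksums.
new_style = {'hash': 'XXXXX', 'type': 'sha512'}
old_style = {'md5sum': 'YYYYY'}
```

With this shape, switching algorithms only changes the value of `type`; consumers keep reading the same two keys.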

bochecha · 2014-10-28T11:53:12Z

I wasn't thinking of removing the information currently in the message. People and tools trying to obtain information like the checksum or the file name won't ever have to parse the path; they will continue to get it from the messages.

What I'm doing here is removing broken assumptions in a particular consumer (this project), not removing data from the messages emitted.

What you suggest (which is what I was already going to do on the emitter side actually) only helps for one assumption: the hash type.

However, there is a second assumption, which is just as broken: that the path to the file is made of "{prefix}/{name}/{filename}/{md5sum}/{filename}".

This second assumption is only true for now; it will be false once we finish the migration to a hash other than MD5. (The hash type will be part of the directory structure, like so: "{prefix}/{name}/{filename}/{hash_type}/{checksum}/{filename}".)

Realistically, only the lookaside server truly knows its path structure. Consumers should not assume they can recreate the path from what they think are its components.
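
To make the difference concrete, here is a small sketch that formats the same upload with both layouts quoted above. The two templates come from this comment; the prefix, package name, file name and checksum values are made-up placeholders.

```python
OLD_LAYOUT = "{prefix}/{name}/{filename}/{md5sum}/{filename}"
NEW_LAYOUT = "{prefix}/{name}/{filename}/{hash_type}/{checksum}/{filename}"

# Hypothetical components of one uploaded source file.
parts = {
    'prefix': '/lookaside',
    'name': 'somepkg',
    'filename': 'somepkg-1.0.tar.gz',
    'md5sum': 'XXXXX',
    'hash_type': 'sha512',
    'checksum': 'YYYYY',
}

# str.format ignores the keyword arguments a template does not use.
old_path = OLD_LAYOUT.format(**parts)
new_path = NEW_LAYOUT.format(**parts)
```

A consumer that hardcodes the first template produces a path that no longer exists once the server moves to the second one.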

bochecha · 2014-10-28T12:06:45Z

@pypingou just made an interesting observation: the fallback code can never ever be eliminated as we need to be able to parse old messages.

So, as much as I prefer the « try new, then fall back on the old in except » code (« it's easier to ask for forgiveness than permission »), it means that if we ever change again the messages in the future, we're going to have to deal with multiple exception levels.

In such a case, asking for permission does make much more sense, so here's an updated pull request which does just that. 😃
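
For readers following along, the "ask for permission" shape looks roughly like the sketch below. It is not the code from this pull request: the `lookaside_path` helper, its `prefix` parameter and the `name` and `filename` keys are assumptions for illustration; only `msg['msg']['path']` and the 'md5sum' key are taken from the discussion.

```python
def lookaside_path(msg, prefix):
    """Return the path of the uploaded source file for a message."""
    body = msg['msg']

    if 'path' in body:
        # New-style message: the lookaside server tells us the path.
        return body['path']

    # Old-style message: rebuild the path from its components. This
    # branch has to stay so that old messages can still be parsed.
    return '/'.join([
        prefix,
        body['name'],
        body['filename'],
        body['md5sum'],
        body['filename'],
    ])
```

Each future message format becomes one more `if` branch at the same level, rather than another nested `try`/`except`.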

puiterwijk · 2014-10-28T12:16:52Z

Thanks for fixing that, and for reassuring me that the checksum, filename, etc. attributes aren't being removed.
In that case, I'm 👍 to this change.

bochecha · 2014-10-28T12:19:55Z

Heh, turns out I broke the tests with a stupid syntax error. 😄

Fixed now (I just added a missing :), let's wait for Travis to finish before merging.

ralphbean · 2014-10-28T12:59:04Z

> if we ever change again the messages in the future, we're going to have to deal with multiple exception levels.

Yeah, this is already the case with some of the other processors. ;)

ralphbean · 2014-10-28T13:00:05Z

👍 from me to merge. Thanks @bochecha, @puiterwijk, and @pypingou.

lookaside: Dehardcode some assumptions

This is the counterpart of this change: fedora-infra/fedmsg_meta_fedora_infrastructure#170. Now that it has been deployed, we can start emitting the new messages.

bochecha pushed a commit that referenced this pull request Oct 28, 2014

Merge pull request #170 from bochecha/feature/lookasidemsgs

526d283

lookaside: Dehardcode some assumptions

bochecha merged commit 526d283 into fedora-infra:develop Oct 28, 2014

bochecha deleted the feature/lookasidemsgs branch October 28, 2014 13:38
