[OpenID] Recycling OpenIDs (Was: What's broken in OpenID 2.0? (IIW session))

John Panzer jpanzer at aol.net
Wed May 16 17:28:19 UTC 2007


Martin Atkins wrote:
> John Panzer wrote:
>   
>> And believe me, AOL is very concerned about recycling and the issues 
>> therein.  We of course have a globally unique identifier that's used 
>> internally in exactly the way described above; this lets you 
>> disambiguate whether example.org/fred is the same fred as last year or a 
>> new fred.  For policy reasons we can't expose that GUID, but perhaps a 
>> hash(GUID,RP identifier) would be perfectly fine to expose in a standard 
>> "permaGUID" attribute.
>>
>> Yes, this doesn't help with disambiguating things like authors of blog 
>> posts in archives.  But there datestamps are usually available. 
>>
>>     
>
> An identifier plus a timestamp alone don't really help you much, because 
> you probably don't know at what point in time the identifier ceased to 
> be one person and started to be another.
>
> This problem is really in two halves, with different needs each:
>
>   A) HTTP URLs for authentication. This is to do with preventing a 
> subsequent identifier owner from accessing data created by prior owners.
>
>   B) HTTP URLs for identification. This is to do with figuring out who 
> actually did something given only an OpenID identifier as attribution.
>   
In many cases, you also have a time context.  Almost everything 
published on the web and other places has at least a simple timestamp on 
it: Blog posts, web pages, events, log entries... In a large and 
interesting subset of the problem space, you can make a 99% accurate 
inference that http://bob.com/ on May 15, 2007 is almost certainly the 
same person as http://bob.com/ on May 17, 2007.  If you have some best 
practices that put a known buffer between recyclings (1 month, 1 year, 
whatever) you can improve this accuracy.  And of course if you control 
the data you can always add a timestamp.  (If you don't control it, 
perhaps you can't achieve 100% accuracy anyway.)

In other word, I disagree with the premise that there's not enough 
information to achieve a reasonable approximation of B, as long as we 
restrict B to trying to answer "do identifiers X at time0 and X at time1 
denote the same identity?".

I don't see another way to solve this other than by adding a GUID to the 
identifier-as-published-on-the-web (or at least a revision number) which 
is, in a word, ugly.  Also no less prone to social engineering attacks IMHO.

-John
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.openid.net/pipermail/openid-general/attachments/20070516/d5d3ea99/attachment-0001.htm>


More information about the general mailing list