<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>The discovery blog &#187; Richard Padley</title>
	<atom:link href="http://blogs.semantico.com/discovery-blog/author/richard-padley/feed/" rel="self" type="application/rss+xml" />
	<link>http://blogs.semantico.com/discovery-blog</link>
	<description>Semantico looks at online publishing</description>
	<lastBuildDate>Thu, 02 Sep 2010 10:22:31 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.2</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<image>
			<title>The discovery blog</title>
			<url>http://blogs.semantico.com/discovery-blog/wp-content/uploads/2008/11/logo64.png</url>
			<link>http://blogs.semantico.com/discovery-blog</link>
			<width>64</width>
			<height>64</height>
			<description>Semantico looks at online publishing</description>
		</image>		<item>
		<title>COUNTER Project announces new audit requirements</title>
		<link>http://blogs.semantico.com/discovery-blog/2010/08/counter-project-announces-new-audit-requirements/</link>
		<comments>http://blogs.semantico.com/discovery-blog/2010/08/counter-project-announces-new-audit-requirements/#comments</comments>
		<pubDate>Thu, 26 Aug 2010 11:07:40 +0000</pubDate>
		<dc:creator>Richard Padley</dc:creator>
				<category><![CDATA[Technology]]></category>

		<guid isPermaLink="false">http://blogs.semantico.com/discovery-blog/?p=1783</guid>
		<description><![CDATA[The COUNTER project recently announced that vendors wishing to remain compliant to the code of practice for books and reference works must now undergo a mandatory annual audit. Previously an audit was only required after the first year of delivering the usage reports.
This will be good news for those librarians who are concerned that the figures [...]]]></description>
			<content:encoded><![CDATA[<p>The COUNTER project <a href="http://www.projectcounter.org/documents/newsrelease_aug10.pdf">recently announced</a> that vendors wishing to remain compliant to the code of practice for books and reference works must now undergo a mandatory annual audit. Previously an audit was only required after the first year of delivering the usage reports.</p>
<p>This will be good news for those librarians who are concerned that the figures publishers provide after the first year of compliance might be inaccurate. An annual audit should ensure that no software errors suddenly appear.</p>
<p>However this will be bad news for those wanting to see costs kept down in a sector which is already hugely challenged by budget cuts. Publishers must bear the extra costs of the annual audit, and it&#8217;s hard to imagine that these will not be passed on to libraries in some way.</p>
<p>The audit must be carried out by a professional organisation recognised by the COUNTER project. This includes <a href="http://www.abc.org.uk/Corporate/AboutABCe/aboutABCe.aspx">ABC Electronic</a> in the UK, who offer a fixed price for this service. I contacted ABC Electronic to find out their prices but I did not recieve a reply.</p>
<p>It&#8217;s hard to imagine a worse time for this change. Although the financial impact will be fairly small, costs and budgets are being squeezed from all sides at the moment. Lets hope there are no more changes like this in the pipeline.</p>
<p><strong>Update</strong>: After I published this ABC Electronic saw this blog post and called me to apologise for not replying to my email enquiry, and I subsequently received pricing details for the COUNTER compliance auditing service they provide.</p>
]]></content:encoded>
			<wfw:commentRss>http://blogs.semantico.com/discovery-blog/2010/08/counter-project-announces-new-audit-requirements/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Why recycled journals systems don’t work for books and reference</title>
		<link>http://blogs.semantico.com/discovery-blog/2010/07/why-recycled-journals-systems-don%e2%80%99t-work-for-books-and-reference/</link>
		<comments>http://blogs.semantico.com/discovery-blog/2010/07/why-recycled-journals-systems-don%e2%80%99t-work-for-books-and-reference/#comments</comments>
		<pubDate>Thu, 15 Jul 2010 14:18:23 +0000</pubDate>
		<dc:creator>Richard Padley</dc:creator>
				<category><![CDATA[E-books]]></category>
		<category><![CDATA[Information Architecture]]></category>
		<category><![CDATA[Technology]]></category>

		<guid isPermaLink="false">http://blogs.semantico.com/discovery-blog/?p=1689</guid>
		<description><![CDATA[Many publishers want to monetise their books and reference content by making these materials available online. And a common strategy to drive sales is to combine book and journal content within a single platform; synergies between different types of content should drive discovery and increase usage.
Some publishers choose to adapt an existing off-the-shelf journals platform [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://blogs.semantico.com/discovery-blog/wp-content/uploads/2010/06/square-peg-round-hole.jpg"><img class="alignright size-full wp-image-1694" title="square-peg-round-hole" src="http://blogs.semantico.com/discovery-blog/wp-content/uploads/2010/06/square-peg-round-hole.jpg" alt="Square peg in round hole" width="272" height="322" /></a>Many publishers want to monetise their books and reference content by making these materials available online. And a common strategy to drive sales is to combine book and journal content within a single platform; synergies between different types of content should drive discovery and increase usage.</p>
<p>Some publishers choose to adapt an existing off-the-shelf journals platform to meet their needs for consolidation. However there can be technical problems inherent in this approach.<span id="more-1689"></span></p>
<p>In a <a href="http://www.crossref.org/CrossTech/2010/02/does_a_crossref_doi_identify_a.html">recent post</a> to the CrossTech blog, publishing guru Geoffrey Bilder analysed the issues facing CrossRef members wishing to use the DOI system for non-journal content. Geoff&#8217;s analysis holds good for more than just the CrossRef DOI system, so I&#8217;ve taken it as a starting point below.</p>
<p>Reference publications and databases introduce fundamental challenges to any existing system designed around the journal article content model. These challenges fall into two areas:</p>
<ul>
<li><strong>Structure</strong>. Reference works and databases can have complex nested substructures and there is a need for granular identification of these content substructures along with a mechanism for recording the relationship between them (e.g. “part-of” relationships between sub-section, section and chapter divisions, as well as “previous-next” navigational relationships between entries).</li>
<li><strong>Versioning</strong>. Unlike most journals, many reference books and databases change over time. To properly support archiving and perpetual access business models, there is a need to identify and maintain previous versions of reference content.</li>
</ul>
<p>Both of these areas require support from the core architecture of any publishing system; without such support publishers have to resort to ugly work-arounds such as coercing all content into journal-article-shaped chunks. Such work-arounds fundamentally compromise the end-user experience and ultimately risk devaluing the publisher brand.</p>
<p>Furthermore the technology stack commonly used to build a typical journals system does not necessarily provide a good base to build support for the structural and versioning requirements outlined above. This is because the mixture of technologies used tends to match closely the typical structure of a journal article.</p>
<p>In an article the metadata and content are two separable parts which map nicely to a relational metadata store (typically SQL or RDF) plus a full text retrieval engine (Lucene/Solr, Autonomy, etc). Although the full text may contain semantic annotations such as chemical or gene markup, the homogenous nature of the content means that a single metadata schema can be devised which will fit the whole collection well.</p>
<p>However, when publishers want to add non journal content to the system this simple separation of content and metadata will no longer suffice. Reference and book content is much more demanding in terms of hierarchical structure and navigation. Many hierarchical structures in reference books cannot easily (if at all) be modelled using conventional relational metadata; these structures fit much more naturally and easily into native XML. Trying to create a single relational metadata schema capable of modelling all hierarchies across the full variety of reference and book content that publishers produce is an impossible task.</p>
<p>The structural problems of adding books and reference content to systems designed to separate full text and metadata can be broken down into two distinct areas:</p>
<ol>
<li><strong>Fragile schema problem</strong>. The problem of updating conventional database schemas as requirements (and the real world) change is often called the fragile schema problem. Because the design of the database must be fixed at the outset, any changes to the system, to accommodate extra metadata, new content types or changes to business process become costly and risk-laden. This is because assumptions about the structure or schema of a database tend to be hard-wired into the structure of all the software written to talk to the database.</li>
<li><strong>Divorce of metadata from full text</strong>. Systems which use RDF or SQL databases for metadata suffer from a fundamental structural weakness as it is normally impossible to issue queries which examine both the full text and metadata within a single request. RDF suffers particularly in this respect as it has no support for XML mixed content; full text search plays no part in the RDF world view. Furthermore, such systems also often require content to be ‘loaded’ into several different internal content stores (e.g. once for full text index, once to ‘strip’ metadata to the RDF/SQL store). This builds a structural inefficiency into the system as content must be found, queried and updated in multiple locations.</li>
</ol>
<p>When we design the architecture for our publishing platforms we like to think hard about these problems. Often we conclude that XML database technology (such as <a href="http://www.marklogic.com/product/marklogic-server.html">MarkLogic Server</a>) provides the best solution<strong> </strong>because it allows content to be stored and queried in a single place. Using an XML database removes the artificial split between content and metadata inherent in conventional journals systems and allows us to build search queries across an entire collection of content. This in turn helps to drive discovery and usage by building deeper links between related content and combining and assembling content from an entire collection in new ways.</p>
<p><strong> </strong>However we also realise that different clients have different needs and that it is critical to prioritise meeting business goals ahead of making specific technology choices. The motivation for our overall technical architecture choice in a given project is simple; to learn the lessons of past systems and ensure we choose the best possible technology basis for any new  system. Understanding the problems publishers have faced in trying to adapt legacy journals systems to the more challenging world of books and reference helps us to make the most appropriate technology choices for future publishing platforms.</p>
]]></content:encoded>
			<wfw:commentRss>http://blogs.semantico.com/discovery-blog/2010/07/why-recycled-journals-systems-don%e2%80%99t-work-for-books-and-reference/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>ALPSP Seminar &#8211; Richard Padley speaks on The Future of Academic Publishing</title>
		<link>http://blogs.semantico.com/discovery-blog/2010/04/alpsp-seminar-richard-padley-speaks-on-the-future-of-academic-publishing/</link>
		<comments>http://blogs.semantico.com/discovery-blog/2010/04/alpsp-seminar-richard-padley-speaks-on-the-future-of-academic-publishing/#comments</comments>
		<pubDate>Fri, 30 Apr 2010 09:03:41 +0000</pubDate>
		<dc:creator>Richard Padley</dc:creator>
				<category><![CDATA[E-books]]></category>
		<category><![CDATA[Events]]></category>
		<category><![CDATA[Mobile]]></category>
		<category><![CDATA[Technology]]></category>

		<guid isPermaLink="false">http://blogs.semantico.com/discovery-blog/?p=1438</guid>
		<description><![CDATA[The  Future of Academic Book Publishing was a one-day seminar which provided a unique opportunity for those attending to consider both the  present situation facing academic and scholarly publishers of all shapes  and sizes, and the likely direction for the business of academic book  publishing in the immediate future.
For those who [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.alpsp.org/ngen_public/article.asp?aid=111846">The  Future of Academic Book Publishing</a> was a one-day seminar which provided a unique opportunity for those attending to consider both the  present situation facing academic and scholarly publishers of all shapes  and sizes, and the likely direction for the business of academic book  publishing in the immediate future.</p>
<p>For those who were unable to attend, or would like to listen in full again, you can listen to a recording of <em>eBook readers and the Future of Other New Technologies</em> and view the accompanying slides below.</p>
<p><object classid="clsid:d27cdb6e-ae6d-11cf-96b8-444553540000" width="100%" height="81" codebase="http://download.macromedia.com/pub/shockwave/cabs/flash/swflash.cab#version=6,0,40,0"><param name="allowscriptaccess" value="always" /><param name="src" value="http://player.soundcloud.com/player.swf?url=http%3A%2F%2Fsoundcloud.com%2Falablackman%2Frichard-padley-ebook-readers" /><embed type="application/x-shockwave-flash" width="100%" height="81" src="http://player.soundcloud.com/player.swf?url=http%3A%2F%2Fsoundcloud.com%2Falablackman%2Frichard-padley-ebook-readers" allowscriptaccess="always"></embed></object></p>
<p>If you would like to view the slides or listen later at your own leisure I have made the <a href="http://blogs.semantico.com/discovery-blog/wp-content/uploads/2010/04/1003FAB-Padley-for-web.pdf" target="_blank">accompanying slides</a> and an <a href="http://www.alpsp.org/docimages/1429.mp3" target="_blank">mp3 recording</a> of the presentation available here.</p>
]]></content:encoded>
			<wfw:commentRss>http://blogs.semantico.com/discovery-blog/2010/04/alpsp-seminar-richard-padley-speaks-on-the-future-of-academic-publishing/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Publisher choices for mobile delivery &#8211; apps vs websites</title>
		<link>http://blogs.semantico.com/discovery-blog/2010/04/publisher-choices-for-mobile-delivery-apps-vs-websites/</link>
		<comments>http://blogs.semantico.com/discovery-blog/2010/04/publisher-choices-for-mobile-delivery-apps-vs-websites/#comments</comments>
		<pubDate>Fri, 09 Apr 2010 09:42:26 +0000</pubDate>
		<dc:creator>Richard Padley</dc:creator>
				<category><![CDATA[E-books]]></category>
		<category><![CDATA[Mobile]]></category>
		<category><![CDATA[Online Publishing Market]]></category>

		<guid isPermaLink="false">http://blogs.semantico.com/discovery-blog/?p=1285</guid>
		<description><![CDATA[The current explosion in interest around mobile devices such as the iPad and Kindle raises an interesting question for publishers. Is it best to create mobile specific websites to deliver content to smartphones, iPads or other mobile devices, or is it better to develop apps targeted at the iPhone, Android and other mobile platforms? In [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://blogs.semantico.com/discovery-blog/wp-content/uploads/2010/04/Screen-shot-2010-04-08-at-16.56.01.png"><img class="alignright size-full wp-image-1288" title="The discovery blog on the iPhone" src="http://blogs.semantico.com/discovery-blog/wp-content/uploads/2010/04/Screen-shot-2010-04-08-at-16.56.01.png" alt="" width="174" height="323" /></a>The current explosion in interest around mobile devices such as the iPad and Kindle raises an interesting question for publishers. Is it best to create mobile specific websites to deliver content to smartphones, iPads or other mobile devices, or is it better to develop apps targeted at the iPhone, Android and other mobile platforms? In this post I examine the pros and cons of each approach.<br />
<span id="more-1285"></span></p>
<h2>Developing mobile optimised websites</h2>
<p>By developing a mobile optimised website publishers can provide the best user experience  for users with limited screen sizes and limited connection speeds. For example, this blog is fully optimised for mobile devices, so do try reading it from your smartphone to check out how the experience differs.</p>
<h3>Advantages</h3>
<ul>
<li>Lower cost of development (compared to app development)</li>
<li>Fast time to market</li>
<li>Support for dynamic content, content updates can happen in real time</li>
<li>Completely cross platform and cross device</li>
<li>The ability to leverage existing publishing commercial infrastructure; access controls and subscription models for existing websites apply equally well to mobile optimised websites</li>
<li>Leverage existing development and production processes and tools</li>
<li>Leverage publishers existing hosting and support services</li>
</ul>
<h3>Disadvantages</h3>
<ul>
<li>Internet connection required at all times</li>
<li>Speed &amp; latency of the site must be fully optimised</li>
<li>Mobile delivery creates new ways of using content and business models will have to adapt</li>
<li>Discovery of the content will be via existing channels; SEO work may be needed to ensure users can find your content effectively</li>
</ul>
<h2>Developing apps</h2>
<p>Developing an app allows a publisher to take full control of all of the capabilities of the underlying device. It gives you the opportunity to use every advantage that mobile delivery has to offer. If fast interactive graphics and immersive functionality are important to your content then developing an app will be the most appropriate choice.</p>
<h3>Advantages</h3>
<ul>
<li>You can control the user experience, ensuring all users get the best you can offer</li>
<li>Ability to use native UI for fast graphics</li>
<li>You can leverage existing distribution and monetization services via app stores (e.g. Apple and Google)</li>
<li>B2B publishers can extend their reach into direct consumer markets by using app store services</li>
<li>The app upgrade mechanism can be used to push new content and functionality onto devices automatically</li>
</ul>
<h3>Disadvantages</h3>
<ul>
<li>Custom app development may be expensive</li>
<li>Publishers may have to re-invent the wheel by re-implementing solutions to problems they have already solved in the web environment</li>
<li>There is little or no potential for sharing development effort between different mobile platforms</li>
<li>Lack of ways to hyperlink apps together. Academic citation depends on linking but apps cannot be linked in the way web resources can.</li>
<li>Limited memory of mobile devices means large publisher datasets cannot be accomodated.</li>
</ul>
<p>These are just some of the pros and cons of each approach. Inevitably the best choice in any given situation will depend on the content, the market, the level of investment needed and the risk publishers are willing to take on individual titles.</p>
<p>At Semantico we anticipate that some new titles and projects will need development for both web and apps whilst existing online titles may have apps designed for them. By offering both web and app development we&#8217;re giving publishers the chance to treat the two ways of presenting content as one project.</p>
<p>Each one of the bullet points could be a blog post in itself and no doubt we&#8217;ll be revisiting some of these issues future posts. Because whilst the issue of whether or not apps are the future of ebooks is still undecided, those publishers who&#8217;ve not yet dipped their toe in the water need to start rolling up their trousers. Do <a href="http://blogs.semantico.com/discovery-blog/contact/">contact us</a> now to start a conversation about your mobile delivery options.</p>
]]></content:encoded>
			<wfw:commentRss>http://blogs.semantico.com/discovery-blog/2010/04/publisher-choices-for-mobile-delivery-apps-vs-websites/feed/</wfw:commentRss>
		<slash:comments>11</slash:comments>
		</item>
		<item>
		<title>iBooks or Apps? The publisher&#8217;s dilemma</title>
		<link>http://blogs.semantico.com/discovery-blog/2010/02/ibooks-or-apps-the-publishers-dilemma/</link>
		<comments>http://blogs.semantico.com/discovery-blog/2010/02/ibooks-or-apps-the-publishers-dilemma/#comments</comments>
		<pubDate>Wed, 24 Feb 2010 07:51:25 +0000</pubDate>
		<dc:creator>Richard Padley</dc:creator>
				<category><![CDATA[E-books]]></category>
		<category><![CDATA[Mobile]]></category>
		<category><![CDATA[Online Publishing Market]]></category>
		<category><![CDATA[Publishing business models]]></category>

		<guid isPermaLink="false">http://blogs.semantico.com/discovery-blog/?p=1138</guid>
		<description><![CDATA[Should publishers sell books using Apple’s App Store or iBookstore?
Many publishers have started using the iPhone App Store as a channel to sell book content by packaging e-books as applications. There are currently 18,000 books in the App Store, and books are the fastest growing category of application in the store.
With the launch of the [...]]]></description>
			<content:encoded><![CDATA[<p><em>Should publishers sell books using Apple’s App Store or iBookstore?</em></p>
<p>Many publishers have started using the iPhone App Store as a channel to sell book content by packaging e-books as applications. There are currently 18,000 books in the App Store, and books are the fastest growing category of application in the store.</p>
<p>With the launch of the iPad and the iBookstore, Apple has given publishers another option for delivering content. In this post we analyse the pros and cons of both approaches.<span id="more-1138"></span></p>
<table border="0" cellspacing="0" cellpadding="0">
<tbody>
<tr>
<td valign="top"></td>
<td valign="top"><strong>e-book App</strong></td>
<td valign="top"><strong>iBook download</strong></td>
</tr>
<tr>
<td valign="top"><strong>Business model</strong></td>
<td valign="top">Publisher free to set price point. Apple take 30% of revenues.</td>
<td valign="top">Publisher free to set price point. Apple take 30% of revenues.</td>
</tr>
<tr>
<td valign="top"><strong>Production process</strong></td>
<td valign="top">Convert content to online PDF or XML.</p>
<p>Design, build, test and debug application. Submit to Apple.</td>
<td valign="top">Convert content to ePub XML.</td>
</tr>
<tr>
<td valign="top"><strong>Approval process</strong></td>
<td valign="top">Apps approved on an individual basis by Apple at their exclusive discretion. Process known to be slow and subject to censorship.</td>
<td valign="top">Currently unknown.</p>
<p>The process must scale to significantly higher volumes than the app approval process. Withholding approval by Apple much less likely as censorship of books could be damaging.</td>
</tr>
<tr>
<td valign="top"><strong>Discoverability</strong></td>
<td valign="top">Good via iTunes, but lacking specific book specific features. Store is geared more towards conventional apps.</td>
<td valign="top">Currently unknown.</p>
<p>Delivering iBooks via a separate channel from apps will enable Apple to build book specific features such as full text search, content previews, related reading, discussion forums etc.</p>
<p>These features would be in direct competition with Amazon so search engine optimisation will be critical.</td>
</tr>
<tr>
<td valign="top"><strong>Delivery platforms</strong></td>
<td valign="top">Available on iPhone, iPod and iPad from launch.</td>
<td valign="top">US iPad only at launch.</p>
<p>Apple would be missing a significant opportunity if the iBookstore is not made available on the iPhone quickly. The use of ePub would allow Apple to deliver the iBook reader application to desktop machines in addition to the iPad after launch.</td>
</tr>
</tbody>
</table>
<h2>Conclusion</h2>
<p>For straightforward chapter based book content it seems clear there is no longer a compelling case for publishers to deliver e-books as apps. The extra cost of software development, combined with the slowness and lack of scalability in the approval process no longer make sense now that Apple have introduced the iBookstore.</p>
<p>For other types of content the case is not so clear cut. Publishers with the ability to invest can develop reference based apps which add value by delivering content in context. Mobile workflow applications will still be a signifiant growth opportunity for publishers.</p>
<p>As the iBookstore is currently geared toward the consumer market, publishers who deliver large databases of journal and book content to institutional markets should look to the iBook model as a way of tapping into a traditionally harder to reach individual market. And publishers wishing to monetise currently offline backlist content should look carefully at the opportunities afforded by the iBook platform.</p>
<p>Although the iBook reader app is not currently available on the iPhone there seems no compelling reason why it will not be released in response to market demand. Bearing in mind the popularity of e-book applications on the iPhone this would appear to be a very simple decision for Apple.</p>
]]></content:encoded>
			<wfw:commentRss>http://blogs.semantico.com/discovery-blog/2010/02/ibooks-or-apps-the-publishers-dilemma/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
		<item>
		<title>The Challenge of Online Identity: Part 3</title>
		<link>http://blogs.semantico.com/discovery-blog/2010/02/the-challenge-of-online-identity-part-3/</link>
		<comments>http://blogs.semantico.com/discovery-blog/2010/02/the-challenge-of-online-identity-part-3/#comments</comments>
		<pubDate>Wed, 03 Feb 2010 14:27:27 +0000</pubDate>
		<dc:creator>Richard Padley</dc:creator>
				<category><![CDATA[Access and identity management]]></category>
		<category><![CDATA[Online Identity 2.0]]></category>
		<category><![CDATA[Online Publishing Market]]></category>
		<category><![CDATA[Publishing business models]]></category>
		<category><![CDATA[Technology]]></category>

		<guid isPermaLink="false">http://blogs.semantico.com/discovery-blog/?p=973</guid>
		<description><![CDATA[This is the third and last of a series of blog posts (see part one and part two) in which I set out to examine the current state of identity management in our industry and where it’s going. The real point of this series has been to answer the question (which will be familiar to [...]]]></description>
			<content:encoded><![CDATA[<p><img class="alignright size-full wp-image-723" title="Fingerprint" src="http://blogs.semantico.com/discovery-blog/wp-content/uploads/2009/11/fingerprint.gif" alt="Fingerprint" width="213" height="168" />This is the third and last of a series of blog posts (see <a href="http://blogs.semantico.com/discovery-blog/2009/11/the-challenge-of-online-identity/">part one</a> and <a href="http://blogs.semantico.com/discovery-blog/2009/11/the-challenge-of-online-identity-part-2/">part two</a>) in which I set out to examine the current state of identity management in our industry and where it’s going. The real point of this series has been to answer the question (which will be familiar to any parent of children who drives) ‘<strong>Are we there yet?</strong>’ – the destination in this case being not Legoland, but a much-discussed concept in our industry, Online Identity 2.0.<span id="more-973"></span></p>
<h1>Are we there yet?</h1>
<p>Along the way I’ve surveyed the current landscape and looked at the multi-dimensional influence of Web 2.0. The journey has highlighted tensions caused by Web 2.0, and the new software models of the API-driven web, and the pressure it puts on existing models of identity management which, as I have attempted to show, struggle to cope with the complexity of this new universe.</p>
<p>In this post I want to delve a little deeper into the privacy and security implications of that environment and to look forward to the semantic web, before making some recommendations for how I think identity management needs to develop in order to get us to our goal.</p>
<h1>Lock-in, ownership &amp; control</h1>
<p>The providers of social network services have been quick to understand the potential of being online identity hubs for their users. This is a natural function of their prime aim of driving up usage; being an identity provider is just one more service that Google or Yahoo can deliver and one which keeps them firmly at the centre of our online worlds.</p>
<p>As we move more of our identity information online it becomes potentially much easier to move that information around. With that ease of moving data around, concern increases that our personal data could be passed on to third parties. At best it might then be used to spam us, or be sold to our competitors. At worst, we might loose control over ownership of our online identity altogether, and it could be used for fraudulent purposes. These data protection issues are furthermore set in a global context where national legal frameworks may no longer make sense.</p>
<h2>Who to trust</h2>
<p>In this federated environment of identity providers there are a number of important questions that must be addressed, including:</p>
<ol>
<li>How much do we trust these identity providers with our personal details?</li>
<li>Who audits services such as Facebook or LinkedIn to ensure security issues are addressed?</li>
<li>How much to we trust the downstream sites using these identity services with our personal details?</li>
</ol>
<p>With these questions in mind, it is possible to imagine an ‘identity supply chain’ where different entities within the chain only know the smallest parts of a given identity that are needed to perform their function. For example, I could log in to a website without the website itself knowing my password. Similarly I could order goods without disclosing my full identity to the shipping agent, and I could leave commentary on a blog without the blog system needing to know my postal address.</p>
<h1>The semantic web</h1>
<p>In the web of <a href="http://linkeddata.org/">linked data</a>, identity is centrally important for determining trust, provenance and authenticity. Understanding who made a particular assertion is essential within scientific communication, for example, which is necessarily a continuous debate. In such a discourse, degrees of trust and certainty are necessarily important in evaluating and combining facts from different sources. Identity is needed for:</p>
<ul>
<li><strong>Annotation</strong> &#8211; making commentary and building discussion around facts and data</li>
<li><strong>Augmentation</strong> &#8211; adding new data and assertions to existing data sets based on new evidence and experiments</li>
<li><strong>Refutation</strong> &#8211; allowing statements to be contradicted according to new evidence or new interpretation of existing evidence, based on different degrees of trust within a system</li>
</ul>
<p>All of the above types of communication are quite possible using the linked data semantic web model we have today. For example, I am free to publish any kind of statements I like which refer to other statements. However in doing this it is critical that the notion of my identity is preserved in relation to the statements I make, just as its is critical that the identity of the author of the original statements is clear.</p>
<h2>Wikipedia or Schizopedia?</h2>
<p>The huge success of Wikipedia has shown that collaboration and openness combined with low cost of usage can combine to produce true value. The centralised model of Wikipedia allows the tracking of individual edits but not necessarily on a named or identified user basis. Essentially, the decreased end user cost of building the information resource has been traded against a consequent lack of traceability and provenance.</p>
<p>Compare this with the semantic web linked data model. Here no centralisation is expected (or even possible). I can publish some facts, you can comment, agree, refute or augment these facts, and by publishing your assertions in the linked data cloud you can also join the conversation. So can many other people, with many other agreements, contradictions and additional observations. Here the conversation could start to resemble schizophrenia, with many voices talking at once. Without a solid notion of identity and provenance it is impossible to build a consistent and coherent model of the facts.</p>
<h1>New licence models</h1>
<p>Many publishers have traditionally licensed their intellectual property to third parties in the form of data sets. This could be, for example, to provide language translation devices to language students, or alternatively to provide an abstract service to complement the primary source materials in a particular discipline. These licence deals inevitably involve the simple transfer of the published content as a set of static files from licensor to licensee.</p>
<p>However, as the service offered by publishers matures to include APIs to their data and services instead of simple file transfer, the need to address identity issues arises. Typical identity issues include:</p>
<ul>
<li>The need to ensure the API is used only by the licensee</li>
<li>The need to record usage metrics by each licensee</li>
</ul>
<p>Additional issues may include the need to track the licensee’s individual users and their usage patterns. Furthermore if the API delivers services to the licensee’s end users beyond simple search and content retrieval, then it may be necessary to exchange authentication and identity information about the end user of the site.</p>
<h1>Conclusions</h1>
<p>Drawing all the threads together that I have explored in this and my previous posts on Identity Management I have come up with the following conclusions.</p>
<p>The answer the question ‘are we there yet?’ is of course no, not yet. The more interesting question is whether we are even on the right road to get there. In order for us to reach our destination, a rational and usable system for managing identities on the web, the following needs to happen in my view.</p>
<p>Publishers and information professionals need to collaborate to design an identity framework what meets the needs of all stakeholders including contributors, researchers and institutions. This framework should be built on existing open standards such as OpenID and DOI but <strong>must not sacrifice usability</strong>. The solution must be built on an organisational infrastructure which is credible and can be trusted across the industry, and should ideally be based on <strong>open source software</strong> which can be independently audited for security concerns by any interested party.</p>
<p>The starting point for such a collaboration may already be in place with the recently announced <a href="http://orcid.securesites.net/">ORCID</a> (Open Researcher and Contributor ID) initiative. I for one will be watching this project closely over the coming months and I look forward to their developments and progress.</p>
<p>And in the meantime … keep quiet you kids there in the back!</p>
]]></content:encoded>
			<wfw:commentRss>http://blogs.semantico.com/discovery-blog/2010/02/the-challenge-of-online-identity-part-3/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>What Apple’s iPad and iBookstore mean for publishers</title>
		<link>http://blogs.semantico.com/discovery-blog/2010/01/what-apple%e2%80%99s-ipad-and-ibookstore-mean-for-publishers/</link>
		<comments>http://blogs.semantico.com/discovery-blog/2010/01/what-apple%e2%80%99s-ipad-and-ibookstore-mean-for-publishers/#comments</comments>
		<pubDate>Wed, 27 Jan 2010 21:34:47 +0000</pubDate>
		<dc:creator>Richard Padley</dc:creator>
				<category><![CDATA[E-books]]></category>
		<category><![CDATA[Online Publishing Market]]></category>
		<category><![CDATA[Publishing business models]]></category>

		<guid isPermaLink="false">http://blogs.semantico.com/discovery-blog/?p=1078</guid>
		<description><![CDATA[Publishing techies like myself have been waiting for a long time for Apple to launch their tablet device. With the accelerating interest in eBooks, and the ignition of the eBook marketplace with devices such as the Kindle and Sony reader, I’ve been keen to see how Apple’s entry into the tablet marketplace will change the [...]]]></description>
			<content:encoded><![CDATA[<p><img class="alignright size-full wp-image-1083" title="iBook reader app" src="http://blogs.semantico.com/discovery-blog/wp-content/uploads/2010/01/ibooks_20100127.jpg" alt="iBook reader app" width="249" height="246" />Publishing techies like myself have been waiting for a long time for Apple to launch their tablet device. With the accelerating interest in eBooks, and the ignition of the eBook marketplace with devices such as the Kindle and Sony reader, I’ve been keen to see how Apple’s entry into the tablet marketplace will change the landscape. And the conclusion I’ve come to is that Apple stand a good chance of stealing the consumer eBook show.<span id="more-1078"></span></p>
<h1>The iPad</h1>
<p>Apple have approached the challenge of creating a tablet computer from the opposite direction to most established devices on the market. Instead of stripping down a laptop or netbook they have chosen to create a device which is essentially a scaled up iPhone. The iPad runs the iPhone OS, and consequently runs all of the 140,000+ currently available iPhone applications. The user experience is very similar to the iPhone, albeit with all of the advantages of a much larger screen, faster processor and more memory.</p>
<h1>The iBookstore</h1>
<p>OK, lets cut straight to the chase. For publishers the iBookstore is the iPad’s main event. Apple have signed deals with Penguin, HarperCollins, Simon &amp; Schuster, Macmillan and Hachette to seed the iBookstore with content in the EPUB format. Apple are planning to offer 70% of the sales revenue to publishers, following closely the model already in place for application developers in their app store. This compares very favourably to the cut of up to 50% which Amazon take from titles sold on the Kindle.</p>
<p>The business model is a one-time purchase of the content which will be downloaded to the user&#8217;s iPad device. The iBookstore uses the EPUB format, which allows the iBook reader application to reflow the text according to screen orientation and to change fonts and font sizes.</p>
<p>The iBookstore will be the only way of purchasing eBook content to use with the built in iBook application, although other applications, such as Stanza, will be free to compete with Apple’s business model. If they can.</p>
<h1>The DRM issue</h1>
<p>Digital rights management will not be needed with the iPad because content, once downloaded from the iBookstore, cannot be copied from one device to another. This is because Apple have followed the iPhone model in making the iPad an essentially closed, locked device. There is no user access to the underlying file system, and moreover only books purchased from the iBookstore can be read with the built in iBook reader application.</p>
<h1>A device I’d give my grandmother</h1>
<p>Apple know a thing or two about usability. This device neatly fills the gap between the iPhone and a MacBook. It is essentially a laptop for those who don’t want a laptop or a smartphone. The iPhone OS is so simple and intuitive to use that it would really be suitable for even the most computer phobic. And this weighs heavily in my thinking that it will win out as a consumer eBook device.</p>
<p>Although at $499 it is twice the cost of the Kindle or Sony readers (for an entry level model) the extra cost is extremely easy to understand. Colour, smooth graphics and the availability of 140,000 applications make it clear this is much more than a single function device. In this respect it simply can’t be compared to the Kindle or Sony reader.</p>
<h1>Conclusion</h1>
<p>In many ways Apple are attempting to replicate the situation which led to the huge success of the iPod/iTunes combination in disrupting the offline music sales and distribution industry nine years ago. By providing sexy, easy to use commoditised hardware at the right price point, combined with a single simple way to purchase content, they stand an extremely good chance of disrupting the established eBook readers. Furthermore the sheer appeal of the device can only fuel the further growth of the eBook marketplace.</p>
]]></content:encoded>
			<wfw:commentRss>http://blogs.semantico.com/discovery-blog/2010/01/what-apple%e2%80%99s-ipad-and-ibookstore-mean-for-publishers/feed/</wfw:commentRss>
		<slash:comments>8</slash:comments>
		</item>
		<item>
		<title>&#8216;Innovation from product to production&#8217; at the STM E-Production Seminar</title>
		<link>http://blogs.semantico.com/discovery-blog/2010/01/innovation-from-product-to-production-at-the-stm-e-production-seminar/</link>
		<comments>http://blogs.semantico.com/discovery-blog/2010/01/innovation-from-product-to-production-at-the-stm-e-production-seminar/#comments</comments>
		<pubDate>Wed, 13 Jan 2010 13:12:25 +0000</pubDate>
		<dc:creator>Richard Padley</dc:creator>
				<category><![CDATA[Access and identity management]]></category>
		<category><![CDATA[E-books]]></category>
		<category><![CDATA[Information Architecture]]></category>
		<category><![CDATA[Project Management]]></category>
		<category><![CDATA[Publishing business models]]></category>
		<category><![CDATA[Search]]></category>
		<category><![CDATA[Technology]]></category>

		<guid isPermaLink="false">http://blogs.semantico.com/discovery-blog/?p=990</guid>
		<description><![CDATA[Written and delivered in partnership with Andrea Powell from CABI, this presentation is a case study of lessons drawn from the CAB Direct project, and highlights issues which are relevant across the board for publishers delivering online content. This includes looking at how to maximise value in the design of taxonomies and coding systems, how designing and [...]]]></description>
			<content:encoded><![CDATA[<p>Written and delivered in partnership with Andrea Powell from <a title="CABI" href="http://www.cabi.org/" target="_blank">CABI</a>, this presentation is a case study of lessons drawn from the <a href="http://cabdirect.org/">CAB Direct</a> project, and highlights issues which are relevant across the board for publishers delivering online content. This includes looking at how to maximise value in the design of taxonomies and coding systems, how designing and improving user experience on the product side can lead to more stringent data quality requirements and some design strategies to minimise ongoing operational costs when designing data transfer workflows between systems. We also look at innovation in the design of machine level API interfaces.<br />
<script type="text/javascript">// < ![CDATA[
 function openwindow() {  window.open("http://river-valley.tv/media/conferences/stm-eproduction-2009/0102-Richard_Padley", "mywindow", "menubar=1, resizable=1, width=920, height=509"); }
// ]]&gt;</script><br />
<a href="javascript: openwindow()">You can watch</a> the full presentation (45 <abbr title="minutes">mins</abbr>) given to the STM E-Production Seminar on 3rd December in Kensington London. Please note that the video will be displayed in a new window.</p>
<p><noscript>You also need to have JavaScript enabled in your browser to view the video.</noscript></p>
<p>More on this excellent seminar can be found at <a title="STM E-Production Seminars" href="http://www.stm-assoc.org/event_presentations.php?event_id=18" target="_blank">The International Association of Scientific Technical and Medical Publishers</a> website.</p>
<p>Video by <a title="River Valley TV" href="http://river-valley.tv/" target="_blank">River Valley TV</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://blogs.semantico.com/discovery-blog/2010/01/innovation-from-product-to-production-at-the-stm-e-production-seminar/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>The Challenge of Online Identity: Part 2</title>
		<link>http://blogs.semantico.com/discovery-blog/2009/11/the-challenge-of-online-identity-part-2/</link>
		<comments>http://blogs.semantico.com/discovery-blog/2009/11/the-challenge-of-online-identity-part-2/#comments</comments>
		<pubDate>Wed, 18 Nov 2009 16:14:30 +0000</pubDate>
		<dc:creator>Richard Padley</dc:creator>
				<category><![CDATA[Access and identity management]]></category>
		<category><![CDATA[Online Identity 2.0]]></category>
		<category><![CDATA[Online Publishing Market]]></category>
		<category><![CDATA[Publishing business models]]></category>
		<category><![CDATA[Technology]]></category>

		<guid isPermaLink="false">http://blogs.semantico.com/discovery-blog/?p=764</guid>
		<description><![CDATA[The ability to personalise and customise websites is taken for granted in the Web 2.0 world. But new user expectations provide problems for identity management.]]></description>
			<content:encoded><![CDATA[<p><img class="alignright size-full wp-image-723" title="Fingerprint" src="http://blogs.semantico.com/discovery-blog/wp-content/uploads/2009/11/fingerprint.gif" alt="Fingerprint" width="213" height="168" />In <a href="http://blogs.semantico.com/discovery-blog/2009/11/the-challenge-of-online-identity/" target="_self">part one</a> of this series of three posts I attempted to describe the authentication and identity management environment that currently exists within the information industry. Next I’d like to look a bit more closely at the areas of <strong>personalisation</strong>, <strong>usage metrics</strong> and <strong>usability.</strong> These are all areas that hold significant challenges for online identity; issues which have particular bearing on the Web 2.0 services we all accept now as an established feature of mainstream internet use.<span id="more-764"></span></p>
<h2>Personalisation</h2>
<p>The ability to personalise and customise web sites is taken for granted in the Web 2.0 world. We are now used to logging on to services such as Twitter, Facebook and Google and being delivered into environments that we can customise and personalise to our own needs.</p>
<p>The expectations of users within this new world provide problems for identity management, which, as we saw in my last post, has to use an infrastructure that was never designed to cope with such user requirements. The danger is that too inflexible a system might act as a brake, or even a deterrent.</p>
<p>Flexible identity management solutions are critical to the uptake of personalisation services because they lower the barriers to access to new services. Commenting on a blog post must be made as simple as possible, yet in order to preserve my online identity in any comments I make it is essential that I can authenticate myself. The barrier here in many cases is that I don’t want to register yet another username and password with each blog site I visit in order to identify myself. One solution to this is the <a href="http://openid.net/">OpenID</a> protocol which allows me to log into participating third party sites using the identity that I’ve already established using an identity provider such as Google, Yahoo or Flickr. Few publishers have so far adopted the OpenID standard; notably Reed-Elsevier have set up an OpenID identity provider, <a href="http://idkee.nl/" target="_self">IDkee</a>; let&#8217;s hope OpenID authentication is soon to be implemented for their flagship site Science Direct.</p>
<p>Many current publisher sites provide personalisation features but these functions tend to exist exclusively within the publisher’s own silo. If I store bookmarks or reading lists within one publisher silo, there is no way to share these with my social contacts across the web. If I contribute to a discussion within one publisher platform, that discussion tends to stay behind the paywall of the silo. Why can&#8217;t I call up a view that shows my comments across all the publisher sites I use on the web?</p>
<h2>Usage metrics</h2>
<p>If the multiple identities we all have to maintain within the Web 2.0 world provide problems for users trying to access services, they also cause snarl-ups at the back-end of the process – with the tracking of usage by providers of those services.</p>
<p>Usage metrics are important for publishers. Mainstream web analytics software and techniques tend to focus on general trends and usage patterns because it is not straightforward to reliably identify individual users, but for publishers, identifying institutional users and correctly accounting for their usage is critical in delivering industry reports compliant with the <a href="http://www.projectcounter.org/">COUNTER</a> standard.</p>
<p>However, the COUNTER model does not even begin to address the complexities that can arise when multiple identities are in play. For example, I might need to express three different levels of identity at the same time:</p>
<ol>
<li>My own personal account, for content I have purchased for my own private use.</li>
<li>My department’s subscription to an information resource (not shared with the rest of the institution)</li>
<li>My institution’s identity and its subscription rights to an online resource.</li>
</ol>
<p>In this scenario, when I access a piece of content online in a publisher platform, the access could be accounted for at any one of the three levels above. The COUNTER standards do not currently cover this type of compound identity problem.</p>
<p>The semantic web also presents a serious challenge for usage metrics. Once facts are published in the linked data cloud, measurement of usage becomes impossible: those facts could reside in any system or platform and are no longer under the control of one publisher.</p>
<p>Usage metrics are important for publishers not only as an accounting tool – they also help publishers to improve the usability of their services. And usability is another critical issue bearing on access management.</p>
<p>The <a href="http://shibboleth.internet2.edu/">Shibboleth</a> protocol was set up as a way of making things easier for users accessing services, and to streamline this question of multiple identities somewhat. However, Shibboleth itself has turned out to have some usability problems.</p>
<h2>Usability issues</h2>
<p>If Wikipedia required users to login with a personal username and password how much would it affect usage?</p>
<p>This question aside, the federated nature of the Shibboleth single-sign-on system means that in order to log in users first have to confirm where they come from, in order that they can be directed to the correct identity provider. This problem is further compounded by the large number of identity providers in the Shibboleth system (740 in the UK Federation alone). Asking a user to select from a list of 740 places is clearly a significant barrier to usability.</p>
<p>OpenID based systems don’t have the problem of large federations, since anyone, potentially, can set up an identity provider service. The usability challenge here is to pick the simplest list of providers and give the user an alternative way of entering a full OpenID URL if their provider is not on the list. Again, another usability challenge most users would rather not face.</p>
<p>Finally, and not to be forgotten, is the thorny problem of logging users out of a service. In a single-sign-on environment the <a href="https://spaces.internet2.edu/display/SHIB2/SLOIssues">problems with single log-out</a> are surprisingly complex. It requires coordination between all the sites a user has visited and, combined with the difficulty in educating users that logging out is even necessary, presents another usability challenge.</p>
<h2>Conclusion</h2>
<p>Identity management plays a key role in the Web 2.0 world and this being so, Web 2.0 tools need to step up to the privacy and security challenges raised by the new software models of the API-driven web. In particular, thought needs to be given to these key issues of personalisation, usage metrics and usability.</p>
<p>I have one further post to deliver on the subject of online identity. This will consider the question of how close we are in reality to delivering on what has been called Online Identity 2.0. And I&#8217;ll also be examining a further trio of key issues key for the semantic web: provenance, trust and authenticity.</p>
]]></content:encoded>
			<wfw:commentRss>http://blogs.semantico.com/discovery-blog/2009/11/the-challenge-of-online-identity-part-2/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>The Challenge of Online Identity</title>
		<link>http://blogs.semantico.com/discovery-blog/2009/11/the-challenge-of-online-identity/</link>
		<comments>http://blogs.semantico.com/discovery-blog/2009/11/the-challenge-of-online-identity/#comments</comments>
		<pubDate>Fri, 06 Nov 2009 10:08:57 +0000</pubDate>
		<dc:creator>Richard Padley</dc:creator>
				<category><![CDATA[Access and identity management]]></category>
		<category><![CDATA[Online Identity 2.0]]></category>
		<category><![CDATA[Online Publishing Market]]></category>
		<category><![CDATA[Publishing business models]]></category>

		<guid isPermaLink="false">http://blogs.semantico.com/discovery-blog/?p=716</guid>
		<description><![CDATA[Publishers and information providers are in danger of cultivating a blind spot to one of the key issues currently inhibiting the growth of online information services: identity management.
The web as it exists today suffers from the lack of a consistent way of managing identity. There are challenges in both identifying myself to the sites I [...]]]></description>
			<content:encoded><![CDATA[<p><img class="alignright size-full wp-image-723" title="Fingerprint" src="http://blogs.semantico.com/discovery-blog/wp-content/uploads/2009/11/fingerprint.gif" alt="Fingerprint" width="213" height="168" />Publishers and information providers are in danger of cultivating a blind spot to one of the key issues currently inhibiting the growth of online information services: identity management.</p>
<p>The web as it exists today suffers from the lack of a consistent way of managing identity. There are challenges in both identifying myself to the sites I visit and in identifying those sites myself. Without any standard mechanism to deal with this, web developers have devised an array of different and incompatible schemes to manage identity. This presents serious challenges, since authenticity and trust are critically important concerns for publishers and information providers.</p>
<p><span id="more-716"></span>In the beginning, the internet was a place of implicit trust. Networks were small, and users were trusted with not abusing services. During the rapid growth of the network, many security and identity problems with the underlying network protocols and subsystems were discovered and solved. However the web itself has followed a different path of evolution.</p>
<h2>Hypertext and identity</h2>
<p>Early hypertext systems were firmly based on the idea that all users would need to be able to read and write documents. The models built into the first systems, such as <a href="http://www.xanadu.net/">Xanadu</a>, allowed users to compose their own documents and to annotate documents authored by others. This functionality naturally required users to authenticate and identify themselves.</p>
<p>However, when Tim Berners-Lee invented the web in the late 1980s the distributed authentication services to support the read/write web did not yet exist. He simplified the original hypertext vision in order to make it easier to implement by removing the composition and annotation elements. This in turn removed the need for identity management systems which significantly reduced the complexity of the software needed. This simplification allowed the web to grow quickly at the cost of initially making it into an essentially read only environment for most users.</p>
<h2>Visitors and residents</h2>
<p>Recent JISC research into how people use online services has started to focus on the distinction between <a href="http://tallblog.conted.ox.ac.uk/index.php/2008/07/23/not-natives-immigrants-but-visitors-residents/">‘visitors’ and ‘residents’</a> (proposed as a more useful replacement for the previous talk of ‘natives’ and ‘immigrants’). Those who use the web as a tool for specific tasks can be seen as visitors whereas those who spend a large amount of time online can be categorised as residents. Residents clearly have a strong need to maintain their online identity, since their presence on the web is an essential part of their overall social interactions. Visitors still need to manage identity, though their needs are more in the context of being able to authenticate themselves and the sites they visit.</p>
<p>This distinction between visitors and residents is in reality more of a spectrum of different behaviours, but it’s critical that publishers understand the common needs for identity and authentication services. As the participative read/write web returns with the move towards Web 2.0 services the traditional roles within publishing of author and reader are disrupted, and again this is exactly the point where online identity, and trust in that identity, is centrally important.</p>
<h2>Publisher platforms</h2>
<p>Current publisher platforms provide a good example of the patchwork of different services and approaches to authentication and identity management developed over the past decade. Most feature a combination of the following approaches:</p>
<ol>
<li>IP authentication to identify a user’s home institution. This works well for on-campus users as it allows invisible authentication and thus provides the lowest possible barrier to usage. However this scheme is useless for remote users.</li>
<li>Federated authentication to identify an individual user within an institution. This covers both the historical Athens protocol used within the UK and the Shibboleth protocol now gaining significant traction in the UK and America. This works equally well for on and off campus users at the cost of a sometimes difficult user experience.</li>
<li>Other username/password schemes. Individual identity outside of the institutional context is invariably handled by requiring users to create their own account within a publisher’s system.</li>
<li>Other remote access schemes. Schemes such as <a href="http://www.oclc.org/ezproxy/">EZproxy</a> or referrer authentication allow remote users to identify their institution to a publisher platform by first logging into their institutional home page. Again, usability is compromised by having to go through the institutional portal to gain access to the resource.</li>
<li>Personalisation log-in. In order to uniquely identify individual users within an organisation it has often been necessary to have a second level of log-in within a publisher platform. This requires users to create and maintain yet another identity within the publisher platform.</li>
</ol>
<p>These approaches cannot usually be combined or aggregated, which acts as a further barrier to usability and usage. For instance, to access individual purchases it may be necessary to log-out of the institutional account and log-in again as an individual user.</p>
<p>Publishers are increasingly looking for business models which allow them to combine different levels of institutional access rights (e.g. school, department and college) with individual access rights. The existing infrastructure is in many cases not able to support these new models and so new solutions are needed.</p>
<h2>Contributor identity</h2>
<p>As the distinction between reader and author becomes increasingly fluid both parties need to be able to manage their online identity to interact with the read/write web. The <a href="http://www.crossref.org/CrossTech/2009/02/an_interview_about_author_ids.html">CrossRef ContributorID</a> project is focused on providing an identity framework designed to address the issues around knowledge discovery and authentication for the scholarly publishing community.</p>
<h2>Conclusion</h2>
<p>I’ve attempted to provide a survey of the existing landscape here, but where Online Identity is going next is a much bigger question – not to mention our ideas for where it should end up! Next month I want to discuss the influence of trends associated with Web 2.0 and how they are affecting the field, as we move towards Online Identity 2.0.</p>
]]></content:encoded>
			<wfw:commentRss>http://blogs.semantico.com/discovery-blog/2009/11/the-challenge-of-online-identity/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
	</channel>
</rss>
