OCLC, formerly known as the Online Computer Library Center, is a US-based "nonprofit, membership, computer library service and research organization dedicated to the public purposes of furthering access to the world's information and reducing information costs".
Founded in 1967, OCLC is a worldwide library cooperative, owned, governed and sustained by its members. OCLC serves 17,900 member institutions in 123 countries and territories. OCLC provides economical services to libraries to help manage their collections and services in a cost-effective way that scales.
OCLC's mission is to
establish, maintain and operate a computerized library network and to promote the evolution of library use, of libraries themselves and of librarianship, and to provide processes and products for the benefit of library users and libraries, including such objectives as increasing availability of library resources to individual library patrons and reducing the rate-of-rise of library per-unit costs, all for the fundamental public purpose of furthering ease of access to and use of the ever-expanding body of worldwide scientific, literary and educational knowledge and information.
OCLC also conducts research for the library community, and makes its research outcomes known through various publications. The organization advocates for "advancing research, scholarship, education, community development, information access, and global cooperation".
The OCLC network links members to its online infrastructure providing intelligent databases and a cooperative platform to collectively innovate and drive efficiency in metadata creation, interlibrary loan, digitization, discovery and delivery. OCLC provides bibliographic, abstract and full-text information to anyone.
OCLC and its member libraries cooperatively produce and maintain WorldCat—the OCLC Online Union Catalog, the largest international online public access catalog (OPAC) in the world. WorldCat has holding records from public and private libraries worldwide. Over half of the items in that catalog are non-English and over half are non-book.
Areas for collaboration with Wikipedia
Full text reliable sources
OCLC will publish an API in fall 2013 to connect editors to electronic full text available online through affiliated libraries.[needs update] This fulfillment service sends a query via the API to OCLC, which processes the query, affiliates the editor with a library, and checks WorldCat to see if the requested citation is available within the library's collection. If there is a match, the API returns a link to Wikipedia that either connects directly to the requested full text or to the library's OpenURL resolver. Simply put, OCLC can deliver full text sources directly to editors, but only if their IP address already gives them access to the source under existing contracts. For open access full-text sources, these can be displayed for anyone who finds them on the Web.
The WorldCat Search API can help editors to 1) check to see is a book cited in an article is available in a local library, and 2) conduct broader research within WorldCat to locate further resources from nearby institutions. This production service is currently used within Wikipedia's Book sources page among other resources. Simply put, OCLC can allow editors to search the library catalogues of nearby institutions without leaving their computer or even logging into a separate website.
The WorldCat Search API can also be used for editors to discover collections that libraries have digitized and registered in WorldCat. These e-collections can be searched to find and access original records which may be useful as primary or secondary sources, and also to identify materials such as images that may be copyright compatible to add to an article or image collection.
WorldCat already serves several institutions that are active in the Wikipedia Loves Libraries (WLL) program. By working with OCLC, we can reach out to these institutions to provide greater access for Wikipedia editors and also try to bring in many more OCLC members as WLL partners.
ISBN lookup tool
OCLC provides support to Wikimedia projects for looking up ISBNs in citations.
Mission alignment and mutual benefit
OCLC and Wikipedia are both non-profit organizations that seek the distribution of knowledge to humanity. Wikipedia targets individuals through articles, while OCLC targets libraries and those who would benefit from better research. As an encyclopedia bedrocked on reliable sources, access to the most comprehensive and up-to-date information on articles, books, and digital collections would be a powerful tool. Meanwhile, OCLC seeks to bring as many library institutions on board into its services and community, and sharing those institutions' collections with Wikipedia editors or readers amplifies the reach of each member library. Libraries have the mission of sharing their collections with as many people as possible; Wikipedia is where the majority of the world's readers are getting their information.
OCLC has numerous relationships with publishers. These publishers are exploring access models which are more open and which we have leveraged in The Wikipedia Library's account donations several times previously. OCLC may be able to introduce us to a magnitude more such partners and help us to form relationships with them.
We also already have some nice connections with OCLC through their Wikipedian-in-Residence Max Klein, who has been working with Merrilee Proffitt at OCLC.
Data licensing policies and relation with free culture
In November 2008, the Board of Directors of OCLC unilaterally issued a new Policy for Use and Transfer of WorldCat Records that would have required member libraries to include an OCLC policy note on their bibliographic records; the policy caused an uproar among librarian bloggers. The late free culture activist Aaron Swartz criticized OCLC's "attempt to monopolize library records" while "charg[ing] outrageous amounts" for its services. According to Swartz, OCLC also "played hardball" against Open Library (an alternative universal book index he was developing together with the Internet Archive), by "trying to cut off our funding, hurt our reputation, and pressur[ing] libraries not to cooperate". Within a few months, the library community had forced OCLC to retract its policy and to create a Review Board to consult with member libraries more transparently. In 2011, OCLC and Open Library collaborated to add OCLC Control Numbers to Open Library records. In August 2012, OCLC recommended that member libraries adopt the Open Data Commons Attribution (ODC-BY) license when sharing library catalog data, although some member libraries have explicit agreements with OCLC that they can publish catalog data using the CC0 Public Domain Dedication. Beginning in 2017, OCLC and the Internet Archive have collaborated to make the Internet Archive's records of digitized books available in WorldCat.
Wikipedia editors' private information must be fully respected and never used/held without consent. APIs can match up IP addresses with institutions; this address information is tightly controlled on Wikipedia and any sharing of information with a third party such as a library or university would have to be fully disclosed and opt-in only. OCLC has agreed to build in whatever privacy protections we require. Hosting the API on Wikimedia Cloud VPS could increasingly permit IP information to be used without ever being disclosed or visible to anyone at Cloud VPS, even administrators, as Cloud VPS builds in that capability. Ideologically, OCLC comes from the vigilantly privacy-conscious library field and brings the same ethic to protecting privacy as Wikipedia does. One precedent in this area is the Forward to libraries navigation box which John Mark Ockerbloom (User:JohnMarkOckerbloom) set up. This feature dealt with any related WMF and Cloud Services privacy issues—collaboration OCLC would not present any more significant challenges than that.
Any collaboration with Wikipedia has to respect Wikipedia's tremendous organizational reputation. Partnerships, even informal ones, cannot detract from that in any way. OCLC has offered to provide services informally, non-exclusively, free of charge, and without any branding whatsoever. What that might look like is a link on some Wikipedia page that says, "Find a library" or "Full text source". OCLC need not ever be mentioned, and they are fine with that arrangement.
While access to digital and library catalogues would be useful, it's not a use case that is highly valuable or in demand among Wikipedians. The gold standard use case, and what we need to maximize and focus on is the situation where a Wikipedia editor is shown a link to a full text source only when that source is available without any extra authentication. To the extent that OCLC can provide direct full text access where it was otherwise unavailable (or only with difficulty), this collaboration becomes far more useful to us.
Tension with competitors
OCLC is a non-profit organization. However, it competes with for-profit library service providers, and it has not been immune from controversy. In 2010, a lawsuit was brought against OCLC by SkyRiver and Innovative Interfaces that accused OCLC of "monopolistic practices"; the lawsuit was later dismissed without any findings. The appearance of a collaboration with OCLC will have to be taken into account in any work that we do with them going forward. Generally speaking, OCLC, unlike its competitors, is application neutral and content neutral. There is no pay-for-placement in their database and there is no requirement that certain programs be used. OCLC has also recently built more positive relationships with competitors EBSCO Information Services, Gale, and ProQuest—as they serve the same users despite their differing models of non-profit versus for-profit.
All decisions on Wikipedia come from the community and nothing can happen without community support. Work with OCLC will begin in an open phase of discussion to find areas of potential collaboration, then OCLC would demo their services. If what they show us seems useful, we can look into setting up API access on Wikimedia Cloud VPS, or on a Wikipedia-space page. Further integration will require extended community discussions and appropriate consensus.
Parts of this Wikipedia page (those related to next steps) need to be updated. Please update this Wikipedia page to reflect recent events or newly available information. Relevant discussion may be found on the talk page.
Address each of the above concerns fully
Build a demo which can access the APIs
Test data to measure percentage of "gold standard" hits, direct full text source access as a percentage of tested citations
Map access to determine geographic impact
Configure the APIs to maximally respect privacy
Set up access through Wikimedia Cloud VPS
Hold an on-wiki discussion about hosting a page on English Wikipedia which could access the API