
GitHub - SeerLabs/CiteSeerX: CiteSeerX public repository

Languages:
  1. HTML 52.0%
  2. Java 29.2%
  3. JavaScript 9.6%
  4. Python 4.7%
  5. Perl 3.2%
  6. CSS 0.7%
  7. Other 0.6%
Branch: master


Latest commit: 49ecb50, Nov 25, 2019
fanchyna: Merge pull request #79 from SeerLabs/CiteSeerX_grobid_handling (Cite seer x grobid handling)


Name            Latest commit message                                               Commit time
bin             Create                                                              Dec 9, 2016
bootstrap       Initial GitHub import                                               Jul 11, 2013
conf            Add repositoryService property to citeseerx object                  Aug 17, 2016
crawler         doc update cxm                                                      Aug 19, 2015
install         Initial GitHub import                                               Jul 11, 2013
lib             merging master                                                      Feb 2, 2016
repository_api  repository_api/ Douglas' production version                         Aug 24, 2016
resources       solr schema: add field vtime back                                   Jun 10, 2015
src             enhancements related to grobid handling                             Nov 25, 2019
web             Changed Line 15 - Commented close tag for c:if                      Mar 13, 2018
.gitignore      importing PDFs in WARC files                                        Jun 26, 2019
LICENSE.txt     Create LICENSE.txt                                                  Jan 5, 2018
(not shown)     Update                                                              Jul 11, 2013
build.xml       add includeantruntime to build.xml,increase download limit to 1000  Apr 2, 2014


This is the source code for the CiteSeerX academic digital library.

If you are interested in contributing to CiteSeerX, please fork this repository and submit your changes as pull requests.

The code in the master branch should always be in a production-ready state so that it can be deployed at any time. Experimental code and code still in development should live in separate branches and be merged into the master branch via pull requests.
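
The fork-and-branch workflow described above might look roughly like the following sketch. It is not an official project script: the fork owner (`your-username`), the branch name (`my-feature`), the commit message, and the `run` and `contribute_workflow` helpers are all hypothetical placeholders. By default the script only prints the git commands; setting `DRY_RUN=0` would execute them.

```shell
#!/bin/sh
# Sketch of a fork-and-pull-request workflow (assumed, not taken from the project).

# Print the command when DRY_RUN is 1 (the default); otherwise execute it.
run() {
    if [ "${DRY_RUN:-1}" = "1" ]; then
        echo "$*"
    else
        "$@"
    fi
}

contribute_workflow() {
    # Hypothetical placeholders: your GitHub account and a topic branch name.
    fork_user="${FORK_USER:-your-username}"
    branch="${BRANCH:-my-feature}"

    # Clone your fork of SeerLabs/CiteSeerX (create the fork on GitHub first).
    run git clone "https://github.com/${fork_user}/CiteSeerX.git"
    run cd CiteSeerX

    # Keep development work off master: start a separate branch from it.
    run git checkout -b "$branch" master

    # ... edit files here, then commit and publish the branch to your fork.
    run git commit -am "Describe your change"
    run git push -u origin "$branch"
}

contribute_workflow
```

After the branch is pushed, the pull request against the master branch of SeerLabs/CiteSeerX is opened from the GitHub web interface.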
