This website does readability filtering of other pages. All styles, scripts, forms and ads are stripped. If you want your website excluded or have other feedback, use this form.

Fix Broken Links Web Crawls : Free Web : Free Download, Borrow and Streaming : Internet Archive

Skip to main content Search the history of over 357 billion web pages on the Internet.

Featured
texts All Texts latest This Just In Smithsonian Libraries FEDLINK (US) Genealogy Lincoln Collection Additional Collections Books to Borrow
Top
American Libraries Canadian Libraries Universal Library Community Texts Project Gutenberg Biodiversity Heritage Library Children's Library Open Library Books by Language
Featured
movies All Video latest This Just In Prelinger Archives Democracy Now! Occupy Wall Street TV NSA Clip Library TV News
Top
Animation & Cartoons Arts & Music Community Video Computers & Technology Cultural & Academic Films Ephemeral Films Movies Understanding 9/11 News & Public Affairs Spirituality & Religion Sports Videos Television Videogame Videos Vlogs Youth Media
Featured
audio All Audio latest This Just In Grateful Dead Netlabels Old Time Radio 78 RPMs and Cylinder Recordings Live Music Archive
Top
Audio Books & Poetry Community Audio Computers & Technology Music, Arts & Culture News & Public Affairs Non-English Audio Radio Programs Librivox Free Audiobook Spirituality & Religion Podcasts
Featured
software All Software latest This Just In Old School Emulation MS-DOS Games Historical Software Classic PC Games Software Library Internet Arcade
Top
Community Software MS-DOS Kodi Archive and Support File CD-ROM Software APK CD-ROM Software Library Vintage Software Console Living Room Software Sites Tucows Software Library Shareware CD-ROMs ZX Spectrum DOOM Level CD ZX Spectrum Library: Games CD-ROM Images
Featured
image All Image latest This Just In Flickr Commons Occupy Wall Street Flickr Cover Art USGS Maps Metropolitan Museum
Top
NASA Images Solar System Collection Ames Research Center Brooklyn Museum

Fix Broken Links Web Crawls

These crawls are part of an effort to archive pages as they are created and archive the pages that they refer to. That way, as the pages that are referenced are changed or taken from the web, a link to the version that was live when the page was written will be preserved.

Then the Internet Archive hopes that references to these archived pages will be put in place of a link that would be otherwise be broken, or a companion link to allow people to see what was originally intended by a page's authors.

The goal is to fix all broken links on the web. Crawls of supported "No More 404" sites. MORE share Share
No_Favorite Favorite
edit Edit
time History
ABOUT COLLECTION

Share This Collection


94,131 RESULTS rss


Media Type
4 collections 94,103 web 24 data
Year
6,320 2019 16,285 2018 20,893 2017 26,524 2016 10,741 2015 8,544 2014 More right-solid
Topics & Subjects
94,103 crawldata 44,842 no404 25,265 wordpress 18,823 wikipedia 754 search 1 GDELT More right-solid
Collection
94,131 Fix Broken Links Web Crawls 94,130 Web Crawls 49,261 GDELT 25,289 Wordpress Blogs and the Pages They Link To 18,183 Wikipedia Near Real Time (from IRC) 2,265 Internet Archive Web Crawls More right-solid
Creator
94,103 internet archive SHOW DETAILS up-solid down-solid SORT BY VIEWS TITLE DATE ARCHIVED DATE PUBLISHED DATE REVIEWED CREATOR eye Title Date Archived Creator 632.1M 632M Wikipedia Near Real Time (from IRC) collection 18,180 ITEMS 632.1M VIEWS Sep 23, 2013 09/13 collection
eye 632.1M
This is a collection of web page captures from links added to, or changed on, Wikipedia pages. The idea is to bring a reliability to Wikipedia outlinks so that if the pages referenced by Wikipedia articles are changed, or go away, a reader can permanently find what was originally referred to. This is part of the Internet Archive's attempt to rid the web of broken links .
Topics: Wikipedia, Wikimedia
387.6M 388M GDELT collection 49,251 ITEMS 387.6M VIEWS Aug 27, 2014 08/14 collection
eye 387.6M
A daily crawl of more than 200,000 home pages of news sites, including the pages linked from those home pages. Site list provided by The GDELT Project
Topics: GDELT, News
276.9M 277M Wordpress Blogs and the Pages They Link To collection 25,281 ITEMS 276.9M VIEWS Sep 11, 2013 09/13 collection
eye 276.9M
This is a collection of pages and embedded objects from WordPress blogs and the external pages they link to. Captures of these pages are made on a continuous basis seeded from a feed of new or changed pages hosted by Wordpress.com or by Wordpress pages hosted by sites running a properly configured Jetpack wordpress plugin.
Topics: Wordpress.com, blogs, jetpack
Wordpress Blogs and the Pages They Link To 18M 18M Webwide Crawldata 2018-06-02T10:26:38PDT to 2018-06-02T13:37:35PDT Jun 2, 2018 06/18 by Internet Archive web
eye 18M
favorite 0
comment 0
Internet Archive crawldata from feed-driven WordPress Crawl, captured by wwwb-crawl08.us.archive.org:no404 from Sat Jun 2 10:26:38 PDT 2018 to Sat Jun 2 13:37:35 PDT 2018.
Topics: no404, wordpress, crawldata
Wikipedia Near Real Time (from IRC) 10.4M 10M Webwide Crawldata 2017-05-05T03:05:22PDT to 2017-05-05T20:22:10PDT May 6, 2017 05/17 by Internet Archive web
eye 10.4M
favorite 0
comment 0
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Fri May 5 03:05:22 PDT 2017 to Fri May 5 20:22:10 PDT 2017.
Topics: no404, wikipedia, crawldata
GDELT 3.3M 3.3M Webwide Crawldata 2018-06-25T14:47:13PDT to 2018-06-30T23:44:27PDT Jul 1, 2018 07/18 by Internet Archive web
eye 3.3M
favorite 0
comment 0
Internet Archive crawldata from feed-driven GDELT Crawl, captured by crawl816.us.archive.org:gdelt from Mon Jun 25 14:47:13 PDT 2018 to Sat Jun 30 23:44:27 PDT 2018.
Topic: crawldata
GDELT 1.7M 1.7M Webwide Crawldata 2017-05-06T16:20:29PDT to 2017-05-06T10:25:37PDT May 6, 2017 05/17 by Internet Archive web
eye 1.7M
favorite 0
comment 0
Internet Archive crawldata from feed-driven GDELT Crawl, captured by crawl816.us.archive.org:gdelt from Sat May 6 16:20:29 PDT 2017 to Sat May 6 10:25:37 PDT 2017.
Topic: crawldata
GDELT 1.5M 1.5M Webwide Crawldata 2017-06-10T10:04:42PDT to 2017-06-10T04:04:56PDT Jun 10, 2017 06/17 by Internet Archive web
eye 1.5M
favorite 0
comment 0
Internet Archive crawldata from feed-driven GDELT Crawl, captured by crawl816.us.archive.org:gdelt from Sat Jun 10 10:04:42 PDT 2017 to Sat Jun 10 04:04:56 PDT 2017.
Topic: crawldata
GDELT 1.5M 1.5M Webwide Crawldata 2017-06-10T16:37:34PDT to 2017-06-10T11:06:28PDT Jun 10, 2017 06/17 by Internet Archive web
eye 1.5M
favorite 0
comment 0
Internet Archive crawldata from feed-driven GDELT Crawl, captured by crawl816.us.archive.org:gdelt from Sat Jun 10 16:37:34 PDT 2017 to Sat Jun 10 11:06:28 PDT 2017.
Topic: crawldata
GDELT 1.5M 1.5M Webwide Crawldata 2017-06-11T02:32:54PDT to 2017-06-10T20:55:43PDT Jun 11, 2017 06/17 by Internet Archive web
eye 1.5M
favorite 0
comment 0
Internet Archive crawldata from feed-driven GDELT Crawl, captured by crawl816.us.archive.org:gdelt from Sun Jun 11 02:32:54 PDT 2017 to Sat Jun 10 20:55:43 PDT 2017.
Topic: crawldata
GDELT 1.5M 1.5M Webwide Crawldata 2017-06-10T21:26:19PDT to 2017-06-10T16:13:06PDT Jun 10, 2017 06/17 by Internet Archive web
eye 1.5M
favorite 0
comment 0
Internet Archive crawldata from feed-driven GDELT Crawl, captured by crawl816.us.archive.org:gdelt from Sat Jun 10 21:26:19 PDT 2017 to Sat Jun 10 16:13:06 PDT 2017.
Topic: crawldata
GDELT 1.4M 1.4M Webwide Crawldata 2017-06-10T13:08:14PDT to 2017-06-10T06:59:57PDT Jun 10, 2017 06/17 by Internet Archive web
eye 1.4M
favorite 0
comment 0
Internet Archive crawldata from feed-driven GDELT Crawl, captured by crawl816.us.archive.org:gdelt from Sat Jun 10 13:08:14 PDT 2017 to Sat Jun 10 06:59:57 PDT 2017.
Topic: crawldata
GDELT 1M 1.0M Webwide Crawldata 2017-01-20T14:31:54PST to 2017-01-20T07:48:07PST Jan 25, 2017 01/17 by Internet Archive web
eye 1M
favorite 0
comment 0
Internet Archive crawldata from feed-driven GDELT Crawl, captured by crawl409.us.archive.org:gdelt from Fri Jan 20 14:31:54 PST 2017 to Fri Jan 20 07:48:07 PST 2017.
Topic: crawldata
Wikipedia Near Real Time (from IRC) 977,597 978K Webwide Crawldata 2017-05-18T02:00:07PDT to 2017-05-18T01:34:36PDT May 18, 2017 05/17 by Internet Archive web
eye 977,597
favorite 0
comment 0
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl896.us.archive.org:no404 from Thu May 18 02:00:07 PDT 2017 to Thu May 18 01:34:36 PDT 2017.
Topics: no404, wikipedia, crawldata
GDELT 951,182 951K Webwide Crawldata 2018-09-13T02:07:17PDT to 2018-09-12T20:49:14PDT Sep 13, 2018 09/18 by Internet Archive web
eye 951,182
favorite 0
comment 0
Internet Archive crawldata from feed-driven GDELT Crawl, captured by crawl409.us.archive.org:gdelt from Thu Sep 13 02:07:17 PDT 2018 to Wed Sep 12 20:49:14 PDT 2018.
Topic: crawldata
GDELT 876,010 876K Webwide Crawldata 2018-01-28T18:51:48PST to 2018-01-28T12:11:14PST Jan 28, 2018 01/18 by Internet Archive web
eye 876,010
favorite 0
comment 0
Internet Archive crawldata from feed-driven GDELT Crawl, captured by crawl409.us.archive.org:gdelt from Sun Jan 28 18:51:48 PST 2018 to Sun Jan 28 12:11:14 PST 2018.
Topic: crawldata
GDELT 865,655 866K Webwide Crawldata 2017-09-07T11:27:02PDT to 2017-09-07T06:24:47PDT Sep 7, 2017 09/17 by Internet Archive web
eye 865,655
favorite 0
comment 0
Internet Archive crawldata from feed-driven GDELT Crawl, captured by crawl816.us.archive.org:gdelt from Thu Sep 7 11:27:02 PDT 2017 to Thu Sep 7 06:24:47 PDT 2017.
Topic: crawldata
GDELT 858,605 859K Webwide Crawldata 2018-01-28T19:25:21PST to 2018-01-28T14:31:24PST Jan 28, 2018 01/18 by Internet Archive web
eye 858,605
favorite 0
comment 0
Internet Archive crawldata from feed-driven GDELT Crawl, captured by crawl409.us.archive.org:gdelt from Sun Jan 28 19:25:21 PST 2018 to Sun Jan 28 14:31:24 PST 2018.
Topic: crawldata
Wikipedia Near Real Time (from IRC) 792,793 793K Webwide Crawldata 2018-06-17T16:59:56PDT to 2018-06-17T22:30:01PDT Jun 18, 2018 06/18 by Internet Archive web
eye 792,793
favorite 0
comment 0
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl896.us.archive.org:no404 from Sun Jun 17 16:59:56 PDT 2018 to Sun Jun 17 22:30:01 PDT 2018.
Topics: no404, wikipedia, crawldata
Wikipedia Near Real Time (from IRC) 781,477 781K Webwide Crawldata 2015-06-27T02:35:05PDT to 2015-06-26T21:13:31PDT Jun 28, 2015 06/15 by Internet Archive web
eye 781,477
favorite 0
comment 0
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Sat Jun 27 02:35:05 PDT 2015 to Fri Jun 26 21:13:31 PDT 2015.
Topics: no404, wikipedia, crawldata
Wikipedia Near Real Time (from IRC) 745,553 746K Webwide Crawldata 2018-06-15T15:53:52PDT to 2018-06-16T01:58:14PDT Jul 13, 2018 07/18 by Internet Archive web
eye 745,553
favorite 0
comment 0
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl110.us.archive.org:no404 from Fri Jun 15 15:53:52 PDT 2018 to Sat Jun 16 01:58:14 PDT 2018.
Topics: no404, wikipedia, crawldata
GDELT 702,235 702K Webwide Crawldata 2015-07-16T10:27:47PDT to 2015-07-16T04:43:26PDT Jul 16, 2015 07/15 by Internet Archive web
eye 702,235
favorite 0
comment 0
Internet Archive crawldata from feed-driven GDELT Crawl, captured by crawl816.us.archive.org:gdelt from Thu Jul 16 10:27:47 PDT 2015 to Thu Jul 16 04:43:26 PDT 2015.
Topic: crawldata
Wikipedia Near Real Time (from IRC) 649,633 650K Webwide Crawldata 2014-10-07T09:36:28PDT to 2014-10-07T05:34:58PDT Oct 7, 2014 10/14 by Internet Archive web
eye 649,633
favorite 0
comment 0
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Tue Oct 7 09:36:28 PDT 2014 to Tue Oct 7 05:34:58 PDT 2014.
Topics: no404, wikipedia, crawldata
Wikipedia Near Real Time (from IRC) 645,142 645K Webwide Crawldata 2013-10-30T21:19:56PDT to 2013-10-30T15:58:29PDT Oct 31, 2013 10/13 by Internet Archive web
eye 645,142
favorite 0
comment 0
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Wed Oct 30 21:19:56 PDT 2013 to Wed Oct 30 15:58:29 PDT 2013.
Topics: no404, wikipedia, crawldata
Wordpress Blogs and the Pages They Link To 540,151 540K Webwide Crawldata 2015-10-11T05:26:52PDT to 2015-10-11T08:02:18PDT Oct 11, 2015 10/15 by Internet Archive web
eye 540,151
favorite 0
comment 0
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl344.us.archive.org:no404 from Sun Oct 11 05:26:52 PDT 2015 to Sun Oct 11 08:02:18 PDT 2015.
Topics: no404, wordpress, crawldata
GDELT 508,928 509K Webwide Crawldata 2017-02-01T04:50:38PST to 2017-01-31T21:52:57PST Feb 1, 2017 02/17 by Internet Archive web
eye 508,928
favorite 0
comment 0
Internet Archive crawldata from feed-driven GDELT Crawl, captured by crawl816.us.archive.org:gdelt from Wed Feb 1 04:50:38 PST 2017 to Tue Jan 31 21:52:57 PST 2017.
Topic: crawldata
GDELT 469,565 470K Webwide Crawldata 2018-06-13T07:06:46PDT to 2018-06-13T01:14:17PDT Jun 13, 2018 06/18 by Internet Archive web
eye 469,565
favorite 0
comment 0
Internet Archive crawldata from feed-driven GDELT Crawl, captured by crawl409.us.archive.org:gdelt from Wed Jun 13 07:06:46 PDT 2018 to Wed Jun 13 01:14:17 PDT 2018.
Topic: crawldata
Wikipedia Near Real Time (from IRC) 403,805 404K Webwide Crawldata 2015-02-18T05:03:48PST to 2015-02-17T22:30:10PST Feb 18, 2015 02/15 by Internet Archive web
eye 403,805
favorite 0
comment 0
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Wed Feb 18 05:03:48 PST 2015 to Tue Feb 17 22:30:10 PST 2015.
Topics: no404, wikipedia, crawldata
GDELT 387,834 388K Webwide Crawldata 2018-03-17T09:27:03PDT to 2018-03-17T06:00:27PDT Mar 17, 2018 03/18 by Internet Archive web
eye 387,834
favorite 0
comment 0
Internet Archive crawldata from feed-driven GDELT Crawl, captured by crawl816.us.archive.org:gdelt from Sat Mar 17 09:27:03 PDT 2018 to Sat Mar 17 06:00:27 PDT 2018.
Topic: crawldata
Wordpress Blogs and the Pages They Link To 381,404 381K Webwide Crawldata 2013-09-11T01:48:56PDT to 2013-09-10T19:39:40PDT Sep 13, 2013 09/13 by Internet Archive web
eye 381,404
favorite 0
comment 0
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl458.us.archive.org:no404 from Wed Sep 11 01:48:56 PDT 2013 to Tue Sep 10 19:39:40 PDT 2013.
Topics: no404, wordpress, crawldata
Wikipedia Near Real Time (from IRC) 375,393 375K Webwide Crawldata 2017-06-06T09:58:02PDT to 2017-06-06T05:29:32PDT Jun 7, 2017 06/17 by Internet Archive web
eye 375,393
favorite 0
comment 0
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl896.us.archive.org:no404 from Tue Jun 6 09:58:02 PDT 2017 to Tue Jun 6 05:29:32 PDT 2017.
Topics: no404, wikipedia, crawldata
Wikipedia Near Real Time (from IRC) 363,844 364K Webwide Crawldata 2017-07-12T03:57:55PDT to 2017-07-11T22:18:15PDT Jul 13, 2017 07/17 by Internet Archive web
eye 363,844
favorite 0
comment 0
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl896.us.archive.org:no404 from Wed Jul 12 03:57:55 PDT 2017 to Tue Jul 11 22:18:15 PDT 2017.
Topics: no404, wikipedia, crawldata
Wikipedia Near Real Time (from IRC) 362,774 363K Webwide Crawldata 2013-10-11T18:57:27PDT to 2013-10-11T18:27:42PDT Oct 12, 2013 10/13 by Internet Archive web
eye 362,774
favorite 0
comment 0
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Fri Oct 11 18:57:27 PDT 2013 to Fri Oct 11 18:27:42 PDT 2013.
Topics: no404, wikipedia, crawldata
Wikipedia Near Real Time (from IRC) 358,219 358K Webwide Crawldata 2013-11-04T01:08:35PST to 2013-11-03T18:31:38PST Nov 4, 2013 11/13 by Internet Archive web
eye 358,219
favorite 0
comment 0
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Mon Nov 4 01:08:35 PST 2013 to Sun Nov 3 18:31:38 PST 2013.
Topics: no404, wikipedia, crawldata
Wordpress Blogs and the Pages They Link To 355,318 355K Webwide Crawldata 2013-12-09T02:29:07PST to 2013-12-08T19:51:18PST Dec 9, 2013 12/13 by Internet Archive web
eye 355,318
favorite 0
comment 0
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl344.us.archive.org:no404 from Mon Dec 9 02:29:07 PST 2013 to Sun Dec 8 19:51:18 PST 2013.
Topics: no404, wordpress, crawldata
Wikipedia Near Real Time (from IRC) 339,219 339K Webwide Crawldata 2013-10-12T02:06:46PDT to 2013-10-11T20:57:12PDT Oct 12, 2013 10/13 by Internet Archive web
eye 339,219
favorite 0
comment 0
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Sat Oct 12 02:06:46 PDT 2013 to Fri Oct 11 20:57:12 PDT 2013.
Topics: no404, wikipedia, crawldata
Wikipedia Near Real Time (from IRC) 329,178 329K Webwide Crawldata 2015-01-11T20:40:00PST to 2015-01-11T17:30:17PST Jan 12, 2015 01/15 by Internet Archive web
eye 329,178
favorite 0
comment 0
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Sun Jan 11 20:40:00 PST 2015 to Sun Jan 11 17:30:17 PST 2015.
Topics: no404, wikipedia, crawldata
Wikipedia Near Real Time (from IRC) 318,739 319K Webwide Crawldata 2015-01-10T20:28:55PST to 2015-01-10T14:35:49PST Jan 10, 2015 01/15 by Internet Archive web
eye 318,739
favorite 0
comment 0
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Sat Jan 10 20:28:55 PST 2015 to Sat Jan 10 14:35:49 PST 2015.
Topics: no404, wikipedia, crawldata
Wikipedia Near Real Time (from IRC) 315,697 316K Webwide Crawldata 2013-10-12T05:10:05PDT to 2013-10-11T23:33:01PDT Oct 12, 2013 10/13 by Internet Archive web
eye 315,697
favorite 0
comment 0
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Sat Oct 12 05:10:05 PDT 2013 to Fri Oct 11 23:33:01 PDT 2013.
Topics: no404, wikipedia, crawldata
Wikipedia Near Real Time (from IRC) 309,797 310K Webwide Crawldata 2013-10-12T01:17:32PDT to 2013-10-11T19:35:18PDT Oct 12, 2013 10/13 by Internet Archive web
eye 309,797
favorite 0
comment 0
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Sat Oct 12 01:17:32 PDT 2013 to Fri Oct 11 19:35:18 PDT 2013.
Topics: no404, wikipedia, crawldata
Wordpress Blogs and the Pages They Link To 306,392 306K Webwide Crawldata 2017-01-10T14:41:43PST to 2017-01-10T10:59:29PST Jan 10, 2017 01/17 by Internet Archive web
eye 306,392
favorite 0
comment 0
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl818.us.archive.org:no404 from Tue Jan 10 14:41:43 PST 2017 to Tue Jan 10 10:59:29 PST 2017.
Topics: no404, wordpress, crawldata
Wikipedia Near Real Time (from IRC) 297,036 297K Webwide Crawldata 2013-10-12T03:09:48PDT to 2013-10-11T21:36:24PDT Oct 12, 2013 10/13 by Internet Archive web
eye 297,036
favorite 0
comment 0
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Sat Oct 12 03:09:48 PDT 2013 to Fri Oct 11 21:36:24 PDT 2013.
Topics: no404, wikipedia, crawldata
Wikipedia Near Real Time (from IRC) 292,017 292K Webwide Crawldata 2017-01-18T03:26:20PST to 2017-01-17T21:10:10PST Jan 18, 2017 01/17 by Internet Archive web
eye 292,017
favorite 0
comment 0
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Wed Jan 18 03:26:20 PST 2017 to Tue Jan 17 21:10:10 PST 2017.
Topics: no404, wikipedia, crawldata
Wikipedia Near Real Time (from IRC) 289,175 289K Webwide Crawldata 2013-10-12T04:03:47PDT to 2013-10-11T22:24:49PDT Oct 12, 2013 10/13 by Internet Archive web
eye 289,175
favorite 0
comment 0
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Sat Oct 12 04:03:47 PDT 2013 to Fri Oct 11 22:24:49 PDT 2013.
Topics: no404, wikipedia, crawldata
GDELT 287,780 288K Webwide Crawldata 2018-01-29T00:17:27PST to 2018-01-28T17:44:33PST Jan 29, 2018 01/18 by Internet Archive web
eye 287,780
favorite 0
comment 0
Internet Archive crawldata from feed-driven GDELT Crawl, captured by crawl816.us.archive.org:gdelt from Mon Jan 29 00:17:27 PST 2018 to Sun Jan 28 17:44:33 PST 2018.
Topic: crawldata
Wordpress Blogs and the Pages They Link To 282,254 282K Webwide Crawldata 2018-10-31T22:29:30PDT to 2018-11-01T03:23:08PDT Nov 1, 2018 11/18 by Internet Archive web
eye 282,254
favorite 0
comment 0
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl106.us.archive.org:no404 from Wed Oct 31 22:29:30 PDT 2018 to Thu Nov 1 03:23:08 PDT 2018.
Topics: no404, wordpress, crawldata
Wordpress Blogs and the Pages They Link To 281,037 281K Webwide Crawldata 2018-11-01T00:49:53PDT to 2018-11-01T04:06:58PDT Nov 1, 2018 11/18 by Internet Archive web
eye 281,037
favorite 0
comment 0
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl108.us.archive.org:no404 from Thu Nov 1 00:49:53 PDT 2018 to Thu Nov 1 04:06:58 PDT 2018.
Topics: no404, wordpress, crawldata
Wordpress Blogs and the Pages They Link To 278,664 279K Webwide Crawldata 2018-11-01T06:43:55PDT to 2018-11-01T09:20:06PDT Nov 1, 2018 11/18 by Internet Archive web
eye 278,664
favorite 0
comment 0
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl106.us.archive.org:no404 from Thu Nov 1 06:43:55 PDT 2018 to Thu Nov 1 09:20:06 PDT 2018.
Topics: no404, wordpress, crawldata
Wordpress Blogs and the Pages They Link To 277,115 277K Webwide Crawldata 2018-11-01T08:13:40PDT to 2018-11-01T10:12:18PDT Nov 1, 2018 11/18 by Internet Archive web
eye 277,115
favorite 0
comment 0
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl108.us.archive.org:no404 from Thu Nov 1 08:13:40 PDT 2018 to Thu Nov 1 10:12:18 PDT 2018.
Topics: no404, wordpress, crawldata
Wordpress Blogs and the Pages They Link To 276,401 276K Webwide Crawldata 2018-11-01T02:25:04PDT to 2018-11-01T05:03:57PDT Nov 1, 2018 11/18 by Internet Archive web
eye 276,401
favorite 0
comment 0
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl107.us.archive.org:no404 from Thu Nov 1 02:25:04 PDT 2018 to Thu Nov 1 05:03:57 PDT 2018.
Topics: no404, wordpress, crawldata
Wordpress Blogs and the Pages They Link To 273,883 274K Webwide Crawldata 2013-11-08T18:07:43PST to 2013-11-08T11:24:54PST Nov 9, 2013 11/13 by Internet Archive web
eye 273,883
favorite 0
comment 0
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl344.us.archive.org:no404 from Fri Nov 8 18:07:43 PST 2013 to Fri Nov 8 11:24:54 PST 2013.
Topics: no404, wordpress, crawldata
Wordpress Blogs and the Pages They Link To 269,199 269K Webwide Crawldata 2013-10-09T13:36:49PDT to 2013-10-09T07:59:25PDT Oct 10, 2013 10/13 by Internet Archive web
eye 269,199
favorite 0
comment 0
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl458.us.archive.org:no404 from Wed Oct 9 13:36:49 PDT 2013 to Wed Oct 9 07:59:25 PDT 2013.
Topics: no404, wordpress, crawldata
Wikipedia Near Real Time (from IRC) 268,592 269K Webwide Crawldata 2013-09-22T02:43:39PDT to 2013-09-21T21:49:05PDT Sep 24, 2013 09/13 by Internet Archive web
eye 268,592
favorite 0
comment 0
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Sun Sep 22 02:43:39 PDT 2013 to Sat Sep 21 21:49:05 PDT 2013.
Topics: no404, wikipedia, crawldata
Wikipedia Near Real Time (from IRC) 263,208 263K Webwide Crawldata 2013-09-21T22:25:59PDT to 2013-09-21T18:13:45PDT Sep 24, 2013 09/13 by Internet Archive web
eye 263,208
favorite 0
comment 0
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Sat Sep 21 22:25:59 PDT 2013 to Sat Sep 21 18:13:45 PDT 2013.
Topics: no404, wikipedia, crawldata
Wordpress Blogs and the Pages They Link To 259,301 259K Webwide Crawldata 2013-10-07T06:39:20PDT to 2013-10-07T01:07:00PDT Oct 7, 2013 10/13 by Internet Archive web
eye 259,301
favorite 0
comment 0
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl458.us.archive.org:no404 from Mon Oct 7 06:39:20 PDT 2013 to Mon Oct 7 01:07:00 PDT 2013.
Topics: no404, wordpress, crawldata
Wikipedia Near Real Time (from IRC) 255,348 255K Webwide Crawldata 2013-10-12T06:01:08PDT to 2013-10-12T00:24:12PDT Oct 12, 2013 10/13 by Internet Archive web
eye 255,348
favorite 0
comment 0
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Sat Oct 12 06:01:08 PDT 2013 to Sat Oct 12 00:24:12 PDT 2013.
Topics: no404, wikipedia, crawldata
GDELT 255,161 255K Webwide Crawldata 2016-06-05T12:10:10PDT to 2016-06-05T06:42:33PDT Jun 5, 2016 06/16 by Internet Archive web
eye 255,161
favorite 0
comment 0
Internet Archive crawldata from feed-driven GDELT Crawl, captured by crawl816.us.archive.org:gdelt from Sun Jun 5 12:10:10 PDT 2016 to Sun Jun 5 06:42:33 PDT 2016.
Topic: crawldata
Wikipedia Near Real Time (from IRC) 254,143 254K Webwide Crawldata 2013-10-12T07:38:37PDT to 2013-10-12T02:15:16PDT Oct 12, 2013 10/13 by Internet Archive web
eye 254,143
favorite 0
comment 0
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Sat Oct 12 07:38:37 PDT 2013 to Sat Oct 12 02:15:16 PDT 2013.
Topics: no404, wikipedia, crawldata
Wordpress Blogs and the Pages They Link To 250,951 251K Webwide Crawldata 2013-12-02T21:56:25PST to 2013-12-02T15:29:07PST Dec 3, 2013 12/13 by Internet Archive web
eye 250,951
favorite 0
comment 0
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl344.us.archive.org:no404 from Mon Dec 2 21:56:25 PST 2013 to Mon Dec 2 15:29:07 PST 2013.
Topics: no404, wordpress, crawldata
Wikipedia Near Real Time (from IRC) 250,691 251K Webwide Crawldata 2013-09-21T23:53:08PDT to 2013-09-21T19:42:29PDT Sep 24, 2013 09/13 by Internet Archive web
eye 250,691
favorite 0
comment 0
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Sat Sep 21 23:53:08 PDT 2013 to Sat Sep 21 19:42:29 PDT 2013.
Topics: no404, wikipedia, crawldata
Wordpress Blogs and the Pages They Link To 249,481 249K Webwide Crawldata 2013-11-08T19:29:44PST to 2013-11-08T12:30:08PST Nov 9, 2013 11/13 by Internet Archive web
eye 249,481
favorite 0
comment 0
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl458.us.archive.org:no404 from Fri Nov 8 19:29:44 PST 2013 to Fri Nov 8 12:30:08 PST 2013.
Topics: no404, wordpress, crawldata
Wikipedia Near Real Time (from IRC) 247,530 248K Webwide Crawldata 2013-10-13T22:05:29PDT to 2013-10-13T16:25:33PDT Oct 14, 2013 10/13 by Internet Archive web
eye 247,530
favorite 0
comment 0
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Sun Oct 13 22:05:29 PDT 2013 to Sun Oct 13 16:25:33 PDT 2013.
Topics: no404, wikipedia, crawldata
Wikipedia Near Real Time (from IRC) 246,905 247K Webwide Crawldata 2018-09-24T07:08:06PDT to 2018-09-24T22:54:26PDT Sep 25, 2018 09/18 by Internet Archive web
eye 246,905
favorite 0
comment 0
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl109.us.archive.org:no404 from Mon Sep 24 07:08:06 PDT 2018 to Mon Sep 24 22:54:26 PDT 2018.
Topics: no404, wikipedia, crawldata
Wikipedia Near Real Time (from IRC) 243,921 244K Webwide Crawldata 2014-10-26T06:02:13PDT to 2014-10-26T00:44:36PDT Oct 26, 2014 10/14 by Internet Archive web
eye 243,921
favorite 0
comment 0
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Sun Oct 26 06:02:13 PDT 2014 to Sun Oct 26 00:44:36 PDT 2014.
Topics: no404, wikipedia, crawldata
Wikipedia Near Real Time (from IRC) 243,690 244K Webwide Crawldata 2014-03-13T15:39:54PDT to 2014-03-13T10:29:53PDT Mar 13, 2014 03/14 by Internet Archive web
eye 243,690
favorite 0
comment 0
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Thu Mar 13 15:39:54 PDT 2014 to Thu Mar 13 10:29:53 PDT 2014.
Topics: no404, wikipedia, crawldata
Wikipedia Near Real Time (from IRC) 240,746 241K Webwide Crawldata 2013-12-03T01:48:04PST to 2013-12-02T20:11:08PST Dec 3, 2013 12/13 by Internet Archive web
eye 240,746
favorite 0
comment 0
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Tue Dec 3 01:48:04 PST 2013 to Mon Dec 2 20:11:08 PST 2013.
Topics: no404, wikipedia, crawldata
GDELT 238,630 239K Webwide Crawldata 2015-10-01T15:26:49PDT to 2015-10-01T09:43:18PDT Oct 1, 2015 10/15 by Internet Archive web
eye 238,630
favorite 0
comment 0
Internet Archive crawldata from feed-driven GDELT Crawl, captured by crawl816.us.archive.org:gdelt from Thu Oct 1 15:26:49 PDT 2015 to Thu Oct 1 09:43:18 PDT 2015.
Topic: crawldata
Wikipedia Near Real Time (from IRC) 233,910 234K Webwide Crawldata 2013-10-12T11:08:48PDT to 2013-10-12T06:01:41PDT Oct 12, 2013 10/13 by Internet Archive web
eye 233,910
favorite 0
comment 0
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Sat Oct 12 11:08:48 PDT 2013 to Sat Oct 12 06:01:41 PDT 2013.
Topics: no404, wikipedia, crawldata
Wordpress Blogs and the Pages They Link To 233,769 234K Webwide Crawldata 2013-11-08T18:12:47PST to 2013-11-08T11:19:28PST Nov 9, 2013 11/13 by Internet Archive web
eye 233,769
favorite 0
comment 0
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl458.us.archive.org:no404 from Fri Nov 8 18:12:47 PST 2013 to Fri Nov 8 11:19:28 PST 2013.
Topics: no404, wordpress, crawldata
Wikipedia Near Real Time (from IRC) 233,559 234K Webwide Crawldata 2013-12-15T11:40:44PST to 2013-12-15T05:32:44PST Dec 15, 2013 12/13 by Internet Archive web
eye 233,559
favorite 0
comment 0
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Sun Dec 15 11:40:44 PST 2013 to Sun Dec 15 05:32:44 PST 2013.
Topics: no404, wikipedia, crawldata
Wikipedia Near Real Time (from IRC) 231,549 232K Webwide Crawldata 2013-09-21T06:02:57PDT to 2013-09-21T01:17:57PDT Sep 23, 2013 09/13 by Internet Archive web
eye 231,549
favorite 0
comment 0
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Sat Sep 21 06:02:57 PDT 2013 to Sat Sep 21 01:17:57 PDT 2013.
Topics: no404, wikipedia, crawldata
Wikipedia Near Real Time (from IRC) 231,156 231K Webwide Crawldata 2014-10-06T14:27:37PDT to 2014-10-06T10:01:54PDT Oct 6, 2014 10/14 by Internet Archive web
eye 231,156
favorite 0
comment 0
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Mon Oct 6 14:27:37 PDT 2014 to Mon Oct 6 10:01:54 PDT 2014.
Topics: no404, wikipedia, crawldata
Wikipedia Near Real Time (from IRC) 230,537 231K Webwide Crawldata 2015-04-13T23:31:41PDT to 2015-04-13T18:01:32PDT Apr 14, 2015 04/15 by Internet Archive web
eye 230,537
favorite 0
comment 0
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Mon Apr 13 23:31:41 PDT 2015 to Mon Apr 13 18:01:32 PDT 2015.
Topics: no404, wikipedia, crawldata
Wikipedia Near Real Time (from IRC) 229,372 229K Webwide Crawldata 2013-09-21T12:19:50PDT to 2013-09-21T07:00:20PDT Sep 23, 2013 09/13 by Internet Archive web
eye 229,372
favorite 0
comment 0
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Sat Sep 21 12:19:50 PDT 2013 to Sat Sep 21 07:00:20 PDT 2013.
Topics: no404, wikipedia, crawldata
Wikipedia Near Real Time (from IRC) 229,074 229K Webwide Crawldata 2013-10-12T10:13:29PDT to 2013-10-12T04:53:58PDT Oct 12, 2013 10/13 by Internet Archive web
eye 229,074
favorite 0
comment 0
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Sat Oct 12 10:13:29 PDT 2013 to Sat Oct 12 04:53:58 PDT 2013.
Topics: no404, wikipedia, crawldata
MORE RESULTS
Fetching more results DESCRIPTION These crawls are part of an effort to archive pages as they are created and archive the pages that they refer to. That way, as the pages that are referenced are changed or taken from the web, a link to the version that was live when the page was written will be preserved.

Then the Internet Archive hopes that references to these archived pages will be put in place of a link that would be otherwise be broken, or a companion link to allow people to see what was originally intended by a page's authors.

The goal is to fix all broken links on the web. Crawls of supported "No More 404" sites.
ACTIVITY

comment


Created on September 12
2013 ARossi
Archivist ADDITIONAL CONTRIBUTORS Wayback Machine Web Crawling
Archivist VIEWS — About the New Statistics

Total Views 1,345,496,786

DISCONTINUED VIEWS

Total Views 1,319,998,211

ITEMS

Total Items 94,110

TOP REGIONS (LAST 30 DAYS)

(data not available)