Where Could I Get The Latest Data Dump (Before It Closed)?

Discussion in 'Using Directory Data' started by Graph, Apr 18, 2017.

  1. Graph

    Graph New Member

    2
    0
    1
    Apr 13, 2017
    As you all know, DMOZ closed last month, and I want a copy of the DMOZ data from before it closed. I searched DMOZ.org for the latest dump of the data, but all it shows is the "We're closed" page.

    Where could I get the latest dump before it closed?

    I'm creating a successor to DMOZ so I need this data.

    Also, is the content from this site, dmoztools.net, using the latest data dump or does it contain updates after the latest dump? Also, does dmoztools.net contain anything besides the latest dump?
     
    Last edited: Apr 18, 2017
  2. Graph

    Graph New Member

    2
    0
    1
    Apr 13, 2017
    I have heard that some people scraped the data before DMOZ shut down: seobook.com/dmoz-shut-down

    So I'm unsure whether the scraped data is more up-to-date than the RDF data dump.

    I've also heard that the link submissions on the old DMOZ that were not yet approved will be passed on to the new official successor to be approved. Is it true? If it's true then that's data that is not available to me.
     
    Last edited: Apr 18, 2017
  3. pvgool

    pvgool DMOZ Meta Curlie Meta

    10,087
    43
    48
    Oct 8, 2002
    In this thread you can find a link to a copy of the last RDF dump.

    > is the content from this site, dmoztools.net, using the latest data dump
    Yes. But please do not scrape that website.
    > or does it contain updates after the latest dump?
    > does dmoztools.net contain anything besides the latest dump?
    No, it is just a copy based on the last dump.

    > So I'm unsure whether the scraped data is more up-to-date than the RDF data dump.
    Probably the dump is more up to date, as it was created just before the shutdown.

    > I've also heard that the link submissions on the old DMOZ that were not yet approved
    > will be passed on to the new official successor to be approved. Is it true?
    It is our (the editor community) intention to start a new directory based on the old DMOZ data, including all suggestions that were not approved yet.
    So, yes it is true.
    > If it's true then that's data that is not available to me.
    Correct, that data is not available to you.
     
  4. Rz Roth

    Rz Roth New Member

    1
    0
    1
    Jul 27, 2017
    The link in that thread points to curlz.org/dmoz_rdf/, which is no longer a valid domain,
    and archive.org does not archive the data files.
    Are there any other suggestions for data dumps?

    I have been using minimoz to add dmoz links as added information for some of my web sites
    and missed the closing to grab the last data dump

    Someone please help -- I will also host those dumps if that is helpful to the project.
     
  5. stillbuyvhs

    stillbuyvhs Editor

    21
    4
    3
    Mar 30, 2017
  6. informator

    informator DMOZ Meta Curlie Meta

    1,497
    25
    48
    Aug 19, 2003
    The data files are intact at archive.org (content, structure, categories)...
     
  7. clement116

    clement116 New Member

    1
    0
    1
    Jul 28, 2017
    Hello everyone,

    Neither the links on dmoztools nor the mirror link at curlz.org work for me. Where can I find the latest structure.rdf.u8.gz? Could anyone help with this?

    Best wishes
    Clement
     
  8. informator

    informator DMOZ Meta Curlie Meta

    1,497
    25
    48
    Aug 19, 2003
    See the link in post #5; it is working.

    We do not have any connection with curlz.org or with how it operates.

    Dmoztools is only a static copy of the directory listings and not the whole old dmoz site.
     
