Welcome to Resource Zone.

DMOZ Metadata

mingluwang

Member
Joined
Jan 29, 2009
Dear Editors,

I am currently working on a WiderNet project (www.widernet.org). We maintain the eGranary Digital Library, which contains many complete websites (http://www.widernet.org/digitalLibrary/content/WhatsInside.asp). We need to gather metadata for these websites, and we think DMOZ might have it in its database.

So, is it the RDF that we need to download? What kind of information is in it? Is there a sample record for a single item?

What are the tools that are available for transforming the data file into SQL files?

Thank you!
 

pvgool

kEditall/kCatmv
Curlie Meta
Joined
Oct 8, 2002
I have no idea what you mean by "metadata".
DMOZ only has the URL of each website we have listed, plus a title and description written by us.
Information about the RDF can be found at http://rdf.dmoz.org/ , including a small sample.
But remember that when you use the DMOZ data you must follow the license agreement as presented at https://curlie.org/license.html
 

windharp

Meta/kMeta
Curlie Meta
Joined
Apr 30, 2002
So, is it the RDF that we need to download?
Assuming you are talking about a large number of samples you want to test: yes, the content.* RDF file is what you need. Be prepared for it to be a few GB uncompressed. If you are only talking about a few URLs you want to check manually, use the dmoz.org on-site search, omitting www. or other prefixes and searching for the domain only.

The format is somewhat like XML, but unfortunately it is based on a very early stage of the RDF specification, which makes it unreadable for common parsers. Because the syntax is simple, though, a semi-skilled programmer should be able to parse the file easily.
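For illustration only, here is a minimal sketch of such a hand-written parser in Python. It assumes that each listing is an `<ExternalPage about="...">` block containing `<d:Title>`, `<d:Description>` and `<topic>` elements, one per line; that layout is an assumption and should be checked against the sample published with the dump. The function name `iter_external_pages` and the dictionary keys are hypothetical.

```python
import html
import re
from typing import Dict, Iterable, Iterator

# ASSUMED record layout (verify against the published sample):
#   <ExternalPage about="http://www.example.org/">
#     <d:Title>...</d:Title>
#     <d:Description>...</d:Description>
#     <topic>Top/...</topic>
#   </ExternalPage>
PAGE_OPEN = re.compile(r'<ExternalPage about="([^"]*)">')
FIELD = re.compile(r'<(d:Title|d:Description|topic)>(.*?)</\1>')


def iter_external_pages(lines: Iterable[str]) -> Iterator[Dict[str, str]]:
    """Yield one dict per listing, reading the dump line by line.

    Reading line by line avoids loading a multi-GB file into memory
    and sidesteps XML parsers that reject the file.
    """
    record = None
    for line in lines:
        opened = PAGE_OPEN.search(line)
        if opened:
            # Start a new record; decode entities such as &amp;.
            record = {"url": html.unescape(opened.group(1))}
            continue
        if record is None:
            continue  # not inside a listing (e.g. a category block)
        field = FIELD.search(line)
        if field:
            # "d:Title" -> "title", "topic" -> "topic"
            key = field.group(1).split(":")[-1].lower()
            record[key] = html.unescape(field.group(2))
        elif "</ExternalPage>" in line:
            yield record
            record = None


sample = """<ExternalPage about="http://www.example.org/">
  <d:Title>Example &amp; Co</d:Title>
  <d:Description>A sample listing.</d:Description>
  <topic>Top/Reference/Libraries</topic>
</ExternalPage>""".splitlines()

pages = list(iter_external_pages(sample))
```

On the real file, the same function could be fed an open file handle (for example one opened with `encoding="utf-8", errors="replace"`) instead of the in-memory sample.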

What are the tools that are available for transforming the data file into SQL files?
Not that many, sorry to say. The resources we know of are listed at https://curlie.org/Computers/Intern...rectory_Project/Use_of_ODP_Data/Upload_Tools/ - maybe one of those links can help you.
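If none of the listed tools fits, loading the data yourself is not much work. The sketch below is only an illustration, not one of the listed tools: it assumes the listings have already been parsed into dictionaries with the hypothetical keys `url`, `title`, `description` and `topic`, and it writes them to an SQLite database using Python's standard `sqlite3` module. The table name `external_page` and the function `load_pages` are made up for this example.

```python
import sqlite3
from itertools import islice
from typing import Dict, Iterable


def load_pages(conn: sqlite3.Connection,
               pages: Iterable[Dict[str, str]],
               batch_size: int = 10000) -> int:
    """Insert parsed listings into a table and return the row count.

    Rows are written in batches so that a multi-GB dump never has to
    be held in memory at once.
    """
    conn.execute(
        "CREATE TABLE IF NOT EXISTS external_page ("
        "url TEXT, title TEXT, description TEXT, topic TEXT)"
    )
    iterator = iter(pages)
    total = 0
    while True:
        # Missing keys become NULL in the database.
        batch = [
            (p.get("url"), p.get("title"), p.get("description"), p.get("topic"))
            for p in islice(iterator, batch_size)
        ]
        if not batch:
            break
        conn.executemany(
            "INSERT INTO external_page VALUES (?, ?, ?, ?)", batch
        )
        total += len(batch)
    conn.commit()
    return total


# Hypothetical example data, not taken from the real dump.
example_pages = [
    {"url": "http://www.example.org/", "title": "Example",
     "description": "A sample listing.", "topic": "Top/Reference"},
    {"url": "http://www.example.net/", "title": "Another"},
]

conn = sqlite3.connect(":memory:")
inserted = load_pages(conn, example_pages, batch_size=1)
rows = conn.execute(
    "SELECT url, title, topic FROM external_page ORDER BY url"
).fetchall()
```

From SQLite the data can be exported as SQL statements (for example with the `.dump` command of the `sqlite3` command-line shell) if another database is the final target.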
 

weddingeye

Member
Joined
Apr 25, 2010
Could you please explain DMOZ metadata in more detail? I am new to the project and would like more information, or online reference pages on the topic. Please let me know at your earliest convenience.
 