List Info

Thread: Re: Help us test a new service




Re: Help us test a new service
country flaguser name
Brazil
2007-02-07 12:46:14
Hi Sebastian,

I use zebra with XML/DC records for harvesting newspapers. I
have 221 newspapers. All the stories are structured using
dublin core with full text. I'd like to contribute, is there
any way?
The newspapers are from Brazil and others 20 around the
world (the main ones). ok. We do this daily, offering fresh
news and headlines.

Cheers,

Rondon Andrade
IOOP - Online Information Ltda.
www.ioop.com.br




> Hi guys,
>
> It's long been on our mind to use our favorite
standards to  leverage
> the quickly growing repositories of open content out
there.
>
> The first step of this is the establishment of a new
Z39.50 server, at 
> econtent.indexdata.com, port 210, which provides access
to the following
> logical databases:
>
> oaister             -- Metadata from the OAIster OAI
service provider
> (http://oaist
er.umdl.umich.edu/o/oaister/), about 10 million
metadata
> records harvested from various open access archives.
>
> wikipedia        -- title searching wikipedia titles,
abstracts, and
> links. Around 1,5 million records.
>
> oca-americana  -- Full (  MARC
records for books scanned as part of 
> the Internet Archive's OCA (Open Content Alliance)
initiative. There's 
> around 50,000 books in this collection. Within a day or
two, we will be
> adding the remaining Text collections from the Internet
Archive,
> including Gutenberg scans and others. The OCA is
producing around a
> million new pages of high quality scans, searchable
PDFs and online
> books per month at the moment, so this is an exciting
collection to watch.
>
> dmoz                -- Human-cataloged web resources.
Several million
> sites cataloged.
>
> All databases will return records in XML/DC and MARC,
of varying
> quality. We aim to keep our copies of these resources
updated on regular
> schedules. We are actively looking for new, interesting
repositories of
> open content to make available in this way. If you have
suggestions,
> please feel free to get in touch with us.
>
> The server suports a fairly basic set of USE
attributes, and the usual 
> combinations, but since this is running on our Zebra
server, you can
> also add a 2=102 to any term to produce a
relevance-ranked result list.
> We'll post a website shortly with a more thorough list
of options, as
> well as ZeeRex-based descriptions of the resources.
>
> We will be adding SRU/W support within a week or so,
but I figured folks
> on this list wouldn't mind doing thing the traditional
way. I'm
> imagining that possible uses for this service might
include copying
> records for ebooks into your catalog, building
metasearch facilities for
> free content, etc.
>
> If you have questions, comments, ideas or suggestions,
please feel very
> welcome to send them to me. I'd love to hear what you
think.
>
> All the best,
>
> --Sebastian
>
> --
> Sebastian Hammer, Index Data
> quinnindexdata.com   www.indexdata.com
> Ph: (603) 209-6853 Fax: (866) 383-4485
>
>
> _______________________________________________
> Yazlist mailing list
> Yazlistlists.indexdata.dk
> http://lists.indexdata.dk/cgi-bin/mailman/listinfo/yaz
list
>


_______________________________________________
Yazlist mailing list
Yazlistlists.indexdata.dk
http://lists.indexdata.dk/cgi-bin/mailman/listinfo/yaz
list

Re: Help us test a new service
country flaguser name
Denmark
2007-02-07 13:54:50
Hi Rondon,

Thanks for this. There are different ways we could do this.
We are using 
our metaproxy to tie different Zebra-based servers into one
virtual 
server, providing some level of harmonization in therms of
search access 
points and retrieval formats to hopefully make them all
easier to use.

If your Z server is publicly visible, we can provide access
to it 
through the proxy, or share access information about it with
others. We 
can also host a copy of your metadata on a Zebra
installation here if 
you like -- this is what we're doing for the collections
below -- and 
set up a regular update schedule -- I think it should be
daily, in your 
case.. this would provide some protection for you in case
this service 
becomes popular, which of course we hope it will.

All the best,

--Sebastian

Rondon Andrade wrote:

>Hi Sebastian,
>
>I use zebra with XML/DC records for harvesting
newspapers. I have 221 newspapers. All the stories are
structured using dublin core with full text. I'd like to
contribute, is there any way?
>The newspapers are from Brazil and others 20 around the
world (the main ones). ok. We do this daily, offering fresh
news and headlines.
>
>Cheers,
>
>Rondon Andrade
>IOOP - Online Information Ltda.
>www.ioop.com.br
>
>
>
>
>  
>
>>Hi guys,
>>
>>It's long been on our mind to use our favorite
standards to  leverage 
>>the quickly growing repositories of open content out
there.
>>
>>The first step of this is the establishment of a new
Z39.50 server, at 
>>econtent.indexdata.com, port 210, which provides
access to the following 
>>logical databases:
>>
>>oaister             -- Metadata from the OAIster OAI
service provider 
>>(http://oaist
er.umdl.umich.edu/o/oaister/), about 10 million metadata

>>records harvested from various open access
archives.
>>
>>wikipedia        -- title searching wikipedia
titles, abstracts, and 
>>links. Around 1,5 million records.
>>
>>oca-americana  -- Full (  MARC
records for books scanned as part of 
>>the Internet Archive's OCA (Open Content Alliance)
initiative. There's 
>>around 50,000 books in this collection. Within a day
or two, we will be 
>>adding the remaining Text collections from the
Internet Archive, 
>>including Gutenberg scans and others. The OCA is
producing around a 
>>million new pages of high quality scans, searchable
PDFs and online 
>>books per month at the moment, so this is an
exciting collection to watch.
>>
>>dmoz                -- Human-cataloged web
resources. Several million 
>>sites cataloged.
>>
>>All databases will return records in XML/DC and
MARC, of varying 
>>quality. We aim to keep our copies of these
resources updated on regular 
>>schedules. We are actively looking for new,
interesting repositories of 
>>open content to make available in this way. If you
have suggestions, 
>>please feel free to get in touch with us.
>>
>>The server suports a fairly basic set of USE
attributes, and the usual 
>>combinations, but since this is running on our Zebra
server, you can 
>>also add a 2=102 to any term to produce a
relevance-ranked result list. 
>>We'll post a website shortly with a more thorough
list of options, as 
>>well as ZeeRex-based descriptions of the resources.
>>
>>We will be adding SRU/W support within a week or so,
but I figured folks 
>>on this list wouldn't mind doing thing the
traditional way. I'm 
>>imagining that possible uses for this service might
include copying 
>>records for ebooks into your catalog, building
metasearch facilities for 
>>free content, etc.
>>
>>If you have questions, comments, ideas or
suggestions, please feel very 
>>welcome to send them to me. I'd love to hear what
you think.
>>
>>All the best,
>>
>>--Sebastian
>>
>>-- 
>>Sebastian Hammer, Index Data
>>quinnindexdata.com   www.indexdata.com
>>Ph: (603) 209-6853 Fax: (866) 383-4485
>>
>>
>>_______________________________________________
>>Yazlist mailing list
>>Yazlistlists.indexdata.dk
>>http://lists.indexdata.dk/cgi-bin/mailman/listinfo/yaz
list
>>
>>    
>>
>
>
>_______________________________________________
>Yazlist mailing list
>Yazlistlists.indexdata.dk
>http://lists.indexdata.dk/cgi-bin/mailman/listinfo/yaz
list
>
>
>  
>

-- 
Sebastian Hammer, Index Data
quinnindexdata.com   www.indexdata.com
Ph: (603) 209-6853 Fax: (866) 383-4485


_______________________________________________
Yazlist mailing list
Yazlistlists.indexdata.dk
http://lists.indexdata.dk/cgi-bin/mailman/listinfo/yaz
list

[1-2]

about | contact  Other archives ( Real Estate discussion Medical topics )