|
List Info
Thread: Re: Help us test a new service
|
|
| Re: Help us test a new service |
  Brazil |
2007-02-07 12:46:14 |
Hi Sebastian,
I use zebra with XML/DC records for harvesting newspapers. I
have 221 newspapers. All the stories are structured using
dublin core with full text. I'd like to contribute, is there
any way?
The newspapers are from Brazil and others 20 around the
world (the main ones). ok. We do this daily, offering fresh
news and headlines.
Cheers,
Rondon Andrade
IOOP - Online Information Ltda.
www.ioop.com.br
> Hi guys,
>
> It's long been on our mind to use our favorite
standards to leverage
> the quickly growing repositories of open content out
there.
>
> The first step of this is the establishment of a new
Z39.50 server, at
> econtent.indexdata.com, port 210, which provides access
to the following
> logical databases:
>
> oaister -- Metadata from the OAIster OAI
service provider
> (http://oaist
er.umdl.umich.edu/o/oaister/), about 10 million
metadata
> records harvested from various open access archives.
>
> wikipedia -- title searching wikipedia titles,
abstracts, and
> links. Around 1,5 million records.
>
> oca-americana -- Full ( MARC
records for books scanned as part of
> the Internet Archive's OCA (Open Content Alliance)
initiative. There's
> around 50,000 books in this collection. Within a day or
two, we will be
> adding the remaining Text collections from the Internet
Archive,
> including Gutenberg scans and others. The OCA is
producing around a
> million new pages of high quality scans, searchable
PDFs and online
> books per month at the moment, so this is an exciting
collection to watch.
>
> dmoz -- Human-cataloged web resources.
Several million
> sites cataloged.
>
> All databases will return records in XML/DC and MARC,
of varying
> quality. We aim to keep our copies of these resources
updated on regular
> schedules. We are actively looking for new, interesting
repositories of
> open content to make available in this way. If you have
suggestions,
> please feel free to get in touch with us.
>
> The server suports a fairly basic set of USE
attributes, and the usual
> combinations, but since this is running on our Zebra
server, you can
> also add a 2=102 to any term to produce a
relevance-ranked result list.
> We'll post a website shortly with a more thorough list
of options, as
> well as ZeeRex-based descriptions of the resources.
>
> We will be adding SRU/W support within a week or so,
but I figured folks
> on this list wouldn't mind doing thing the traditional
way. I'm
> imagining that possible uses for this service might
include copying
> records for ebooks into your catalog, building
metasearch facilities for
> free content, etc.
>
> If you have questions, comments, ideas or suggestions,
please feel very
> welcome to send them to me. I'd love to hear what you
think.
>
> All the best,
>
> --Sebastian
>
> --
> Sebastian Hammer, Index Data
> quinn indexdata.com www.indexdata.com
> Ph: (603) 209-6853 Fax: (866) 383-4485
>
>
> _______________________________________________
> Yazlist mailing list
> Yazlist lists.indexdata.dk
> http://lists.indexdata.dk/cgi-bin/mailman/listinfo/yaz
list
>
_______________________________________________
Yazlist mailing list
Yazlist lists.indexdata.dk
http://lists.indexdata.dk/cgi-bin/mailman/listinfo/yaz
list
|
|
| Re: Help us test a new service |
  Denmark |
2007-02-07 13:54:50 |
Hi Rondon,
Thanks for this. There are different ways we could do this.
We are using
our metaproxy to tie different Zebra-based servers into one
virtual
server, providing some level of harmonization in therms of
search access
points and retrieval formats to hopefully make them all
easier to use.
If your Z server is publicly visible, we can provide access
to it
through the proxy, or share access information about it with
others. We
can also host a copy of your metadata on a Zebra
installation here if
you like -- this is what we're doing for the collections
below -- and
set up a regular update schedule -- I think it should be
daily, in your
case.. this would provide some protection for you in case
this service
becomes popular, which of course we hope it will.
All the best,
--Sebastian
Rondon Andrade wrote:
>Hi Sebastian,
>
>I use zebra with XML/DC records for harvesting
newspapers. I have 221 newspapers. All the stories are
structured using dublin core with full text. I'd like to
contribute, is there any way?
>The newspapers are from Brazil and others 20 around the
world (the main ones). ok. We do this daily, offering fresh
news and headlines.
>
>Cheers,
>
>Rondon Andrade
>IOOP - Online Information Ltda.
>www.ioop.com.br
>
>
>
>
>
>
>>Hi guys,
>>
>>It's long been on our mind to use our favorite
standards to leverage
>>the quickly growing repositories of open content out
there.
>>
>>The first step of this is the establishment of a new
Z39.50 server, at
>>econtent.indexdata.com, port 210, which provides
access to the following
>>logical databases:
>>
>>oaister -- Metadata from the OAIster OAI
service provider
>>(http://oaist
er.umdl.umich.edu/o/oaister/), about 10 million metadata
>>records harvested from various open access
archives.
>>
>>wikipedia -- title searching wikipedia
titles, abstracts, and
>>links. Around 1,5 million records.
>>
>>oca-americana -- Full ( MARC
records for books scanned as part of
>>the Internet Archive's OCA (Open Content Alliance)
initiative. There's
>>around 50,000 books in this collection. Within a day
or two, we will be
>>adding the remaining Text collections from the
Internet Archive,
>>including Gutenberg scans and others. The OCA is
producing around a
>>million new pages of high quality scans, searchable
PDFs and online
>>books per month at the moment, so this is an
exciting collection to watch.
>>
>>dmoz -- Human-cataloged web
resources. Several million
>>sites cataloged.
>>
>>All databases will return records in XML/DC and
MARC, of varying
>>quality. We aim to keep our copies of these
resources updated on regular
>>schedules. We are actively looking for new,
interesting repositories of
>>open content to make available in this way. If you
have suggestions,
>>please feel free to get in touch with us.
>>
>>The server suports a fairly basic set of USE
attributes, and the usual
>>combinations, but since this is running on our Zebra
server, you can
>>also add a 2=102 to any term to produce a
relevance-ranked result list.
>>We'll post a website shortly with a more thorough
list of options, as
>>well as ZeeRex-based descriptions of the resources.
>>
>>We will be adding SRU/W support within a week or so,
but I figured folks
>>on this list wouldn't mind doing thing the
traditional way. I'm
>>imagining that possible uses for this service might
include copying
>>records for ebooks into your catalog, building
metasearch facilities for
>>free content, etc.
>>
>>If you have questions, comments, ideas or
suggestions, please feel very
>>welcome to send them to me. I'd love to hear what
you think.
>>
>>All the best,
>>
>>--Sebastian
>>
>>--
>>Sebastian Hammer, Index Data
>>quinn indexdata.com www.indexdata.com
>>Ph: (603) 209-6853 Fax: (866) 383-4485
>>
>>
>>_______________________________________________
>>Yazlist mailing list
>>Yazlist lists.indexdata.dk
>>http://lists.indexdata.dk/cgi-bin/mailman/listinfo/yaz
list
>>
>>
>>
>
>
>_______________________________________________
>Yazlist mailing list
>Yazlist lists.indexdata.dk
>http://lists.indexdata.dk/cgi-bin/mailman/listinfo/yaz
list
>
>
>
>
--
Sebastian Hammer, Index Data
quinn indexdata.com www.indexdata.com
Ph: (603) 209-6853 Fax: (866) 383-4485
_______________________________________________
Yazlist mailing list
Yazlist lists.indexdata.dk
http://lists.indexdata.dk/cgi-bin/mailman/listinfo/yaz
list
|
|
[1-2]
|
|