Thread: Fw: Which databases can Google Scholar crawl?
|
|
| Fw: Which databases can Google Scholar crawl? |
  United States |
2008-02-19 10:28:47 |
From: Kathryn Silberger/ADM/Marist
To: "B.G. Sloan" <bgsloan2 yahoo.com>
Date: 02/19/2008 11:27 AM
Subject: Re: [Web4lib] Which databases can Google Scholar crawl?
Alan's question, "Has anyone ever seen, or attempted, a canonical
list?" is a good one. This could be an activity of some ALA committee.
It is certainly a crucial step in being able to evaluate GS as an
information source. It is important that as a profession we don't
become passive about allowing publishers and aggregators to take
control of collection development decisions.
Katy
Kathryn K. Silberger
Automation Resources Librarian
James A. Cannavino Library
Marist College
3399 North Road
Poughkeepsie, NY 12601
Kathryn.Silberger marist.edu
(845) 575-3000 x.2419
From: "B.G. Sloan" <bgsloan2 yahoo.com>
Sent by: web4lib-bounces webjunction.org
To: web4lib webjunction.org
Date: 02/19/2008 09:59 AM
Subject: Re: [Web4lib] Which databases can Google Scholar crawl?
From the Wikipedia entry on Google Scholar:
"A significant problem with GS is the secrecy about its coverage...GS
refuses to publish a list of scientific journals crawled, and the
frequency of its updates is unknown. It is therefore impossible to
know how current and/or exhaustive searches are in GS."
Bernie Sloan
Alan Cockerill <alan.cockerill jcu.edu.au> wrote:
Hi
Has anyone ever seen a list of what providers have 'opened up' their
indexing and abstracting data to Google Scholar?
Clearly large free sources like PubMed and Scirus can be searched via
Scholar, but what subscription-only databases can be reliably searched
via Scholar?
Common sense tells me only fulltext providers would allow crawling of
the I&A sections of their products, as Scholar opens another channel
to potential article purchasers.
Trial and error tells me that Emerald, JSTOR, Sage, Informaworld and
Ingenta (somewhat patchily) have done this, and local Oz publisher
RMIT Publishing has allowed a couple of its fulltext datasets to be
crawled by Google as well.
Has anyone ever seen, or attempted, a canonical list?
Thanks, Alan.
Alan Cockerill
Library Technologies Coordinator
James Cook University, Cairns
PO Box 6811
CAIRNS QLD 4870
Phone: (07) 4042 1737
Fax: (07) 4042 1516
Email: Alan.Cockerill jcu.edu.au
http://www.library.jcu.edu.au/Staff/alan.shtml
CRICOS Provider Code: 00117J (QLD)
_______________________________________________
Web4lib mailing list
Web4lib webjunction.org
http://lists.webjunction.org/web4lib/
|
|
| Re: Fw: Which databases can Google Scholar crawl? |
  United States |
2008-02-19 14:11:20 |
At 11:28 AM 2/19/2008, Kathryn Silberger wrote:
>This could be an activity of some ALA committee.
Why don't we just expand Alan's original trial & error approach?
If someone has a wiki page they could spare, the collective web4lib
community could probably put together a reasonably thorough listing
in a matter of hours.
Just pick a few databases from your collection, search
inurl:my.database.com (or various other methods) and post your
findings/difficulties.
While not absolutely exhaustive, it would give a pretty reasonable
sense of the coverage for major databases.
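A rough sketch of how the inurl: probes suggested above could be generated in bulk, so that each contributor opens the resulting links and records the findings by hand. The host names are placeholders, the helper names are made up for illustration, and whether Scholar honours inurl: for a given host is exactly what the trial and error is meant to find out. The script only builds URLs; it does not query Google.

```python
from urllib.parse import urlencode

# Placeholder host names (assumptions); substitute databases from your collection.
DATABASE_HOSTS = ["my.database.com", "another.database.example"]

SCHOLAR_SEARCH = "https://scholar.google.com/scholar"


def probe_url(host: str, terms: str = "") -> str:
    """Return a Scholar search URL whose query is restricted with inurl:host."""
    query = f"inurl:{host} {terms}".strip()
    return f"{SCHOLAR_SEARCH}?{urlencode({'q': query})}"


def probe_urls(hosts):
    """Map each host to the URL a contributor would open and inspect by hand."""
    return {host: probe_url(host) for host in hosts}


if __name__ == "__main__":
    for host, url in probe_urls(DATABASE_HOSTS).items():
        print(f"{host}\t{url}")
```

The tab-separated output can be pasted straight into a wiki page or spreadsheet, with a column added for what each search actually returned.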
I'd offer up a page myself, but being in a corporate library I don't
have a public-facing page to offer. Anyone else?
--Will
|
|
| RE: Which databases can Google Scholar crawl? |
  Australia |
2008-02-26 23:37:44 |
Does anyone know the status of what was the Crossref/Google Pilot?
http://www.crossref.org/crossrefsearch.html
It started in 2004 (I think): Google was indexing Crossref publisher
members' sites, and publishers were putting search boxes for it on
their sites. Nature's is here:
http://www.nature.com/search/search_crossref.html
Near as I can tell it's just a Google search with the name/value pair
'restrict=crossref' tacked onto the Google search URL.
Weirder yet, you can add the same restriction parameter to Google
Scholar and get pretty neat results from the approximately 45
publishers involved (and listed in the first link above).
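For anyone who wants to try the name/value pair described above, here is a minimal sketch of tacking it onto an existing search URL. The parameter name and value come from the observation in this message; the base URL, the sample query and the add_param helper are assumptions for illustration, and nothing here confirms that Google still honours the parameter.

```python
from urllib.parse import parse_qsl, urlencode, urlsplit, urlunsplit


def add_param(url: str, name: str, value: str) -> str:
    """Return url with name=value in its query string, replacing any old value."""
    parts = urlsplit(url)
    # Keep every existing pair except an earlier copy of the same parameter.
    pairs = [(k, v) for k, v in parse_qsl(parts.query, keep_blank_values=True)
             if k != name]
    pairs.append((name, value))
    return urlunsplit(parts._replace(query=urlencode(pairs)))


if __name__ == "__main__":
    # Sample search URL (an assumption); any Google or Scholar search URL works.
    base = "https://www.google.com/search?q=protein+folding"
    print(add_param(base, "restrict", "crossref"))
```

Running the same helper on a Scholar search URL gives the variant mentioned in the last paragraph, which makes it easy to compare the two result sets side by side.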
This just gets murkier...
Cheers, Alan.
Alan Cockerill
Library Technologies Coordinator
James Cook University, Cairns
PO Box 6811
CAIRNS QLD 4870
Phone: (07) 4042 1737
Fax: (07) 4042 1516
Email: Alan.Cockerill jcu.edu.au
http://www.library.jcu.edu.au/Staff/alan.shtml
CRICOS Provider Code: 00117J (QLD)
> -----Original Message-----
> From: web4lib-bounces webjunction.org
> [mailto:web4lib-bounces webjunction.org] On Behalf Of Alan Cockerill
> Sent: Wednesday, 20 February 2008 8:45 AM
> To: web4lib webjunction.org
> Subject: RE: Fw: [Web4lib] Which databases can Google Scholar crawl?
>
> Thanks Will, and everyone else who's contributed,
> We still have an issue in identifying the completeness of coverage,
> but this list is a good start - if a few of us blog it maybe it'll
> attract enough attention that Google will care enough to set us
> right? (I know, I'm dreaming).
> Ciao,
> Alan.
>
> > -----Original Message-----
> > From: web4lib-bounces webjunction.org
> > [mailto:web4lib-bounces webjunction.org] On Behalf Of Will Kurt
> > Sent: Wednesday, 20 February 2008 8:11 AM
> > To: Kathryn Silberger; web4lib webjunction.org
> > Subject: Re: Fw: [Web4lib] Which databases can Google Scholar crawl?
> >
> > At 03:11 PM 2/19/2008, Will Kurt wrote:
> > >anyone else?
> > ... or that someone could be me.
> > I've taken a few minutes and quickly set up a phpWiki and put it on
> > a site that I run outside of work:
> >
> > http://lib-bling.com/scholar/index.php?GoogleScholar
> >
> > I put in a few entries mainly as a demo, but please add to it and
> > see if we can get a good list going. Knowing what is not indexed is
> > as important as knowing what is. If enough people contribute, this
> > could be really useful.
> > --Will
|
|
| RE: RE: Which databases can Google Scholar crawl? |
  Australia |
2008-02-28 17:40:44 |
As a follow-up:
Ed Pentz from CrossRef tells me that the Google/CrossRef Search Pilot
is on hold, but as far as he is aware Google is still indexing the 45
publishers involved (the rules of indexing are agreed between Google
and the individual publishers).
I'm still not sure whether Google Scholar uses that data. But using
the restrict=crossref parameter in vanilla Google you get a mini
Google Scholar. The list of publishers is:
* Alphamed Press
* American Institute of Physics
* American Physical Society
* American Psychiatric Publishing
* American Society for Biochemistry and Molecular Biology
* American Society of Civil Engineers
* Annual Reviews
* Ashley Publications
* Association for Computing Machinery
* Austrian Academy of Sciences Press
* BioMed Central
* Blackwell Publishing
* BMJ Publishing Group
* Cambridge University Press
* Cold Spring Harbor Laboratory Press
* EDP Sciences
* FASEB
* IEEE
* INFORMS
* Institute of Organic Chemistry and Biochemistry, Academy of Sciences of the Czech Republic
* Institute of Physics Publishing
* International Union of Crystallography
* Investigative Ophthalmology and Visual Science
* Institute of Pure and Applied Physics (IPAP)
* Journal of Clinical Oncology
* S. Karger AG
* Lawrence Erlbaum Associates
* Mary Ann Liebert
* Medicine Publishing Group
* Nature Publishing Group
* Oldenbourg Wissenschaftsverlag
* Oxford University Press
* Peeters Publishers
* PNAS
* RILEM Publications SARL
* Royal College of Psychiatrists
* Springer-Verlag
* Taylor & Francis
* Thieme Publishing Group
* University of California Press
* University of Chicago Press
* Vathek Publishing
* John Wiley & Sons
* Wolters Kluwer International Health & Science
* The World Bank
Alan Cockerill
Library Technologies Coordinator
James Cook University, Cairns
PO Box 6811
CAIRNS QLD 4870
Phone: (07) 4042 1737
Fax: (07) 4042 1516
Email: Alan.Cockerill jcu.edu.au
http://www.library.jcu.edu.au/Staff/alan.shtml
CRICOS Provider Code: 00117J (QLD)