List Info

Thread: Fw: Which databases can Google Scholar crawl?




Fw: Which databases can Google Scholar crawl?
country flaguser name
United States
2008-02-19 10:28:47
                                                            
              
             Kathryn                                        
              
             Silberger/ADM/Mar                              
              
             ist                                            
           To 
                                       "B.G.
Sloan" <bgsloan2yahoo.com>   
             02/19/2008 11:27                               
           cc 
             AM                                             
              
                                                            
      Subject 
                                       Re: [Web4lib] Which
databases can   
                                       Google Scholar
crawl?(Document      
                                       link: Kathryn
Silberger)            
                                                            
              
                                                            
              
                                                            
              
                                                            
              
                                                            
              
                                                            
              




Alan's question, "Has anyone ever seen, or attempted, a
canonical list?" is
a good one.  This could be an activity of some ALA
committee.  It is
certainly a crucial step in being able to evaluate GS as an
information
source.  It is important that as a profession we don't
become passive about
allowing publishers and aggregators to take control of
collection
development decisions.


Katy

Kathryn K. Silberger
Automation Resources Librarian
James A. Cannavino Library
Marist College
3399 North Road
Poughkeepsie, NY  12601
Kathryn.Silbergermarist.edu
(845) 575-3000 x.2419


                                                            
              
             "B.G. Sloan"                         
                        
             <bgsloan2yahoo.c                    
                        
             om>                                         
              To 
             Sent by:                  web4libwebjunction.org             
             web4lib-bouncesw                          
               cc 
             ebjunction.org                                 
              
                                                            
      Subject 
                                       Re: [Web4lib] Which
databases can   
             02/19/2008 09:59          Google Scholar crawl?
              
             AM                                             
              
                                                            
              
                                                            
              
                                                            
              
                                                            
              
                                                            
              





  From the Wikipedia entry on Google Scholar:

  "A significant problem with GS is the secrecy about
its coverage...GS
refuses to publish a list of scientific journals crawled,
and the frequency
of its updates is unknown. It is therefore impossible to
know how current
and/or exhaustive searches are in GS."

  Bernie Sloan

Alan Cockerill <alan.cockerilljcu.edu.au> wrote:
  Hi

Has anyone ever seen a list of what providers have 'opened
up' their
indexing and abstracting data to Google Scholar?

Clearly large free sources like Pubmed and Scirus can be
searched via
Scholar but what subscription-only dbs can be reliably
searched via
Scholar?

Common sense tells me only fulltext providers would allow
crawling of the
I&A sections of their products as Scholar opens another
channel to
potential
article purchasers.

Trial and error tells me that Emerald, JSTOR, Sage,
Informaworld and
Ingenta
(somewhat patchily) have done this, and local Oz publisher
RMIT publishing
has allowed a couple of its fulltext datasets to be crawled
by Google as
well.

Has anyone ever seen, or attempted, a canonical list?

Thanks, Alan.

Alan Cockerill
Library Technologies Coordinator
James Cook University, Cairns

PO Box 6811
CAIRNS QLD 4870
Phone: (07) 4042 1737
Fax: (07) 4042 1516
Email: Alan.Cockerilljcu.edu.au

http:/
/www.library.jcu.edu.au/Staff/alan.shtml

CRICOS Provider Code: 00117J (QLD)



_______________________________________________
Web4lib mailing list
Web4libwebjunction.org
http://lists.we
bjunction.org/web4lib/



---------------------------------
Be a better friend, newshound, and know-it-all with Yahoo!
Mobile.  Try it
now.
_______________________________________________
Web4lib mailing list
Web4libwebjunction.org
http://lists.we
bjunction.org/web4lib/


_______________________________________________
Web4lib mailing list
Web4libwebjunction.org
http://lists.we
bjunction.org/web4lib/

Re: Fw: Which databases can Google Scholar crawl?
country flaguser name
United States
2008-02-19 14:11:20
At 11:28 AM 2/19/2008, Kathryn Silberger wrote:
>This could be an activity of some ALA committee.

Why don't we just expand Alan's original trial & error
approach?

If someone has a wiki page they could spare, the collective
web4lib 
community could probably put together a reasonably thorough
listing 
in a matter of hours.

Just pick a few databases from your collection, search 
inurl:my.database.com (or various other methods) and post
your 
findings/difficulties.
While not absolutely exhaustive it would give a pretty
reasonable 
sense of the coverage for major databases.

I'd offer up a page myself, but being in a corporate library
I don't 
have a public facing page to offer, anyone else?

--Will


_______________________________________________
Web4lib mailing list
Web4libwebjunction.org
http://lists.we
bjunction.org/web4lib/

RE: Which databases can Google Scholar crawl?
country flaguser name
Australia
2008-02-26 23:37:44
Does anyone know the status of what was the Crossref/Google
Pilot?
http://ww
w.crossref.org/crossrefsearch.html
It started in 2004 (I think) and Google was index Crossref
publisher
members' sites and publisher sites were putting search boxes
for it on their
sites, Nature's is here:
htt
p://www.nature.com/search/search_crossref.html
Near as I can tell it's just a google search with the
name/value pair
'restrict=crossref' tacked onto the google search URL.
Weirder yet, you can add the same restriction parameter to
Google Scholar
and get pretty neat results from the approximately 45
publishers involved
(and listed in the first link above).
This just gets murkier...
Cheers, Alan.
Alan Cockerill
Library Technologies Coordinator
James Cook University, Cairns 
PO Box 6811
CAIRNS QLD 4870
Phone: (07) 4042 1737
Fax: (07) 4042 1516
Email: Alan.Cockerilljcu.edu.au
http:/
/www.library.jcu.edu.au/Staff/alan.shtml

CRICOS Provider Code: 00117J (QLD) 


> -----Original Message-----
> From: web4lib-bounceswebjunction.org
[mailto:web4lib-
> bounceswebjunction.org] On Behalf Of Alan Cockerill
> Sent: Wednesday, 20 February 2008 8:45 AM
> To: web4libwebjunction.org
> Subject: RE: Fw: [Web4lib] Which databases can Google
Scholar crawl?
> 
> Thanks Will, and everyone else who's contributed,
> We still have an issue in identifying the completeness
of coverage, but
> this
> list is a good start - if a few of us blog it maybe
it'll attract enough
> attention that Google will care enough to set us right?
(I know, I'm
> dreaming).
> Ciao,
> Alan.
> 
> Alan Cockerill
> Library Technologies Coordinator
> James Cook University, Cairns
> 
> PO Box 6811
> CAIRNS QLD 4870
> Phone: (07) 4042 1737
> Fax: (07) 4042 1516
> Email: Alan.Cockerilljcu.edu.au
> http:/
/www.library.jcu.edu.au/Staff/alan.shtml
> 
> CRICOS Provider Code: 00117J (QLD)
> 
> 
> > -----Original Message-----
> > From: web4lib-bounceswebjunction.org
[mailto:web4lib-
> > bounceswebjunction.org] On Behalf Of Will Kurt
> > Sent: Wednesday, 20 February 2008 8:11 AM
> > To: Kathryn Silberger; web4libwebjunction.org
> > Subject: Re: Fw: [Web4lib] Which databases can
Google Scholar crawl?
> >
> > At 03:11 PM 2/19/2008, Will Kurt wrote:
> > >anyone else?
> > ... or that someone could be me 
> > I've taken a few minutes and quickly setup a
phpWiki and put it on a
> > site that I run outside of work:
> >
> > 
http://lib-bling.com/scholar/index.php?GoogleScholar
> >
> > I put in a few entries mainly as a demo, but
please add and see if we
> > can get a good list going.  Knowing what is not
indexed is equally as
> > important as knowing what is. If enough people
contribute this could
> > be a really useful.
> > --Will
> >
> > _______________________________________________
> > Web4lib mailing list
> > Web4libwebjunction.org
> > http://lists.we
bjunction.org/web4lib/
> 
> _______________________________________________
> Web4lib mailing list
> Web4libwebjunction.org
> http://lists.we
bjunction.org/web4lib/

_______________________________________________
Web4lib mailing list
Web4libwebjunction.org
http://lists.we
bjunction.org/web4lib/

RE: Which databases can Google Scholar crawl?
country flaguser name
Australia
2008-02-26 23:37:44
Does anyone know the status of what was the Crossref/Google
Pilot?
http://ww
w.crossref.org/crossrefsearch.html
It started in 2004 (I think) and Google was index Crossref
publisher
members' sites and publisher sites were putting search boxes
for it on their
sites, Nature's is here:
htt
p://www.nature.com/search/search_crossref.html
Near as I can tell it's just a google search with the
name/value pair
'restrict=crossref' tacked onto the google search URL.
Weirder yet, you can add the same restriction parameter to
Google Scholar
and get pretty neat results from the approximately 45
publishers involved
(and listed in the first link above).
This just gets murkier...
Cheers, Alan.
Alan Cockerill
Library Technologies Coordinator
James Cook University, Cairns 
PO Box 6811
CAIRNS QLD 4870
Phone: (07) 4042 1737
Fax: (07) 4042 1516
Email: Alan.Cockerilljcu.edu.au
http:/
/www.library.jcu.edu.au/Staff/alan.shtml

CRICOS Provider Code: 00117J (QLD) 


> -----Original Message-----
> From: web4lib-bounceswebjunction.org
[mailto:web4lib-
> bounceswebjunction.org] On Behalf Of Alan Cockerill
> Sent: Wednesday, 20 February 2008 8:45 AM
> To: web4libwebjunction.org
> Subject: RE: Fw: [Web4lib] Which databases can Google
Scholar crawl?
> 
> Thanks Will, and everyone else who's contributed,
> We still have an issue in identifying the completeness
of coverage, but
> this
> list is a good start - if a few of us blog it maybe
it'll attract enough
> attention that Google will care enough to set us right?
(I know, I'm
> dreaming).
> Ciao,
> Alan.
> 
> Alan Cockerill
> Library Technologies Coordinator
> James Cook University, Cairns
> 
> PO Box 6811
> CAIRNS QLD 4870
> Phone: (07) 4042 1737
> Fax: (07) 4042 1516
> Email: Alan.Cockerilljcu.edu.au
> http:/
/www.library.jcu.edu.au/Staff/alan.shtml
> 
> CRICOS Provider Code: 00117J (QLD)
> 
> 
> > -----Original Message-----
> > From: web4lib-bounceswebjunction.org
[mailto:web4lib-
> > bounceswebjunction.org] On Behalf Of Will Kurt
> > Sent: Wednesday, 20 February 2008 8:11 AM
> > To: Kathryn Silberger; web4libwebjunction.org
> > Subject: Re: Fw: [Web4lib] Which databases can
Google Scholar crawl?
> >
> > At 03:11 PM 2/19/2008, Will Kurt wrote:
> > >anyone else?
> > ... or that someone could be me 
> > I've taken a few minutes and quickly setup a
phpWiki and put it on a
> > site that I run outside of work:
> >
> > 
http://lib-bling.com/scholar/index.php?GoogleScholar
> >
> > I put in a few entries mainly as a demo, but
please add and see if we
> > can get a good list going.  Knowing what is not
indexed is equally as
> > important as knowing what is. If enough people
contribute this could
> > be a really useful.
> > --Will
> >
> > _______________________________________________
> > Web4lib mailing list
> > Web4libwebjunction.org
> > http://lists.we
bjunction.org/web4lib/
> 
> _______________________________________________
> Web4lib mailing list
> Web4libwebjunction.org
> http://lists.we
bjunction.org/web4lib/

_______________________________________________
Web4lib mailing list
Web4libwebjunction.org
http://lists.we
bjunction.org/web4lib/

RE: RE: Which databases can Google Scholar crawl?
country flaguser name
Australia
2008-02-28 17:40:44
As some follow up
Ed Pentz from CrossRef tells me that the Google/CrossRef
Search Pilot is on
hold, but as far as he is aware Google is still indexing the
45 publishers
involved (rules of indexing agreed between Google and
individual
publishers).
I'm still not sure whether Google Scholar uses that data.
But using the
restriction=crossref in vanilla google you get a mini google
scholar. The
list of publishers is:
    *  Alphamed Press
    * American Institute of Physics
    * American Physical Society
    * American Psychiatric Publishing
    * American Society for Biochemistry and Molecular
Biology
    * American Society of Civil Engineers
    * Annual Reviews
    * Ashley Publications
    * Association for Computing Machinery
    * Austrian Academy of Sciences Press
    * BioMed Central
    * Blackwell Publishing
    * BMJ Publishing Group
    * Cambridge University Press
    * Cold Spring Harbor Laboratory Press
    * EDP Science
    * FASEB
    * IEEE
    * INFORMS
    * Institute of Organic Chemistry and Biochemistry,
Academy of Sciences
of the Czech Republic
    * Institute of Physics Publishing
    * International Union of Crystallography
    * Investigative Ophthamology and Visual Science
    * Institute of Pure and Applied Physics (IPAP)
    * Journal of Clinical Oncology
    * S. Karger AG
    * Lawrence Erlbaum Associates
    * Mary Ann Liebert
    * Medicine Publishing Group
    * Nature Publishing Group
    * Oldenbourg Wissenschaftsverlag
    * Oxford University Press
    * Peeters Publishers
    * PNAS
    * RILEM Publications SARL
    * Royal College of Psychiatrists
    * Springer-Verlag
    * Taylor & Francis
    * Thieme Publishing Group
    * University of California Press
    * University of Chicago Press
    * Vathek Publishing
    * John Wiley & Sons
    * Wolters Kluwer International Health & Science
    * The World Bank

Alan Cockerill
Library Technologies Coordinator
James Cook University, Cairns 

PO Box 6811
CAIRNS QLD 4870
Phone: (07) 4042 1737
Fax: (07) 4042 1516
Email: Alan.Cockerilljcu.edu.au
http:/
/www.library.jcu.edu.au/Staff/alan.shtml

CRICOS Provider Code: 00117J (QLD) 


> -----Original Message-----
> From: web4lib-bounceswebjunction.org
[mailto:web4lib-
> bounceswebjunction.org] On Behalf Of Alan Cockerill
> Sent: Wednesday, 27 February 2008 3:38 PM
> To: web4libwebjunction.org
> Subject: [Web4lib] RE: Which databases can Google
Scholar crawl?
> 
> Does anyone know the status of what was the
Crossref/Google Pilot?
> http://ww
w.crossref.org/crossrefsearch.html
> It started in 2004 (I think) and Google was index
Crossref publisher
> members' sites and publisher sites were putting search
boxes for it on
> their
> sites, Nature's is here:
> htt
p://www.nature.com/search/search_crossref.html
> Near as I can tell it's just a google search with the
name/value pair
> 'restrict=crossref' tacked onto the google search URL.
> Weirder yet, you can add the same restriction parameter
to Google Scholar
> and get pretty neat results from the approximately 45
publishers involved
> (and listed in the first link above).
> This just gets murkier...
> Cheers, Alan.
> Alan Cockerill
> Library Technologies Coordinator
> James Cook University, Cairns
> PO Box 6811
> CAIRNS QLD 4870
> Phone: (07) 4042 1737
> Fax: (07) 4042 1516
> Email: Alan.Cockerilljcu.edu.au
> http:/
/www.library.jcu.edu.au/Staff/alan.shtml
> 
> CRICOS Provider Code: 00117J (QLD)
> 
> 
> > -----Original Message-----
> > From: web4lib-bounceswebjunction.org
[mailto:web4lib-
> > bounceswebjunction.org] On Behalf Of Alan
Cockerill
> > Sent: Wednesday, 20 February 2008 8:45 AM
> > To: web4libwebjunction.org
> > Subject: RE: Fw: [Web4lib] Which databases can
Google Scholar crawl?
> >
> > Thanks Will, and everyone else who's contributed,
> > We still have an issue in identifying the
completeness of coverage, but
> > this
> > list is a good start - if a few of us blog it
maybe it'll attract enough
> > attention that Google will care enough to set us
right? (I know, I'm
> > dreaming).
> > Ciao,
> > Alan.
> >
> > Alan Cockerill
> > Library Technologies Coordinator
> > James Cook University, Cairns
> >
> > PO Box 6811
> > CAIRNS QLD 4870
> > Phone: (07) 4042 1737
> > Fax: (07) 4042 1516
> > Email: Alan.Cockerilljcu.edu.au
> > http:/
/www.library.jcu.edu.au/Staff/alan.shtml
> >
> > CRICOS Provider Code: 00117J (QLD)
> >
> >
> > > -----Original Message-----
> > > From: web4lib-bounceswebjunction.org
[mailto:web4lib-
> > > bounceswebjunction.org] On Behalf Of Will Kurt
> > > Sent: Wednesday, 20 February 2008 8:11 AM
> > > To: Kathryn Silberger; web4libwebjunction.org
> > > Subject: Re: Fw: [Web4lib] Which databases
can Google Scholar crawl?
> > >
> > > At 03:11 PM 2/19/2008, Will Kurt wrote:
> > > >anyone else?
> > > ... or that someone could be me 
> > > I've taken a few minutes and quickly setup a
phpWiki and put it on a
> > > site that I run outside of work:
> > >
> > > 
http://lib-bling.com/scholar/index.php?GoogleScholar
> > >
> > > I put in a few entries mainly as a demo, but
please add and see if we
> > > can get a good list going.  Knowing what is
not indexed is equally as
> > > important as knowing what is. If enough
people contribute this could
> > > be a really useful.
> > > --Will
> > >
> > >
_______________________________________________
> > > Web4lib mailing list
> > > Web4libwebjunction.org
> > > http://lists.we
bjunction.org/web4lib/
> >
> > _______________________________________________
> > Web4lib mailing list
> > Web4libwebjunction.org
> > http://lists.we
bjunction.org/web4lib/
> 
> _______________________________________________
> Web4lib mailing list
> Web4libwebjunction.org
> http://lists.we
bjunction.org/web4lib/

_______________________________________________
Web4lib mailing list
Web4libwebjunction.org
http://lists.we
bjunction.org/web4lib/

RE: RE: Which databases can Google Scholar crawl?
country flaguser name
Australia
2008-02-28 17:40:44
As some follow up
Ed Pentz from CrossRef tells me that the Google/CrossRef
Search Pilot is on
hold, but as far as he is aware Google is still indexing the
45 publishers
involved (rules of indexing agreed between Google and
individual
publishers).
I'm still not sure whether Google Scholar uses that data.
But using the
restriction=crossref in vanilla google you get a mini google
scholar. The
list of publishers is:
    *  Alphamed Press
    * American Institute of Physics
    * American Physical Society
    * American Psychiatric Publishing
    * American Society for Biochemistry and Molecular
Biology
    * American Society of Civil Engineers
    * Annual Reviews
    * Ashley Publications
    * Association for Computing Machinery
    * Austrian Academy of Sciences Press
    * BioMed Central
    * Blackwell Publishing
    * BMJ Publishing Group
    * Cambridge University Press
    * Cold Spring Harbor Laboratory Press
    * EDP Science
    * FASEB
    * IEEE
    * INFORMS
    * Institute of Organic Chemistry and Biochemistry,
Academy of Sciences
of the Czech Republic
    * Institute of Physics Publishing
    * International Union of Crystallography
    * Investigative Ophthamology and Visual Science
    * Institute of Pure and Applied Physics (IPAP)
    * Journal of Clinical Oncology
    * S. Karger AG
    * Lawrence Erlbaum Associates
    * Mary Ann Liebert
    * Medicine Publishing Group
    * Nature Publishing Group
    * Oldenbourg Wissenschaftsverlag
    * Oxford University Press
    * Peeters Publishers
    * PNAS
    * RILEM Publications SARL
    * Royal College of Psychiatrists
    * Springer-Verlag
    * Taylor & Francis
    * Thieme Publishing Group
    * University of California Press
    * University of Chicago Press
    * Vathek Publishing
    * John Wiley & Sons
    * Wolters Kluwer International Health & Science
    * The World Bank

Alan Cockerill
Library Technologies Coordinator
James Cook University, Cairns 

PO Box 6811
CAIRNS QLD 4870
Phone: (07) 4042 1737
Fax: (07) 4042 1516
Email: Alan.Cockerilljcu.edu.au
http:/
/www.library.jcu.edu.au/Staff/alan.shtml

CRICOS Provider Code: 00117J (QLD) 


> -----Original Message-----
> From: web4lib-bounceswebjunction.org
[mailto:web4lib-
> bounceswebjunction.org] On Behalf Of Alan Cockerill
> Sent: Wednesday, 27 February 2008 3:38 PM
> To: web4libwebjunction.org
> Subject: [Web4lib] RE: Which databases can Google
Scholar crawl?
> 
> Does anyone know the status of what was the
Crossref/Google Pilot?
> http://ww
w.crossref.org/crossrefsearch.html
> It started in 2004 (I think) and Google was index
Crossref publisher
> members' sites and publisher sites were putting search
boxes for it on
> their
> sites, Nature's is here:
> htt
p://www.nature.com/search/search_crossref.html
> Near as I can tell it's just a google search with the
name/value pair
> 'restrict=crossref' tacked onto the google search URL.
> Weirder yet, you can add the same restriction parameter
to Google Scholar
> and get pretty neat results from the approximately 45
publishers involved
> (and listed in the first link above).
> This just gets murkier...
> Cheers, Alan.
> Alan Cockerill
> Library Technologies Coordinator
> James Cook University, Cairns
> PO Box 6811
> CAIRNS QLD 4870
> Phone: (07) 4042 1737
> Fax: (07) 4042 1516
> Email: Alan.Cockerilljcu.edu.au
> http:/
/www.library.jcu.edu.au/Staff/alan.shtml
> 
> CRICOS Provider Code: 00117J (QLD)
> 
> 
> > -----Original Message-----
> > From: web4lib-bounceswebjunction.org
[mailto:web4lib-
> > bounceswebjunction.org] On Behalf Of Alan
Cockerill
> > Sent: Wednesday, 20 February 2008 8:45 AM
> > To: web4libwebjunction.org
> > Subject: RE: Fw: [Web4lib] Which databases can
Google Scholar crawl?
> >
> > Thanks Will, and everyone else who's contributed,
> > We still have an issue in identifying the
completeness of coverage, but
> > this
> > list is a good start - if a few of us blog it
maybe it'll attract enough
> > attention that Google will care enough to set us
right? (I know, I'm
> > dreaming).
> > Ciao,
> > Alan.
> >
> > Alan Cockerill
> > Library Technologies Coordinator
> > James Cook University, Cairns
> >
> > PO Box 6811
> > CAIRNS QLD 4870
> > Phone: (07) 4042 1737
> > Fax: (07) 4042 1516
> > Email: Alan.Cockerilljcu.edu.au
> > http:/
/www.library.jcu.edu.au/Staff/alan.shtml
> >
> > CRICOS Provider Code: 00117J (QLD)
> >
> >
> > > -----Original Message-----
> > > From: web4lib-bounceswebjunction.org
[mailto:web4lib-
> > > bounceswebjunction.org] On Behalf Of Will Kurt
> > > Sent: Wednesday, 20 February 2008 8:11 AM
> > > To: Kathryn Silberger; web4libwebjunction.org
> > > Subject: Re: Fw: [Web4lib] Which databases
can Google Scholar crawl?
> > >
> > > At 03:11 PM 2/19/2008, Will Kurt wrote:
> > > >anyone else?
> > > ... or that someone could be me 
> > > I've taken a few minutes and quickly setup a
phpWiki and put it on a
> > > site that I run outside of work:
> > >
> > > 
http://lib-bling.com/scholar/index.php?GoogleScholar
> > >
> > > I put in a few entries mainly as a demo, but
please add and see if we
> > > can get a good list going.  Knowing what is
not indexed is equally as
> > > important as knowing what is. If enough
people contribute this could
> > > be a really useful.
> > > --Will
> > >
> > >
_______________________________________________
> > > Web4lib mailing list
> > > Web4libwebjunction.org
> > > http://lists.we
bjunction.org/web4lib/
> >
> > _______________________________________________
> > Web4lib mailing list
> > Web4libwebjunction.org
> > http://lists.we
bjunction.org/web4lib/
> 
> _______________________________________________
> Web4lib mailing list
> Web4libwebjunction.org
> http://lists.we
bjunction.org/web4lib/

_______________________________________________
Web4lib mailing list
Web4libwebjunction.org
http://lists.we
bjunction.org/web4lib/

[1-6]

about | contact  Other archives ( Real Estate discussion Medical topics )