List Info

Thread: Re: Google News problem




Re: Google News problem
country flaguser name
Czech Republic
2007-03-20 03:36:10

Hi,

Thanks for addressing this issue. Our marketing guy has been on me about this for months ....

I may be wrong about this, but I don’t think the proposed solution will work. My understanding is that a redirect will cause crawlers from Google and other search engines to index the page under the new target URL, not the URL from which the page is being redirected. If this is the case, there would need to be content at http://[site_name]/[section_URL_name]/ in order to make Google happy.

 

As an interim solution, we’ve been playing with the idea of simply creating the desired directories and then including the relevant section pages from the current issue using php. Aside from being kind of messy, this results in slower page loads and the pages don’t recognize logins. It does produce Google-indexable pages at the desired URLs, but if there̵7;s a better way to do this in Campsite, I’d be grateful.

 

And again, my apologies if I’ve misunderstood something about how Google works!

Best,

Evan



Mugur Rus wrote:
web56504.mail.re3.yahoo.com" type="cite">
Hi,

I attached here a PHP script named section_relocation.php. Please copy this script to your
instance HTML directory:
    [campsite_dir]/www/[instance_name]/html.


E.g. for the default campsite install and default campsite instance this directory is:
    /usr/local/campsite/www/campsite/html

Open a shell and type the following commands:


1. cd [campsite_dir]/www/[instance_name]/html
2. ln -s section_relocation.php [section_URL_name]


Repeat step 2 for each section in your publication. Replace [section_URL_name] with the value of
the field "URL Name" from section configuration screen.


This will redirect requests like:
    http://[site_name]/[section_URL_name]/
to
    http://[site_name]/[default_pub_language]/[current_issue]/[section_URL_name]/

That's all.

Mugur

--- gmail.com">digitalnegociogmail.com wrote:
  
Thanks for your reply, Mugur, and sorry for my mistake in the url taxonomy.

The problem is Google needs the sections URL to categorize every article,
and campsite sections have the language/issue in it (they are not 'clean'
URL's that the spider can follow).

Maybe the solution is to create some kind of alias in the VirtualHost (
httpd.conf). Or maybe to mod_rewrite the URL to put the section in the first
place and remove the rest. I don't know and I'm a bit worried.

Thanks.




2007/3/15, Mugur Rus yahoo.com"><mugur1973yahoo.com&gt;:
    
Hi,

The correct Campsite URL is:
- http://www.server.com/
- http://www.server.com/language/
- http://www.server.com/language/issue/
- http://www.server.com/language/issue/section/
- http://www.server.com/language/issue/section/article

depending on the what are you viewing. There IS NO date in this URL. The
shortest URL -
http://www.server.com/ - ALWAYS displays the latest published issue. So if
yesterday the latest
issue was 10 and today you published issue 11 then today
http://www.server.com/ will display issue
11.

Hope this helps,
Mugur

--- gmail.com">digitalnegociogmail.com wrote:
      
Google is going to include our website in its news service. But they
        
warned
      
me about a problem with the newspaper section URL: the format
http://www.server.com/section/language/date/ will make the google spider
        
to
      
index only the actual edition of the site and not the following days.

Is there is any way to get URL's like  http://www.server.com/section1,
http://www.server.com/section2 , http://www.server.com/section3... that
poins to the actual edition of every section? Did someone front this
        
problem
      
before? What are the best options to solve it -redirection,
        
mod_rewrite...-?
      
Thanks.


        


____________________________________________________________________________________
Don't get soaked.  Take a quick peek at the forecast
with the Yahoo! Search weather shortcut.

http://tools.search.yahoo.com/shortcuts/#loc_weather


      



 
____________________________________________________________________________________

Sucker-punch spam with award-winning protection. 
Try the free Yahoo! Mail Beta.
http://advision.webevents.yahoo.com/mailbeta/features_spam.html

  


--
Evan W. Mellander

Evan W. Mellander, Ph.D.

Monitor CE / Prague Daily Monitor

Přemyslovsk&aacute; 38, 130 00 Praha 3

tel: +420 775 90 96 90

email: evanmonitorce.com">evanmonitorce.com

http://www.monitorce.com/

http://www.praguemonitor.com/

Re: Google News problem
user name
2007-03-20 12:49:37
Evan, as far as I know, the google&nbsp;spider only needs one fixed URL for section, just to know where it should begin to crawl. So Mugur's solution seems to be effective (anyway, I will tell you what they tell me about it).
&nbsp;


&nbsp;
2007/3/20, Evan Mellander < evanpraguemonitor.com">evanpraguemonitor.com>;:

Hi,

Thanks for addressing this issue. Our marketing guy has been on me about this for months ....

I may be wrong about this, but I don't think the proposed solution will work. My understanding is that a redirect will cause crawlers from Google and other search engines to index the page under the new target URL, not the URL from which the page is being redirected. If this is the case, there would need to be content at http://[site_name]/[section_URL_name]/ in order to make Google happy.

 

As an interim solution, we've been playing with the idea of simply creating the desired directories and then including the relevant section pages from the current issue using php. Aside from being kind of messy, this results in slower page loads and the pages don't recognize logins. It does produce Google-indexable pages at the desired URLs, but if there's a better way to do this in Campsite, I'd be grateful.

 

And again, my apologies if I've misunderstood something about how Google works!

Best,

Evan



Mugur Rus wrote:
web56504.mail.re3.yahoo.com" type="cite">
Hi,

I attached here a PHP script named section_relocation.php. Please copy this script to your
instance HTML directory:
    [campsite_dir]/www/[instance_name]/html.

E.g. for the default campsite install and default campsite instance this directory is:
    /usr/local/campsite/www/campsite/html

Open a shell and type the following commands:

1. cd [campsite_dir]/www/[instance_name]/html
2. ln -s section_relocation.php [section_URL_name]

Repeat step 2 for each section in your publication. Replace [section_URL_name] with the value of
the field "URL Name" from section configuration screen.

This will redirect requests like:
    http://[site_name]/[section_URL_name]/
to
    http://[site_name]/[default_pub_language]/[current_issue]/[section_URL_name]/

That9;s all.

Mugur

--- gmail.com" target="_blank">digitalnegociogmail.com wrote:
  
Thanks for your reply, Mugur, and sorry for my mistake in the url taxonomy.
The problem is Google needs the sections URL to categorize every article,
and campsite sections have the language/issue in it (they are not 'clean'
URL&#39;s that the spider can follow).

Maybe the solution is to create some kind of alias in the VirtualHost (
httpd.conf). Or maybe to mod_rewrite the URL to put the section in the first
place and remove the rest. I don't know and I'm a bit worried.

Thanks.




2007/3/15, Mugur Rus yahoo.com" target="_blank"><mugur1973yahoo.com&gt;:
    
Hi,

The correct Campsite URL is:
- http://www.server.com/
- http://www.server.com/language/
- http://www.server.com/language/issue/
- http://www.server.com/language/issue/section/
- http://www.server.com/language/issue/section/article
depending on the what are you viewing. There IS NO date in this URL. The
shortest URL -
http://www.server.com/ - ALWAYS displays the latest published issue. So if
yesterday the latest
issue was 10 and today you published issue 11 then today
http://www.server.com/ will display issue
11.

Hope this helps,
Mugur

--- gmail.com" target="_blank">digitalnegociogmail.com wrote:
      
Google is going to include our website in its news service. But they
        
warned
      
me about a problem with the newspaper section URL: the format
http://www.server.com/section/language/date/ will make the google spider
        
to
      
index only the actual edition of the site and not the following days.

Is there is any way to get URL's like  http://www.server.com/section1,
http://www.server.com/section2 , 
http://www.server.com/section3... that
poins to the actual edition of every section? Did someone front this
        
problem
      
before? What are the best options to solve it -redirection,
        
mod_rewrite...-?
      
Thanks.

        
____________________________________________________________________________________
Don't get soaked.  Take a quick peek at the forecast
with the Yahoo! Search weather shortcut.
http://tools.search.yahoo.com/shortcuts/#loc_weather

      
 
____________________________________________________________________________________
Sucker-punch spam with award-winning protection. 
Try the free Yahoo! Mail Beta.
http://advision.webevents.yahoo.com/mailbeta/features_spam.html
  


--

Evan W. Mellander, Ph.D.

Monitor CE / Prague Daily Monitor

Přemyslovská 38, 130 00 Praha 3

tel: +420 775 90 96 90

email: monitorce.com" target="_blank">evanmonitorce.com

http://www.monitorce.com/

http://www.praguemonitor.com/


[1-2]

about | contact  Other archives ( Real Estate discussion Medical topics )