phyc
Joined: Eons Ago
# Posts: 47
|
Posted: 09/17/2003 06:52 am
How do I remove an .asp page from Google's database using a robots.txt file?
|
 |
crash
Insider
Joined: Dec 02, 2003
# Posts: 10626
|
Posted: 09/17/2003 08:01 am
You can place a robots meta tag on the page or use a robots.txt file to exclude the page.
|
 |
phyc
Joined: Eons Ago
# Posts: 47
|
Posted: 09/17/2003 08:58 am
If I remove my page from my server, will Google eventually drop it?
I accidentally had a duplicate of my index page on my server, linked from one of my internal pages. I just want to make sure that Google knows it is no longer there. The last thing I want is duplicate pages; I think I might have been penalized. If I was penalized, will I ever be un-penalized? If so, how long might it take?
|
 |
thejenn
Joined: Aug 08, 2001
# Posts: 9196
|
Posted: 09/17/2003 10:02 am
You can find instructions on removing pages of your site from Google on their web site.
|
 |
bikeman
Joined: Feb 13, 2005
# Posts: 130
|
Posted: 01/20/2006 03:56 am
Doesn't work - Google just delists them for 6 months, then they come back. For some unknown reason Google never removes a page from its database. Pages that were deleted years ago are still listed.
At a time when Google wants to clean up its databases and is discouraging duplicate content, it is strange that they resolutely refuse to delete anything. Their bot knows it gets a 404, yet they still list deleted pages in their results.
By all means use the Google remove URL tool, but expect the page to come back after 6 months.
Google has lost the plot - forget about them, your clients will eventually.
|
 |
g1smd
Moderator
Joined: Jul 28, 2002
# Posts: 10181
|
Posted: 01/20/2006 04:32 am
Use <meta name="robots" content="noindex"> on the page.
|
 |
bikeman
Joined: Feb 13, 2005
# Posts: 130
|
Posted: 01/21/2006 09:15 am
That will stop a page getting listed, but it won't remove one that is already there.
|
 |
g1smd
Moderator
Joined: Jul 28, 2002
# Posts: 10181
|
Posted: 01/21/2006 12:15 pm
Oh yes it will!
It's the robots.txt file entries that will not delist an already listed file in Google.
The robots meta tag will get it delisted within a couple of weeks (sometimes just a few days).
|
 |
bikeman
Joined: Feb 13, 2005
# Posts: 130
|
Posted: 06/27/2006 04:52 am
OK, so how do you remove pages from Google's listing that have already been deleted?
Google lists literally hundreds of old pages from my site; some of these pages were deleted 2 years ago. I am not going to recreate them just so I can insert a robots meta tag.
I have an HTML sitemap and I have uploaded a Google XML sitemap. Google claims to have verified my sitemap and visits regularly, but so far it steadfastly refuses to remove any deleted pages.
What gives?
|
 |
Dinkar
Moderator
Joined: Aug 12, 2001
# Posts: 4316
|
Posted: 06/27/2006 05:29 am
how do you remove pages from google's listing that have already been deleted?
Get a court order!
|
 |
bikeman
Joined: Feb 13, 2005
# Posts: 130
|
Posted: 06/30/2006 01:40 am
Ha ha, but I really don't understand why Google does this. They have a remove page submission form that they ignore.
Isn't it in their interest not to list pages that don't exist?
What is their incentive for giving inaccurate search results?
I use MSN if I want accurate results - hopefully one day Google will get the kicking they deserve.
|
 |
g1smd
Moderator
Joined: Jul 28, 2002
# Posts: 10181
|
Posted: 06/30/2006 01:11 pm
If the page returns a 404 error, then use the Google URL Console to get rid of it from view for 6 months at a time.
|
 |
5555559999999
Joined: Jun 29, 2008
# Posts: 3
|
Posted: 06/29/2008 07:24 am
I have the below tag on ONE PAGE of my site:
<meta name="robots" content="noindex,nofollow">
It has been there for a couple of months, and the site has been crawled by Google numerous times since I put it there, but the page is still indexed. You can find it in the SERPs.
Is there anything I can do to get it removed?
IE
|
 |
Quadrille
Joined: Nov 15, 2000
# Posts: 1064
|
Posted: 06/29/2008 08:43 am
See thejenn's 2003 note above - it's still valid!
|
 |
5555559999999
Joined: Jun 29, 2008
# Posts: 3
|
Posted: 06/29/2008 05:58 pm
Well, I thought it was, which is why I added the appropriate meta tag
<meta name="robots" content="noindex,nofollow">
a couple of months ago. It is in the head section of the page, as Google says in the webmaster help center, the one thejenn refers to above.
As I say, though, I have had that tag on the page I don't want indexed for a couple of months now. During this time Google has crawled the site a dozen if not 20 times, but the page still shows up in the SERPs.
If you want to look for yourself, do a search on "Act One Terms and Conditions". The second result (as of now) is the page I don't want found. If you look in the code you'll see the noindex,nofollow tag is there.
I don't know what else to do.
IE
[ Message was edited by: 5555559999999 06/29/2008 06:48 pm ]
[ Message was edited by: 5555559999999 06/29/2008 11:16 pm ]
|
 |
Quadrille
Joined: Nov 15, 2000
# Posts: 1064
|
Posted: 06/29/2008 07:03 pm
The page says:
"If the content is currently in our index, we will remove it after the next time we crawl it. To expedite removal, use the URL removal request tool in Google Webmaster Tools."
|
 |
Dinkar
Moderator
Joined: Aug 12, 2001
# Posts: 4316
|
Posted: 06/29/2008 09:29 pm
<meta name="robots" content="noindex,nofollow">
Try using this:
<meta name="ROBOTS" content="NONE">
Also 'Disallow' the page in your robots.txt file and use their URL removal request tool.
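For anyone unsure what a 'Disallow' entry looks like, here is a minimal sketch using Python's standard urllib.robotparser to show how a crawler would read one. The path /terms.asp and the domain www.example.com are placeholders, not details from this thread, and is_allowed is just a helper name made up for the example. Keep in mind g1smd's point above that a robots.txt entry on its own will not delist a file that is already listed.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt content; /terms.asp is a placeholder path.
# The file itself would live at the root of the site, e.g. /robots.txt.
ROBOTS_TXT = """\
User-agent: *
Disallow: /terms.asp
"""


def is_allowed(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if the given robots.txt text lets user_agent fetch url."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    # The disallowed page versus any other page on the same site.
    print(is_allowed(ROBOTS_TXT, "Googlebot", "http://www.example.com/terms.asp"))
    print(is_allowed(ROBOTS_TXT, "Googlebot", "http://www.example.com/index.asp"))
```

The two lines in ROBOTS_TXT are the whole entry: "User-agent: *" applies the rule to every crawler, and "Disallow:" names the path to keep them out of.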
|
 |
5555559999999
Joined: Jun 29, 2008
# Posts: 3
|
Posted: 06/29/2008 11:19 pm
Thank you.
Does it matter where on the page I put the <meta name="ROBOTS" content="NONE"> tag?
Can you please explain to me how to disallow the page in the robots.txt file?
Thanks,
IE
|
 |
Quadrille
Joined: Nov 15, 2000
# Posts: 1064
|
Posted: 06/30/2008 01:56 am
Between <head> and </head>.
All the meta tags should be there.
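A quick way to double-check the placement is to parse the page source and see whether the robots meta tag really sits inside the head. The sketch below uses Python's standard html.parser; the class and function names and the sample markup are invented for illustration, and it only reports the directives it finds rather than interpreting them the way a search engine would.

```python
from html.parser import HTMLParser


class RobotsMetaFinder(HTMLParser):
    """Collect robots meta directives and note whether each tag is inside <head>."""

    def __init__(self):
        super().__init__()
        self.in_head = False
        self.found = []  # list of (set_of_directives, inside_head)

    def handle_starttag(self, tag, attrs):
        if tag == "head":
            self.in_head = True
        elif tag == "meta":
            attr = dict(attrs)
            # The name attribute is matched case-insensitively (robots / ROBOTS).
            if (attr.get("name") or "").lower() == "robots":
                content = (attr.get("content") or "").lower()
                directives = {d.strip() for d in content.split(",") if d.strip()}
                self.found.append((directives, self.in_head))

    def handle_endtag(self, tag):
        if tag == "head":
            self.in_head = False


def robots_directives_in_head(html: str) -> set:
    """Return the robots directives found in meta tags placed inside <head>."""
    finder = RobotsMetaFinder()
    finder.feed(html)
    result = set()
    for directives, inside_head in finder.found:
        if inside_head:
            result |= directives
    return result


if __name__ == "__main__":
    # Sample markup made up for the example.
    page = (
        "<html><head><title>Terms</title>"
        '<meta name="robots" content="noindex,nofollow">'
        "</head><body><p>Text</p></body></html>"
    )
    print(robots_directives_in_head(page))
```

If the function returns an empty set for a page that is supposed to carry the tag, the tag is either missing or sitting outside the head section.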
|
 |