DNN Forums

Ask questions about your website to get help learning DNN and help resolve issues.

Noindexing

alexismaria

2/9/2024 12:49 PM

Hi,

We have a webpage on our website that I have marked in the settings to prevent Google and other search engines from crawling it. The webpage has to be on the site, just not findable by a web crawler. I have also asked Google not to index it, which can be done temporarily. This is the webpage: https://www.nafo.int/Libr...rking-Papers/STACFAD

I also included a noindex meta tag in the page header tags:
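
The tag itself does not appear in the post as captured. As a rough sketch only, assuming the commonly used form `<meta name="robots" content="noindex">` (an assumption, not necessarily what is on the page), a quick Python check that a page's HTML carries such a tag might look like this. The names `RobotsMetaFinder`, `has_noindex`, and `SAMPLE` are hypothetical.

```python
from html.parser import HTMLParser


class RobotsMetaFinder(HTMLParser):
    """Collects the directives found in <meta name="robots"> tags."""

    def __init__(self):
        super().__init__()
        self.directives = []

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        attr_map = {name: (value or "") for name, value in attrs}
        if attr_map.get("name", "").lower() == "robots":
            # The content attribute may hold several comma-separated directives.
            parts = attr_map.get("content", "").split(",")
            self.directives.extend(p.strip().lower() for p in parts if p.strip())


def has_noindex(html_text):
    """Return True if the HTML contains a robots meta tag with 'noindex'."""
    finder = RobotsMetaFinder()
    finder.feed(html_text)
    return "noindex" in finder.directives


# Hypothetical sample page, not the real page from the post.
SAMPLE = '<html><head><meta name="robots" content="noindex, nofollow"></head></html>'
print(has_noindex(SAMPLE))
```

Running the same check against the HTML actually served for the page is one way to confirm the tag made it into the rendered output.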

I have also added rules to the robots file in the config files to disallow these pages:

Disallow: /portals/0/Images/Secretariat/
Disallow: /Working-Papers/STACFAD
Disallow: /stacfadwp21-05.pdf
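
One way to see what rules like these actually cover is to run them through Python's standard `urllib.robotparser`. This is only a sketch: the `User-agent: *` line is an assumption (the post shows just the `Disallow` lines), and the test paths are hypothetical, not the site's real URLs. It illustrates that matching is a case-sensitive prefix match starting at the site root.

```python
from urllib.robotparser import RobotFileParser

# Assumption: the rules sit under a catch-all User-agent group.
RULES = """\
User-agent: *
Disallow: /portals/0/Images/Secretariat/
Disallow: /Working-Papers/STACFAD
Disallow: /stacfadwp21-05.pdf
"""


def is_blocked(path, agent="Googlebot"):
    """Return True if the rules above disallow crawling of the given path."""
    parser = RobotFileParser()
    parser.parse(RULES.splitlines())
    return not parser.can_fetch(agent, path)


# Hypothetical paths chosen to show prefix and case behaviour.
for path in [
    "/Working-Papers/STACFAD",
    "/portals/0/Images/Secretariat/logo.png",
    "/Portals/0/Images/Secretariat/logo.png",  # different letter case
    "/Library/docs/stacfadwp21-05.pdf",        # file not at the site root
]:
    print(path, "blocked" if is_blocked(path) else "allowed")
```

If the real URLs begin with a different folder or use different letter case than the rules, a check like this would report them as allowed.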

Why do the images from these pages still show up in Google, especially when I use the search term "NAFO LOGO"? I do not want them to show up. I thought I had set everything up in these areas to prevent this from happening.

Any suggestions are welcome. Old images are showing up as well, and people seem to access them; I would prefer that they were not accessible via the internet.

Alexis

Will Strohl

2/9/2024 12:58 PM

First, it sounds like you're doing all of the correct things.

In my personal, anecdotal experience, these things should always be done when it makes sense. There are nuances, though. I've found that Google will still crawl and keep a record of everything it can, but "indexing" is treated differently from "knowing the content is there."

In the case of the images, specifically, that's very interesting. It looks like you are, again, doing the right things. I'd recommend going into the Google Search Console to try to remove them from the index and search results.

Will Strohl
Founder & CEO, Upendo Ventures

David Poindexter

2/9/2024 1:31 PM

For what it is worth, we run into some of the same issues with Google from time to time and it sounds like you are doing all the right things. Will's suggestions are great.

James Clarkson

2/9/2024 5:42 PM

For exactly this situation two years ago, we wrote IIS Rewrite rules that match an HTTP_USER_AGENT of Googlebot and redirect its requests for specific files to the 403 error page. It seems to have worked. We still needed to request removal of the images that had already been crawled and indexed. Also check what the Internet Archive (Wayback Machine) has archived.
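
The actual rewrite rules are not shown in the thread. Purely as an illustration of the matching logic such a rule would express (this is Python, not IIS configuration), the sketch below returns 403 when a Googlebot user agent asks for a protected file. The patterns, the `/thumbnails/` folder, and the `status_for` helper are all hypothetical.

```python
import re

# Hypothetical patterns; the thread does not give the real rules.
BOT_PATTERN = re.compile(r"googlebot", re.IGNORECASE)
PROTECTED_PATHS = re.compile(r"^/thumbnails/.*\.jpe?g$", re.IGNORECASE)


def status_for(user_agent, path):
    """Return 403 when a Googlebot user agent requests a protected file, else 200."""
    if BOT_PATTERN.search(user_agent or "") and PROTECTED_PATHS.match(path):
        return 403
    return 200


googlebot = "Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)"
browser = "Mozilla/5.0 (Windows NT 10.0; Win64; x64)"

print(status_for(googlebot, "/thumbnails/report.jpg"))
print(status_for(browser, "/thumbnails/report.jpg"))
```

Matching on the substring also catches the `Googlebot-Image` user agent. Because a user agent string can be set to anything by the client, this kind of rule keeps well-behaved crawlers out rather than securing the files.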

David Poindexter

2/9/2024 5:52 PM

Interesting and creative solution, James - thanks for sharing!

RichardHowells

2/9/2024 6:32 PM

I think that robots.txt ONLY requests "Please don't crawl this area". In my mind that's not "Please don't crawl this area AND delete anything you already have." So I assume that if those pages have EVER been crawled then, in principle, Google has them.

It's never been entirely clear to me *why* we would block a crawler. The pages/images presumably are not secret/confidential. If they were then I'd expect they'd be behind a password challenge. Why not just let the crawlers crawl?

James Clarkson

2/9/2024 6:43 PM

In our case there were JPG thumbnails of copyrighted PDFs. The PDFs were protected behind registration sign-up, and the copyright holder did not want anything indexed by Google (not even the thumbnail images), but we still had to show the thumbnails on the site to encourage people to sign up.

David Poindexter

2/9/2024 7:10 PM

Posted By RichardHowells on 2/9/2024 5:32 PM
I think that robots.txt ONLY requests "Please don't crawl this area". In my mind that's not "Please don't crawl this area AND delete anything you already have." So I assume that if those pages have EVER been crawled then, in principle, Google has them.

It's never been entirely clear to me *why* we would block a crawler. The pages/images presumably are not secret/confidential. If they were then I'd expect they'd be behind a password challenge. Why not just let the crawlers crawl?

There are many reasons to keep a page from being indexed (avoid duplicate content SEO issues, marketing landing pages, special purpose pages that should be visited only from a specific user journey, etc.).

Marco Alvarado

2/12/2024 3:10 PM

You certainly want to check what's indexed; you can type "site:yourdomain.com" in Google search and it'll show all the pages Google has indexed from the specified domain. I use this a lot whenever I have to double-check what's public and what's not.
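
For repeated checks, the same "site:" query can be turned into a search URL. A minimal Python sketch, assuming the standard `https://www.google.com/search?q=...` URL form; the `site_search_url` helper and the example terms are hypothetical.

```python
from urllib.parse import urlencode, urlsplit, parse_qs


def site_search_url(domain, terms=""):
    """Build a Google search URL restricted to one domain with the site: operator."""
    query = f"site:{domain} {terms}".strip()
    return "https://www.google.com/search?" + urlencode({"q": query})


url = site_search_url("nafo.int", "STACFAD")
print(url)

# Round-trip check: decode the query string back to the search text.
print(parse_qs(urlsplit(url).query)["q"][0])
```

Opening the printed URL in a browser shows the same results as typing the query by hand.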

Marco Alvarado

2/18/2024 6:55 PM

Hi everybody! I just found this article that will help you delete an image from the Google index:

https://www.searchenginej...-search-index/508458
