# Sitemap
# -----------------------
#
Sitemap: http://blog.culturadigital.org/sitemap/

# DENYING ACCESS BY URL:
# -----------------------
#
User-agent: *
Disallow: /blog
Disallow: /cgi-bin
Disallow: /media
Disallow: /static
Disallow: http://static.culturadigital.org/
Disallow: http://media.culturadigital.org/curriculo/
Disallow: /foro_escuadrilla1sc
## Blog (WordPress):
# Root: index.php only:
Disallow: http://blog.culturadigital.org/wp-*
Disallow: http://blog.culturadigital.org/xmlrpc.php
# Dirs:
Disallow: http://blog.culturadigital.org/wp-admin
Disallow: http://blog.culturadigital.org/wp-content
Disallow: http://blog.culturadigital.org/wp-includes
# Make sure subdirs are covered:
Disallow: http://blog.culturadigital.org/wp-content/plugins
Disallow: http://blog.culturadigital.org/wp-content/themes
Disallow: http://blog.culturadigital.org/wp-content/gallery
Disallow: http://blog.culturadigital.org/wp-content/mycache
Disallow: http://blog.culturadigital.org/wp-content/subidas
Disallow: http://blog.culturadigital.org/wp-content/upgrade
Disallow: http://blog.culturadigital.org/wp-content/languages
# Blog-specific sections:
Disallow: http://blog.culturadigital.org/apps
Disallow: http://blog.culturadigital.org/cache
Disallow: http://blog.culturadigital.org/7maravillas
# Dynamic URLs not allowed:
Disallow: http://blog.culturadigital.org/?*
# Make sure other dynamic URLs are covered:
Disallow: http://blog.culturadigital.org/?dl_id=*
Disallow: http://blog.culturadigital.org/?s=*
Disallow: http://blog.culturadigital.org/search*
Disallow: http://blog.culturadigital.org/buscar*
# Feeds, comments and trackbacks (these tend to be duplicated):
Disallow: http://blog.culturadigital.org/trackback
Disallow: http://blog.culturadigital.org/feed
Disallow: http://blog.culturadigital.org/comments
Disallow: http://blog.culturadigital.org/search
Disallow: http://blog.culturadigital.org/buscar
Disallow: http://blog.culturadigital.org/category
Disallow: http://blog.culturadigital.org/categoría
Disallow: http://blog.culturadigital.org/temas
# Make sure:
Disallow: */trackback
Disallow: */feed
Disallow: */comments
Disallow: */search
Disallow: */buscar
# Root: index.php only:
Disallow: /wp-
Disallow: /xmlrpc.php
# File types
# -----------------------
#
# Htaccess
Disallow: /*.htaccess$
# Ini
Disallow: /*.ini$
# CSS
Disallow: /*.css$
# JavaScript
Disallow: /*.js$
# Scripts
## Perl
Disallow: /*.pl$

# DENYING ACCESS BY ROBOT:
# -----------------------
#
# List of bots that usually respect robots.txt but rarely
# make good use of the site and abuse it quite a bit...
# Add more to taste...
# Thanks to Sigt: http://sigt.net/archivo/robotstxt-para-wordpress.xhtml

# Google AdSense (We don't run AdSense, so out.)
User-agent: Mediapartners-Google*
Disallow: /

# Internet Archiver Wayback Machine
User-agent: ia_archiver
Disallow: /

# digg mirror
User-agent: duggmirror
Disallow: /

User-agent: MSIECrawler
Disallow: /

User-agent: WebCopier
Disallow: /

User-agent: HTTrack
Disallow: /

User-agent: Microsoft.URL.Control
Disallow: /

User-agent: libwww
Disallow: /

# Slowing down some odd bots.
User-agent: noxtrumbot
Crawl-delay: 50

User-agent: msnbot
Crawl-delay: 30

User-agent: Slurp
Crawl-delay: 10