Hvad er Robots.txt?
Hurtig definition
Robots.txt er en tekstfil, der informerer søgemaskine-bots om, hvilke sider eller sektioner af webstedet der ikke bør crawles eller indekseres.
The robots.txt file is one of the first things search engine bots check when visiting your website. Located at yoursite.com/robots.txt, it contains directives that tell crawlers which parts of your site they can access and which they should avoid. It uses a simple syntax with User-agent (which bot), Disallow (which paths to skip), and Allow (exceptions to disallow rules).
Common uses include blocking search engines from crawling admin areas, staging environments, duplicate content (like print versions of pages), internal search result pages, and private user account areas. You can also use it to point search engines to your sitemap file.
It's important to understand that robots.txt is a polite request, not a security measure. Well-behaved bots like Googlebot respect it, but malicious bots may ignore it entirely. Sensitive content should be protected with authentication, not robots.txt.
Misconfigured robots.txt files are one of the most common technical SEO mistakes. A single misplaced directive can accidentally block your entire site from being indexed, or prevent search engines from accessing CSS and JavaScript files they need to properly render your pages.
Hvorfor det er vigtigt
Robots.txt directly controls what search engines can and cannot see on your website. A well-configured file helps search engines focus their limited crawl budget on your most important pages. A misconfigured one can make your entire website invisible to Google.
For large websites, robots.txt is essential for crawl budget management — preventing bots from wasting time on low-value URLs means they spend more time indexing the pages that matter.
Eksempler fra den virkelige verden
A company's new developer accidentally added Disallow: / to robots.txt, blocking Google from their entire site and causing traffic to drop 90% before anyone noticed
An e-commerce site blocked their faceted navigation URLs via robots.txt, saving thousands of pages of crawl budget for their actual product pages
A multi-site WordPress installation used robots.txt to prevent staging site content from being indexed by search engines
A SaaS platform blocked /app/ and /account/ paths to prevent internal dashboard pages from appearing in search results
Relaterede termer
Technical SEO
Teknisk SEO er processen med at optimere dit websteds infrastruktur, så søgemaskiner effektivt kan crawle, indeksere og gengive dine sider.
Crawl Budget
Crawl budget er det antal sider, en søgemaskinebot vil besøge på dit websted inden for et givet tidsinterval.
Sitemap
Et sitemap er en fil eller webside, der lister alle sider på et websted og hjælper søgemaskiner med at opdage og indeksere indhold mere effektivt.
Indexing
Indeksering er den proces, hvor søgemaskiner analyserer og gemmer information om websider i deres database og gør dem tilgængelige i søgeresultater.
Har du brug for hjælp med robots.txt?
Vores team kan hjælpe dig med at omsætte dette i praksis. Få en gratis konsultation for at diskutere dit projekt.