~ cat /bot/info

ExamMint Index Engine Bot

Details about the official web crawler for the ExamMint notifications platform.

~ cat /bot/about

About ExamMintBot

Hello! If you are checking your server logs, you probably noticed requests from ExamMintBot. This page explains who we are, what our bot does, and how we respect your website resources.

What is ExamMintBot?

ExamMintBot is the official web crawler for ExamMint. We are a platform that helps exam aspirants find Indian government exam notifications.

Our bot visits public recruitment portals to collect official exam alerts. It gathers exam dates, eligibility criteria, and official PDF documents. We then structure this data to make it easy for candidates to read and understand.

We publish all of these organized updates online. The public notifications can be found directly on notifications.exammint.in (not publicly accessible yet).

Why We Scrape Public Portals

Finding government exam updates in India can be difficult. Notifications are often scattered across many different websites.

We visit these portals to make public information more accessible. We do not crawl private data, user profiles, or behind-the-wall content. We only look at public notices meant for the general public.

telemetry: bot_specs
Rust
ENGINE_LANGUAGE
0.8s - 2.0s
REQUEST_DELAY
< 2.0 MB
MAX_PDF_SIZE
Local
SMART_CACHE
Polite & Structured. Our scraper adheres to the strict ethical guidelines of web crawling to minimize server overhead.
[header_identity]

Official User-Agent

Mozilla/5.0 (compatible; ExamMintBot/1.0; +https://exammint.in/bot)
~ cat /bot/guidelines

Our Polite Scraping Guidelines

Our bot is built in Rust to be highly efficient and gentle on your servers.

No Server Hammering

We do not flood your website with traffic. Our bot implements random delays between downloads, ranging from 0.8 to 2.0 seconds per request.

Isolated Requests

We queue our portal visits carefully. We only scrape a small, conservative number of pages per site during each execution run.

Lightweight PDFs

We limit our PDF downloads to a maximum size of 2 MB. This protects your server's outbound bandwidth limits.

Identifying User-Agent

We transparently state who we are in our request headers, linking directly to this documentation page in the User-Agent.

Smart Caching

We cache data locally. If we have already downloaded a notice or verified it is unchanged, we do not download it again, saving server load.

Respectful Crawler

Our crawler acts respectfully. We follow standard robots.txt specifications and obey site admin crawl guidelines promptly.

~ cat /bot/contact

Contact & Opt-Out

Feel free to reach out to adjust speeds or request blocking.

[admin_contact]

Need to get in touch?

If our bot is causing any issues or excessive load on your site, please email us. We will adjust our crawl speed or resolve the problems immediately.

~ cat /etc/robots.txt

If you want to block our bot entirely from indexing your public resources, you can add the following rules to your robots.txt file:

User-agent: ExamMintBot
Disallow: /

Thank you for helping us keep government exam candidates informed!