Data Mining & Scraping Policy

Last Updated: November 5, 2025

Purpose

This policy explains how Cibarious content may and may not be accessed, collected, or used. We've created valuable ingredient narratives through significant research and curation, and we want to protect that work while supporting legitimate uses.

General Policy

Automated data collection (scraping, crawling, harvesting) of Cibarious content is prohibited without explicit written permission, except as specifically permitted below.

What Is Prohibited

Automated Scraping & Data Mining

You may not use automated tools to:

Commercial Use Without Permission

You may not:

Prohibited Techniques

You may not:

What Is Permitted

Search Engines

Legitimate search engine crawlers (Google, Bing, DuckDuckGo, etc.) may crawl and index our public content according to our robots.txt file.
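
For crawler operators, a minimal sketch of checking a URL against robots.txt rules before fetching it might look like the following. It uses Python's standard urllib.robotparser module. The sample rules, the "ExampleBot" user agent, and the URL paths are hypothetical and shown for illustration only; they do not reflect the actual Cibarious robots.txt file.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt content for illustration only.
# This is NOT the actual Cibarious robots.txt file.
SAMPLE_ROBOTS_TXT = """\
User-agent: *
Disallow: /private/
"""


def build_parser(robots_txt: str) -> RobotFileParser:
    """Build a parser from robots.txt text without making a network request."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser


def may_fetch(parser: RobotFileParser, user_agent: str, url: str) -> bool:
    """Return True if the parsed rules allow user_agent to fetch url."""
    return parser.can_fetch(user_agent, url)


parser = build_parser(SAMPLE_ROBOTS_TXT)

# "ExampleBot" and both paths below are assumed names for this sketch.
allowed = may_fetch(parser, "ExampleBot", "https://www.cibarious.org/ingredients/")
blocked = may_fetch(parser, "ExampleBot", "https://www.cibarious.org/private/page")
```

In practice a crawler would load the live file (for example with RobotFileParser.set_url and read) and skip any URL for which the check returns False.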

AI Training & Research Models

AI research organizations and language model developers may crawl our content for training purposes, provided:

Personal Use

Individual users may:

Personal use does NOT include:

Academic & Non-Commercial Research

Researchers and academics wishing to use our content must:

We generally look favorably on legitimate research requests but require advance permission.

Attribution Requirements

If you have obtained permission to use Cibarious content, you must:

For Written Content:

Example Citation:

Source: Cibarious Ingredient Database (www.cibarious.org), accessed [date]

For AI Training & Datasets:

Technical Measures

To protect our content and ensure service quality, we implement:

Access Controls

Machine-Readable Signals

Monitoring & Response

We actively monitor for:

Enforcement

Violations

Unauthorized scraping or data mining may result in:

Good Faith Violations

If you've inadvertently violated this policy:

We're reasonable with honest mistakes but take intentional violations seriously.

Requesting Permission

How to Request Access

If you need authorized access to our content:
  1. Email us at [email protected] with:
     - Your name and organization
     - Purpose and scope of intended use
     - Technical details (if applicable)
     - Timeline and duration
     - Attribution commitment
  2. Response timeline: We aim to respond within 5-7 business days
  3. Approval process: We evaluate requests based on:
     - Legitimate purpose
     - Non-commercial vs. commercial use
     - Potential impact on our service
     - Attribution and citation commitments

Commercial Licensing

For commercial use of our content:

Why This Policy Exists

We've invested significant effort in creating high-quality ingredient narratives. This policy protects:

We support legitimate research and education while preventing commercial exploitation that undermines our ability to provide free, quality content.

Changes to This Policy

We may update this policy as:

Material changes will be indicated by a revised "Last Updated" date. Continued scraping after a policy update constitutes acceptance of the new terms.

Related Policies

This policy should be read alongside:

Contact

Questions about this policy? Email: [email protected]

Report unauthorized use: If you discover Cibarious content being used in violation of this policy, please let us know at [email protected]

Request permission: Academic researchers and others seeking authorized access should email [email protected] with details of their intended use.

We believe in open knowledge while protecting the sustainability of quality content creation. This policy balances those values.