Open Data Philosophy
What open data means in the context of AI.
Open data in the context of artificial intelligence means more than publishing a dataset once a year. It means that every table, every field, every piece of information that a platform collects about its users is documented, accessible, and understandable — in real time, by the people it belongs to. Future AI was built around this principle from its first line of code. The schema documented on this page is not a marketing exercise. It is the actual database schema in production, updated whenever a structural change is made.
When people talk about AI transparency, they typically mean explainability — the ability to understand why a model produces a particular output. That matters. But there is a more fundamental layer of transparency that is almost universally ignored by AI platforms: data transparency. Before you can understand what an AI knows, you need to understand what data was used to build it, what data it continues to collect, and what happens to that data at every stage of the pipeline. This page answers all three questions in plain language.
The Future AI data pipeline works as follows. When a user sends a message in the workspace, that message is stored in a private messages table that only the user can read. The AI generates a response, which is stored alongside the original message. The server counts the words in the exchange. If the cumulative word count crosses the 10,000-word threshold, one Gold Coin is credited to the user's account and logged in the coin_transactions table. The question-and-answer pair is then anonymised — all user identifiers are removed — and stored in the knowledge_base table, which is readable by anyone.
This architecture was designed around the data-minimisation principle of the General Data Protection Regulation (Article 5(1)(c)): collect only what is necessary to deliver the service. For Future AI that means a profile with a display name and email address, a record of conversations and messages, a coin balance and transaction history, and a list of unlocked features. No location data. No behavioural fingerprinting. No advertising profiles. No data enrichment from third-party brokers.
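The full set of collected data fits in a schema this small. A minimal sketch, assuming SQLite; the five categories come from the list above, but every table and column name here is illustrative rather than the production schema:

```python
import sqlite3

db = sqlite3.connect(":memory:")
db.executescript("""
    -- One table per category of data the page says is collected.
    CREATE TABLE profiles          (id INTEGER PRIMARY KEY,
                                    display_name TEXT, email TEXT);
    CREATE TABLE conversations     (id INTEGER PRIMARY KEY, user_id INTEGER);
    CREATE TABLE messages          (conversation_id INTEGER,
                                    question TEXT, answer TEXT);
    CREATE TABLE coin_transactions (user_id INTEGER, coins INTEGER,
                                    created_at TEXT);
    CREATE TABLE unlocked_features (user_id INTEGER, feature TEXT);
    -- Deliberately absent: location, device fingerprints, ad profiles,
    -- third-party enrichment data.
""")
```

The point of writing it out is that data minimisation is auditable: anyone can check that no table exists to hold the categories of data the platform promises not to collect.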
The open-data commitment extends to the knowledge base itself. Most AI companies treat their training data as a proprietary asset — a competitive moat that justifies keeping it secret. Future AI takes the opposite view. The knowledge base that the AI uses to answer questions was built from the intellectual contributions of real users. Those contributions should be readable by the community that made them. Making the knowledge base publicly accessible means that errors can be identified and corrected by anyone, not just by the engineers who built the system. It is a more robust and more trustworthy approach to AI knowledge management.
Every entry in the knowledge base is tagged with a topic, ranked by community confirmations, and searchable by keyword. The full knowledge base is available at /knowledge-base and is updated continuously. Researchers, journalists, and curious users are welcome to read it, cite it, and build on it. No API key required. No rate limits on public reads. That is what open data looks like in practice.
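Because the knowledge base is openly readable, anyone can replicate its topic filtering, keyword search, and confirmation ranking locally. A minimal sketch, assuming entries are plain records with topic, confirmations, question, and answer fields (the field names mirror the description above; the record shape is an assumption, not the actual export format):

```python
def search(entries, keyword=None, topic=None):
    """Filter knowledge-base entries by topic and keyword,
    then rank by community confirmations, highest first."""
    hits = [
        e for e in entries
        if (topic is None or e["topic"] == topic)
        and (keyword is None
             or keyword.lower() in (e["question"] + " " + e["answer"]).lower())
    ]
    return sorted(hits, key=lambda e: e["confirmations"], reverse=True)

# Hypothetical entries for illustration only.
entries = [
    {"topic": "privacy", "confirmations": 12,
     "question": "What is data minimisation?",
     "answer": "Collecting only what is necessary."},
    {"topic": "privacy", "confirmations": 3,
     "question": "What is anonymisation?",
     "answer": "Removing user identifiers."},
]

print(search(entries, keyword="minimisation")[0]["confirmations"])  # prints 12
```

Since public reads are unauthenticated and unmetered, the same logic works whether the entries come from a downloaded snapshot or are fetched live from /knowledge-base.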