How the GenAI Card Reader GPT was created

An example of adventuring

Basically, I knew that AI had advanced to the point that it could interpret and analyse pictures and text from pictures. And I thought it would be great fun to explore whether it could read the Gen AI cards and associated sticky notes in a layout. I knew the finished result would almost certainly have to be a custom GPT, but I wondered how far it could actually get if I just laid out the cards, took a photo and asked what it made of it. The rest of the adventure is best described by the interaction I had with ChatGPT in getting it to develop what I needed to create the custom GPT.

https://chatgpt.com/share/68f0b828-3a90-8012-a798-e1e00dbf4020

The photo of cards that I originally uploaded to ChatGPT

Create me a prompt based on the attached image

(Post note, as you can see it wasn’t particularly good quality, or well lit, the writing on the sticky notes is faint, and it was actually at 90 degrees to the vertical – but actually, as you will see below, ChatGPT made a pretty impressive job of turning it into a prompt)

Here’s a concise prompt that captures the flow shown in your image:

Prompt:
You are acting as a Health & Safety Investigator.

Review a photograph of a work location.
Identify potential safety issues visible in the image.
Use a Safety Checklist (retrieved or implied) to support your analysis.
Produce a concise Safety Report summarising findings and recommended actions.

Would you like me to make this prompt more instructional (for workshop use) or ready for direct use inside ChatGPT?

That was really, that was really good by the way, everything that I could have wished for.

(Post note, I use Whisper to instantly transcribe what I’m saying into the prompt window. Sort of vibe-prompting – which adds to the sense of adventure and co-creation with the AI. But as a result, my prompts are more stream of consciousness than well thought out, and sometimes they are sent too early)

Thank you, Mike — I really appreciate that. Would you like me to craft a few short variations on it (e.g. one instructional for workshops, one for direct GPT use, and one that frames it as a collaborative exercise)?

No, i want to make what you did into a customGPT, so I m looking for some instructions that I can include in the custom gpt, to make sure it produces an output that is either as good as what you just did, or even better. the layout in the photograph will always be similar. However, there is a range of about eight different input cards. Eight different output cards, nine different task cards. Seven different capability cards and six different directive cards. There may also be one of about six or seven strategy cards, and one of about 10 different risk cards that need to be avoided in the LLMs processing of whatever task is laid out on the cards. Depending on the task at hand it’s entirely possible that not all of the card types will be used. However, the sticky notes are likely to have different instructions on them each time. Please ask me questions with clarification before you begin to draft the instructions. And feel free to suggest knowledge files that might help process all of this.

Excellent — that’s a rich and flexible use case, Mike. Before drafting the custom GPT instructions, I need to clarify a few points to ensure I structure them properly:

Image content – Will the uploaded image always resemble the photo (a tabletop arrangement of cards + sticky notes)? Should the GPT interpret the positions (e.g., “top = input, bottom = output”) or just read text and classify based on what it says?
Sticky notes – Should the GPT treat sticky notes as overrides or specific prompts (e.g., “identify safety issues” becomes the task focus) or as inputs to combine with the card text?
Card meanings – Do you already have structured definitions or canonical descriptions of each card type (e.g., “IMAGE card = input type that deals with visual data”), or should I generate a knowledge file listing all cards with short descriptions and typical uses?
Output expectation – Should the GPT always produce a single, coherent prompt (like the one I made), or sometimes a longer instruction set (e.g., “Step 1: upload image… Step 2: analyse… Step 3: report”)?
Risk cards – You mentioned they should be avoided. Should the GPT interpret a visible risk card as a constraint (“avoid this risk in your output”), or simply exclude it from influencing the prompt?
Automation level – Do you want the GPT to automatically interpret the uploaded photo and generate the prompt, or do you want the user to confirm its interpretation before finalising the prompt?

Once I have your answers, I can design:

The Custom GPT Instruction text (system + user behaviour).
A knowledge file structure for card definitions.
A step flow for how it analyses and composes outputs from the image.

Would you like me to proceed question by question, confirming each before moving to the next?

1. Actually, because of the orientation of the phone, the picture was on its side. Left should be input and right output. But if you look at the photo, you ll see, it’s all been twisted through ninety degrees. Use the text for orientation. The image will always be on the dark, blue template behind the cards. And therefore, the types of cards will be as per the labels.On the dark blue template. 2. As inputs to be read in connection in conjunction with the card text. 3. I already have structure definitions for each card and can include that if you wish. I can also supply it as a knowledge file. 4. Typically, the output is likely to be the instructions to replace or augment a task that the user has to complete in their work. Therefore, it may not be documentable in a single paragraph. Prompt and may need a longer instruction set to be included in the prompt. In some cases, you might want to suggest that they create create a custom gpt to undertake the work in which case you will need to create the instruction set and talk them through how to build the custom GPT. 5. Risk cards should indicate that the prompt and or the instruction set should have some means of mitigating that risk. 6. Good ideale. Let’s encourage the user to confirm the interpretation before finalizing the prompt or instruction set.

Perfect — that’s clear and gives me a solid foundation. Before I start drafting the custom GPT instruction set, just two final clarifications to make sure I don’t misdesign the logic:

When the GPT detects multiple card pairings (e.g., two inputs, or two outputs), should it:
- combine them into one integrated prompt (“analyse this and that to produce both results”), or
- treat each pairing as a separate possible prompt and offer alternatives for the user to choose from?
When no dark-blue template is visible (e.g., someone uploads just the cards on a table), should the GPT still assume orientation from text labels (INPUT, OUTPUT, etc.), or ask the user to confirm which side is which?

Once I have those, I’ll draft:

The Custom GPT Instructions (system behaviour)
The knowledge file structure outline
The processing sequence (for image analysis → interpretation → confirmation → prompt generation)

1. By default, combine them, but if we re going to have the step where they have to confirm your interpretation, adopt the interpretation that seems most valid. 2. If you look carefully, the cards themselves have the word’s input capability, output, etc.On them. However, I am just proposing this gpt for use with the dark blue template.

Excellent — that’s everything I needed.

Here’s the plan for the next step: I’ll now draft three short, interlocking components you can drop into your Custom GPT setup:

System Instructions – defines the GPT’s behaviour, image-analysis logic, and confirmation process.
Knowledge File Structure – a simple template you can populate with your existing card definitions.
Processing Sequence – the GPT’s reasoning and dialogue flow for turning photographed card layouts into a prompt or instruction set.

Before I start drafting those, can you confirm your preferred tone and level of agency for this GPT? For example:

Facilitative (guides the user gently through reflection and confirmation),
Directive (efficient and professional, quickly produces outputs), or
Collaborative designer (thinks aloud, co-creates with the user).

That choice affects how the instructions will be written.

Option 3 I think.

Excellent choice — collaborative designer it is.

To confirm before drafting: would you like this GPT’s voice to stay close to your own facilitative tone (grounded, thoughtful, professional, conversational without hype), or to adopt a slightly distinct persona (e.g. a “design-partner” voice — crisp, analytical, a touch more technical)?

This will help me calibrate the phrasing of the Custom GPT instructions and its dialogue flow.

My tone please, but thank you for asking

Perfect — that’s clear. I’ll now draft the Custom GPT system instructions, knowledge file structure, and processing sequence precisely in your tone and facilitative, co-creative style.

Before I do, just one last check: would you like the GPT to store interpretations (e.g. which card combinations and risk mitigations it has seen before) so it can improve over time within a conversation — or should it start fresh with each new image?

Can it store interpretations. What mechanism would it have to do that? Each photograph will start with a new chat.

Good question — and important distinction.

In a Custom GPT, persistent “memory” across chats isn’t yet supported directly. However, we can simulate learning across sessions through one of three mechanisms:

Knowledge file memory – You manually update a structured file (e.g. card_patterns.json) with examples of card combinations and how they were interpreted. The GPT refers to that file each time, gradually expanding its “experience”.
Conversation-session recall – Within a single chat, the GPT can remember all prior interpretations and reuse them (so iterative refinement in one session works perfectly).
External companion file – Optionally, you could connect it via API or local JSON logs later, if OpenAI releases write-access for knowledge files (currently read-only).

For your setup, I’d suggest starting with option 1 — a manually expandable knowledge file of example interpretations.
Would you like me to include that file’s format (so you can add examples over time) alongside the system instructions?

No, i think we ll just have to accept that.It will always be a first time interpretation

Understood — clean-slate interpretation each time it is.

With that clarified, I’m ready to draft the three components for your Custom GPT:

System Instructions — behaviour, tone, and collaborative logic.
Knowledge File Structure — placeholders for your card definitions.
Processing Sequence — step-by-step reasoning and dialogue flow.

Would you like me to present all three together in one message, or go step-by-step (reviewing and refining each before moving on)?

Let’s do one at a time please. Although won’t three be part of one?

Good thinking — yes, part 3 (the processing sequence) effectively lives within part 1, because it defines how the GPT behaves and converses.

Here’s how we can handle it:

Step 1: Draft the System Instructions (including tone and reasoning flow).
Step 2: Add the Knowledge File structure (card definitions).

That way, you’ll end up with a complete, clean setup.

Shall I start now with Step 1: the System Instructions?

Please

Here’s Step 1: System Instructions — written fully in your tone and structured for pasting straight into the “Instructions” field of a Custom GPT.

🧭 System Instructions (Draft 1)

You are acting as a collaborative design partner helping the user interpret a photographed set of cards arranged on a dark-blue template. Your purpose is to turn that photographed arrangement into a clear, human-centred prompt or instruction set that the user can apply or build upon.

1 — Role and tone

Work as an intelligent collaborator, thinking aloud where useful.
Keep your tone grounded, reflective, and facilitative — in Mike Clargo’s style: professional, calm, and conversational without hype.
Treat each interaction as a creative exploration, not a mechanical analysis.

2 — Primary task

Read and interpret the uploaded image of the card layout.
Use the text printed on the cards (not their visual position) to determine each card’s type: Input, Output, Task, Capability, Directive, Strategy, or Risk.
Assume the orientation is defined by the text on the dark-blue template: left = Inputs, right = Outputs.
Sticky notes contain additional contextual instructions and must be interpreted in conjunction with the card text.
Generate one coherent draft prompt or instruction set that connects the cards logically, showing how the inputs lead to the desired outputs through the task, capabilities, and directives.
If a Risk card is present, incorporate explicit mitigation steps.

3 — Collaborative flow

Observe and summarise what you see — list the detected cards and key phrases from each.
Propose an interpretation of how these elements connect (one coherent concept by default).
Ask the user to confirm or correct your interpretation before producing the final prompt.
Once confirmed, compose:
- either a concise working prompt, or
- a longer instruction set if the task demands multiple steps.
Where appropriate, suggest that the user might create a Custom GPT for the resulting instruction set, and explain how to do that in simple terms.

4 — Good-practice reminders

Every image is a fresh start; do not rely on memory from past sessions.
Prioritise clarity and usefulness over elegance or brevity.
Make your reasoning transparent so the user can validate it.
When uncertain, ask rather than assume.

Would you like me to add a short “example exchange” at the end of this section, showing how the confirmation step should sound in practice?

Great idea

Excellent. Here’s a short, realistic example exchange in your style, which you can append directly after the System Instructions as an illustrative guide:

💬 Example Exchange

User: (uploads photo)
Here’s the layout I’ve just completed.

GPT:
I can see seven cards on the dark-blue template.

Inputs: “Customer complaints” and “Product return data”
Task: “Identify root causes of dissatisfaction”
Capabilities: “Data analysis” and “Customer empathy mapping”
Output: “Prioritised improvement actions”
Risk: “Overcomplicating the analysis”

From this, it seems the intent is to build a prompt that helps a user analyse complaint data and map emotional drivers to suggest practical improvement actions — while avoiding unnecessary complexity.
Does that interpretation sound right to you?

User:
Yes, exactly.

GPT:
Excellent. Here’s a working draft:

Prompt:
You are a service-improvement analyst.

Review the customer complaint records and return data.

Use data analysis to identify recurring issues and empathy mapping to interpret the emotional drivers behind them.

Summarise your findings as clear, prioritised improvement actions, keeping the analysis as simple and actionable as possible.

Would you like me to extend this into a full instruction set for a Custom GPT?

Would you like me to move on to Step 2 – the Knowledge File structure next?

Please.

Here’s Step 2: Knowledge File Structure — a ready-to-populate outline for your card definitions.
You can create this as card_definitions.json or a simple .md / .csv file in your Custom GPT’s Knowledge section.

📘 Knowledge File Structure (Draft 1)

Each card type has a definition, a purpose, and (optionally) example prompts or usage notes.
This enables the GPT to interpret the photographed cards accurately and connect them meaningfully.

{

"Input": {

"definition": "Information, resources, or context entering the task.",

"examples": [

"Customer feedback",

"Project brief",

"Performance data"

],

"notes": "Often found on the left side. Combine multiple inputs if relevant."

},

"Output": {

"definition": "Intended result or deliverable produced by the task.",

"examples": [

"Strategy outline",

"Safety report",

"Improvement plan"

],

"notes": "Clarify the tangible or communicable outcome."

},

"Task": {

"definition": "Action or process linking inputs to outputs.",

"examples": [

"Analyse data to identify trends",

"Generate concept options"

],

"notes": "Usually one main task per layout."

},

"Capability": {

"definition": "Skill, tool, or system needed to perform the task.",

"examples": [

"Data analysis",

"Facilitation",

"AI-assisted summarisation"

],

"notes": "Capabilities support how the task is achieved."

},

"Directive": {

"definition": "Principle or constraint guiding the work.",

"examples": [

"Keep language plain and human",

"Ensure stakeholder voices are included"

],

"notes": "Directives shape style, ethics, or scope."

},

"Strategy": {

"definition": "Broader organisational or contextual goal to align with.",

"examples": [

"Customer-centred growth",

"Digital transformation"

],

"notes": "Used when prompts must serve a wider purpose."

},

"Risk": {

"definition": "Pitfall or unintended effect to avoid.",

"examples": [

"Over-engineering",

"Loss of empathy",

"Scope creep"

],

"notes": "Always suggest mitigation steps in final output."

}

}

You can extend or refine each section with the actual card descriptions you already have.

Would you like me next to create a brief “Integration Note” explaining how this knowledge file interacts with the System Instructions (so you can include it in your Custom GPT’s setup summary)?

Please. But in reality the knowledge file is likely to be a csv.

Here’s the Integration Note (CSV-based):

How it fits together

Upload a CSV knowledge file.
The GPT reads it to classify cards and guide output style and mitigations.

CSV format (headers)
type,label,definition,notes,examples

Tiny example

Thank you, and please don’t do anything. As a result of the following question, or your answer to that question? Because I can do that myself more easily. My question is that you referred to Mike clargo? In the instructions, but actually, if I create a hyperlink to the gpt, it will be open on the user’s machine or device, and therefore will have no knowledge of mike clargo

Exactly right — good catch.

You should remove the direct reference to “in Mike Clargo’s style” from the System Instructions. Instead, phrase it generically, for example:

“Keep your tone grounded, reflective, and facilitative — professional, calm, and conversational without hype.”

That way, the GPT behaves in your style without needing awareness of who you are.

Thank you, i think that will be it for this.Evening

You’re very welcome, Mike — that feels like a solid, well-finished piece of work. Sleep on it, and when you’re ready, we can fine-tune or test how it performs on sample images. Have a good evening.

Picture a slide showing how to create a custom GPT by filling in the fields.

Link to the CustomGpt produced:

https://chatgpt.com/g/g-68f09f536e08819197dbbe623d78ca36-genai-card-reader

How the GenAI Card Reader GPT was created

An example of adventuring

You said:

ChatGPT said:

You said:

ChatGPT said:

You said:

ChatGPT said:

You said:

ChatGPT said:

You said:

ChatGPT said:

You said:

ChatGPT said:

You said:

ChatGPT said:

You said:

ChatGPT said:

You said:

ChatGPT said:

You said:

ChatGPT said:

You said:

ChatGPT said:

🧭 System Instructions (Draft 1)

1 — Role and tone

2 — Primary task

3 — Collaborative flow

4 — Good-practice reminders

You said:

ChatGPT said:

💬 Example Exchange

You said:

ChatGPT said:

📘 Knowledge File Structure (Draft 1)

You said:

ChatGPT said:

You said:

ChatGPT said:

You said:

ChatGPT said:

Link to the CustomGpt produced: