core

Create messages for language models like Claude and OpenAI GPTs.

from IPython.display import Image, display
from pathlib import Path

from pathlib import Path

API Exploration

Anthropic’s Claude and OpenAI’s GPT models are some of the most popular LLMs.

Let’s take a look at their APIs and to learn how we should structure our messages for a simple text chat.

openai

from openai import OpenAI

client = OpenAI()

client.responses.create(
  model="gpt-4.1",
  input=[ {"role": "user", "content": "Hello, world!"} ]
)

Hello, world! 👋 How can I help you today?

id: resp_68a4d422397c81a19fb6b4d899b25e7b029dfdae1caafc84
created_at: 1755632674.0
error: None
incomplete_details: None
instructions: None
metadata: {}
model: gpt-4.1-2025-04-14
object: response
output: [ResponseOutputMessage(id=‘msg_68a4d42338fc81a1ac7fba8e496af1a1029dfdae1caafc84’, content=[ResponseOutputText(annotations=[], text=‘Hello, world! 👋 How can I help you today?’, type=‘output_text’, logprobs=[])], role=‘assistant’, status=‘completed’, type=‘message’)]
parallel_tool_calls: True
temperature: 1.0
tool_choice: auto
tools: []
top_p: 1.0
background: False
max_output_tokens: None
max_tool_calls: None
previous_response_id: None
prompt: None
prompt_cache_key: None
reasoning: Reasoning(effort=None, generate_summary=None, summary=None)
safety_identifier: None
service_tier: default
status: completed
text: ResponseTextConfig(format=ResponseFormatText(type=‘text’), verbosity=‘medium’)
top_logprobs: 0
truncation: disabled
usage: ResponseUsage(input_tokens=11, input_tokens_details=InputTokensDetails(cached_tokens=0), output_tokens=14, output_tokens_details=OutputTokensDetails(reasoning_tokens=0), total_tokens=25)
user: None
store: True

anthropic

from anthropic import Anthropic

client = Anthropic()

client.messages.create(
    model="claude-3-haiku-20240307",
    max_tokens=1024,
    messages=[ {"role": "user", "content": "Hello, world!"} ]
)

Hello! It’s great to meet you. I’m an AI assistant created by Anthropic. I’m here to help with a wide variety of tasks, from analysis and research to creative projects and casual conversation. Please let me know if there’s anything I can assist you with.

id: msg_01LVFGsTHhwM65ESmydXgm3o
content: [{'citations': None, 'text': "Hello! It's great to meet you. I'm an AI assistant created by Anthropic. I'm here to help with a wide variety of tasks, from analysis and research to creative projects and casual conversation. Please let me know if there's anything I can assist you with.", 'type': 'text'}]
model: claude-3-haiku-20240307
role: assistant
stop_reason: end_turn
stop_sequence: None
type: message
usage: {'cache_creation': {'ephemeral_1h_input_tokens': 0, 'ephemeral_5m_input_tokens': 0}, 'cache_creation_input_tokens': 0, 'cache_read_input_tokens': 0, 'input_tokens': 11, 'output_tokens': 60, 'server_tool_use': None, 'service_tier': 'standard'}

As we can see both APIs use the exact same message structure.

mk_msg

Ok, let’s build the first version of mk_msg to handle this case

def mk_msg(content:str, role:str="user")->dict:
    "Create an OpenAI/Anthropic compatible message."
    return dict(role=role, content=content)

Let’s test it out with the OpenAI API. To do that we’ll need to setup two things:

install the openai SDK by running pip install openai
add your openai api key to your env vars export OPENAI_API_KEY="YOUR_OPEN_API_KEY"

oa_cli = OpenAI()

r = oa_cli.responses.create(
  model="gpt-4o-mini",
  input=[mk_msg("Hello, world!")]
)
r.output_text

'Hello! How can I assist you today?'

Now, let’s test out mk_msg on the Anthropic API. To do that we’ll need to setup two things:

install the openai SDK by running pip install anthropic
add your anthropic api key to your env vars export ANTHROPIC_API_KEY="YOUR_ANTHROPIC_API_KEY"

a_cli = Anthropic()

r = a_cli.messages.create(
    model="claude-3-haiku-20240307",
    max_tokens=1024,
    messages=[mk_msg("Hello, world!")]
)
r.content[0].text

"Hello! I'm an AI assistant created by Anthropic. It's nice to meet you. How can I help you today?"

So far so good!

Helper Functions

Before going any further, let’s create some helper functions to make it a little easier to call the OpenAI and Anthropic APIs. We’re going to be making a bunch of API calls to test our code and typing the full expressions out each time will become a little tedious. These functions won’t be included in the final package.

def openai_chat(msgs: list)->tuple:
    "call the openai chat responses endpoint with `msgs`."
    r = oa_cli.responses.create(model="o4-mini", input=msgs)
    return r, r.output_text

Let’s double check that mk_msg still works with our simple text example from before.

_, text = openai_chat([mk_msg("Hello, world!")])
text

'Hello there! How can I assist you today?'

def anthropic_chat(msgs: list)->tuple:
    "call the anthropic messages endpoint with `msgs`."
    r = a_cli.messages.create(model="claude-sonnet-4-20250514", max_tokens=1024, messages=msgs)
    return r, r.content[0].text

and Anthropic…

_, text = anthropic_chat([mk_msg("Hello, world!")])
text

'Hello! Nice to meet you. How are you doing today? Is there anything I can help you with?'

Images

Ok, let’s see how both APIs handle image messages.

openai

import base64, httpx

img_url = "https://claudette.answer.ai/index_files/figure-html/cell-35-output-1.jpeg"

mtype = "image/jpeg"
img_content = httpx.get(img_url).content

img = base64.b64encode(img_content).decode("utf-8")

client = OpenAI()
r = client.responses.create(
    model="gpt-4o-mini",
    input=[
        {
            "role":"user",
            "content": [
                {"type":"input_text","text":"What's in this image?"},
                {"type":"input_image","image_url":f"data:image/jpeg;base64,{img}"},
            ],
        }
    ],
)
r.output_text

'The image features a puppy lying on the grass near a cluster of purple flowers. The puppy has a brown and white coat, with large, expressive eyes and floppy ears, giving it an adorable appearance.'

anthropic

mtype = "image/jpeg"
img = base64.b64encode(img_content).decode("utf-8")

client = Anthropic()
r = client.messages.create(
    model="claude-3-haiku-20240307",
    max_tokens=1024,
    messages=[
        {
            "role":"user",
            "content": [
                {"type":"text","text":"What's in this image?"},
                {"type":"image","source":{"type":"base64","media_type":mtype,"data":img}}
            ],
        }
    ],
)
r.content[0].text

'The image shows a close-up of a cute puppy lying in the grass. The puppy appears to be a Cavalier King Charles Spaniel, with a fluffy brown and white coat. The puppy is looking directly at the camera with a friendly, curious expression. In the background, there are some purple daisy-like flowers blooming, adding a nice natural setting to the scene.'

Both APIs format images slightly differently and the structure of the message content is a little more complex.

In a text chat, content is a simple string but for a multimodal chat (text+images) we can see that content is a list of dictionaries.

Msg Class

Basics

Let’s create _mk_img to make our code a little DRY’r.

Exported source

def _mk_img(data:bytes)->tuple:
    "Convert image bytes to a base64 encoded image"
    img = base64.b64encode(data).decode("utf-8")
    mtype = mimetypes.types_map["."+imghdr.what(None, h=data)]
    return img, mtype

To handle the additional complexity of multimodal messages let’s build a Msg class for the content data structure:

{
    "role": "user",
    "content": [{"type": "text", "text": "What's in this image?"}],
}

API Exploration

openai

anthropic

mk_msg

Helper Functions

Images

openai

anthropic

Msg Class

Basics

Msg

OpenAiMsg

AnthropicMsg

Msg.mk_content

OpenAiMsg.text_msg

OpenAiMsg.img_msg

AnthropicMsg.text_msg

AnthropicMsg.img_msg

mk_msg

PDFs

AnthropicMsg.pdf_msg

Conversation

SDK Objects

Msg.__call__

AnthropicMsg.find_block

AnthropicMsg.is_sdk_obj

OpenAiMsg.find_block

OpenAiMsg.is_sdk_obj

mk_msgs

Usage

Extra features

Caching

mk_msgs_anthropic

mk_msg_anthropic

Citations

mk_ant_doc

Msg.call