Python Developer for AI Prototype (LLM + State Comparison, Short Project)

________________________________________

Description

I’m looking for a developer to help build a lightweight AI prototype using OpenAI or Anthropic APIs.

This is NOT a full product build.

This is a focused prototype to test a specific idea.

________________________________________

Project Goal

Build a simple Python-based system that

1.Runs the same LLM task multiple times.

2. Captures outputs and any intermediate state (memory/logs).

3. Compares differences between runs.

4. Classifies differences into simple categories

o Stable

o Boundary

o Violation

________________________________________

What This Means

Think

• Run the same prompt 5–10 times.

• Log results.

• Detect where outputs or stored data differ.

• Label those differences.

That is it.

________________________________________

Technical Requirements

Must have

• Python

• Experience with OpenAI API or Anthropic API

• Ability to build simple, clean scripts (no over-engineering)

Nice to have

• LangChain or similar frameworks.

• Streamlit (for simple UI/dashboard).

• Experience with logging or comparing outputs.

________________________________________

Important Constraints

This should be

• Lightweight.

• fast to build.

• easy to understand.

Please DO NOT

• Design complex architectures.

• build full systems.

• over-engineer.

________________________________________

Deliverables

• Python script or small app.

• Ability to run repeated LLM tasks.

• Stored logs of runs (JSON or similar).

• Basic comparison logic between runs.

• Simple classification output.

________________________________________

Timeline

• 3–7 days initial build

• Max 1–2 weeks total

________________________________________

Engagement Style

• Fixed-price or hourly (open to discussion)

• Will start with a small paid test task before full project

________________________________________

Screening Question (Required)

Please answer this

If you needed to run the same LLM task multiple times and compare outputs/state between runs, how would you build it quickly?

________________________________________

Who This Is For

Ideal candidate

• Builds fast prototypes.

• Comfortable with LLM APIs.

• Prefers simple solutions over complex systems.

________________________________________

Bonus

If this goes well, there may be follow-on work.

Python Developer for AI Prototype (LLM + State Comparison, Short Project)

NLT AI Summary

Build a simple Python-based system that

4. Classifies differences into simple categories

Think

Must have

Nice to have

This should be

Please DO NOT

Please answer this

Ideal candidate

Stay on the Nerd Track