Python Developer for AI Prototype (LLM + State Comparison, Short Project)
Upwork
NLT AI Summary
Python Developer for AI Prototype (LLM + State Comparison, Short Project)
________________________________________
Description
I’m looking for a developer to help build a lightweight AI prototype using OpenAI or Anthropic APIs.
This is NOT a full product build.
This is a focused prototype to test a specific idea.
________________________________________
Project Goal
Build a simple Python-based system that
1.Runs the same LLM task multiple times.
2. Captures outputs and any intermediate state (memory/logs).
3. Compares differences between runs.
4. Classifies differences into simple categories
o Stable
o Boundary
o Violation
________________________________________
What This Means
Think
• Run the same prompt 5–10 times.
• Log results.
• Detect where outputs or stored data differ.
• Label those differences.
That is it.
________________________________________
Technical Requirements
Must have
• Python
• Experience with OpenAI API or Anthropic API
• Ability to build simple, clean scripts (no over-engineering)
Nice to have
• LangChain or similar frameworks.
• Streamlit (for simple UI/dashboard).
• Experience with logging or comparing outputs.
________________________________________
Important Constraints
This should be
• Lightweight.
• fast to build.
• easy to understand.
Please DO NOT
• Design complex architectures.
• build full systems.
• over-engineer.
________________________________________
Deliverables
• Python script or small app.
• Ability to run repeated LLM tasks.
• Stored logs of runs (JSON or similar).
• Basic comparison logic between runs.
• Simple classification output.
________________________________________
Timeline
• 3–7 days initial build
• Max 1–2 weeks total
________________________________________
Engagement Style
• Fixed-price or hourly (open to discussion)
• Will start with a small paid test task before full project
________________________________________
Screening Question (Required)
Please answer this
If you needed to run the same LLM task multiple times and compare outputs/state between runs, how would you build it quickly?
________________________________________
Who This Is For
Ideal candidate
• Builds fast prototypes.
• Comfortable with LLM APIs.
• Prefers simple solutions over complex systems.
________________________________________
Bonus
If this goes well, there may be follow-on work.