Igor's AI Bestie Simulator
My best friend and I chat a lot (33/60/101 messages per day at P50/P75/P95). I have years of chat history, so this seemed like a super fun way to get deeper into ML. This is created and shared with his permission.
Simulation
Playing around with a bestie simulator
Idea: Inject Recent chat history
I’ve been thinking about how to inject “state”; perhaps the easiest thing is just to inject our last few days of discussion. Need to do the math on how much room I have in the prompt … (rough math sketch below)
Can also use semantic summarization if that’s too big
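A minimal sketch of that token math, assuming tiktoken and the gpt-3.5-turbo-1106 context window; the export file name and message format are made up for illustration:

```python
# Rough token-budget check: how much of the context window would the
# last few days of chat take up? (sketch, not the real pipeline)
import json
from datetime import datetime, timedelta

import tiktoken  # pip install tiktoken

CONTEXT_WINDOW = 16_385  # gpt-3.5-turbo-1106 context size
enc = tiktoken.encoding_for_model("gpt-3.5-turbo")

def tokens_in_recent_chat(messages, days=3):
    """messages: list of {"time": naive ISO-8601, "sender": str, "text": str} (assumed format)."""
    cutoff = datetime.now() - timedelta(days=days)
    recent = [m for m in messages if datetime.fromisoformat(m["time"]) >= cutoff]
    text = "\n".join(f'{m["sender"]}: {m["text"]}' for m in recent)
    return len(enc.encode(text))

messages = json.load(open("chat_history.json"))  # hypothetical export
used = tokens_in_recent_chat(messages)
print(f"{used} tokens, {used / CONTEXT_WINDOW:.0%} of the context window")
```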
Attempt #1: Prompt GPT to simulate with lots of example text
Prompts didn’t work that well. Still sounded like GPT
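For reference, the prompting attempt looked roughly like this, using the openai Python client; the example lines are placeholders, not our real chat:

```python
# Attempt #1 sketch: stuff example messages into the system prompt
# and ask the model to reply in the same voice.
from openai import OpenAI

client = OpenAI()

EXAMPLES = """\
Igor: long day, barely got through my queue
Bestie: oof. did you at least get outside?
Igor: 20 min walk
Bestie: counts. dinner?"""  # placeholder examples

resp = client.chat.completions.create(
    model="gpt-4",
    messages=[
        {
            "role": "system",
            "content": "You are simulating my best friend. Match the tone, length, "
            "and style of these real chat examples:\n" + EXAMPLES,
        },
        {"role": "user", "content": "how was your day?"},
    ],
)
print(resp.choices[0].message.content)
```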
Attempt #2: Use that data to do the training
- Style transfer success! But then answers were too concise.
- TBD – add several examples
- Next use a prompt to make the responses longer
- TBD – add several examples
- Proof of concept (a rough sketch of the setup follows this list)
  - Use the fine-tune tool: https://platform.openai.com/finetune
  - Split into training and validation
  - Started getting moderation errors when I went to weekly batches.
  - Tried removing the image Unicode and going down to 1/10 the size, and that worked
  - Trying again at full size to see if that works
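A minimal sketch of that proof of concept using the OpenAI fine-tuning API; the `examples.json` file of (prompt, reply) pairs and the system prompt are assumptions standing in for the real data prep output:

```python
# Sketch: write chat-format JSONL, split train/validation, and kick off a fine-tune.
import json
import random

from openai import OpenAI

SYSTEM = "You are Igor's bestie. Reply the way he does in chat."

def to_record(prompt, reply):
    return {
        "messages": [
            {"role": "system", "content": SYSTEM},
            {"role": "user", "content": prompt},
            {"role": "assistant", "content": reply},
        ]
    }

examples = json.load(open("examples.json"))  # hypothetical list of [prompt, reply] pairs
random.shuffle(examples)
split = int(len(examples) * 0.9)

for path, rows in [("train.jsonl", examples[:split]), ("valid.jsonl", examples[split:])]:
    with open(path, "w") as f:
        for prompt, reply in rows:
            f.write(json.dumps(to_record(prompt, reply)) + "\n")

client = OpenAI()
train = client.files.create(file=open("train.jsonl", "rb"), purpose="fine-tune")
valid = client.files.create(file=open("valid.jsonl", "rb"), purpose="fine-tune")
job = client.fine_tuning.jobs.create(
    model="gpt-3.5-turbo-1106",
    training_file=train.id,
    validation_file=valid.id,
)
print(job.id)  # e.g. ftjob-...
```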
Style vs Substance
Fine tuning makes it sound like my bestie, but a few observations:
- It’s very bad at telling me about itself / its day
- It’s pretty good at giving me advice.
I think part of the problem with the first one is that fine-tuning doesn’t have a sense of “context” or a sense of “utility”. I think I need to inject those in. Some example context (a sketch of injecting it follows this list):
- Current Global Events
  - Interestingly, this is pretty shallow stuff, so I don’t think it helps much beyond the interesting “brr, it’s cold out”
- Location
- Date
- Weather
- Current Utility for bestie
  - This is probably good, but maybe not, as the key is reflection
- What bestie knows I want to accomplish and care about
  - Being explicit here is probably good
  - Having bestie know my goals, and the things I get caught up in.
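A minimal sketch of injecting that context into the system prompt at chat time; the location, weather, and goal values are placeholders, and the model name is one of the fine-tunes listed in the next section:

```python
# Sketch: build a system prompt that layers current context on top of the
# fine-tuned style, so the model has "today" and my goals to ground itself in.
from datetime import date

from openai import OpenAI

def build_system_prompt(location, weather, goals):
    return (
        "You are Igor's bestie, chatting like he does.\n"
        f"Today is {date.today():%A, %B %d, %Y}. Igor is in {location}; weather: {weather}.\n"
        "Things Igor is trying to accomplish and cares about:\n"
        + "\n".join(f"- {g}" for g in goals)
    )

client = OpenAI()
resp = client.chat.completions.create(
    model="ft:gpt-3.5-turbo-1106:idvorkinteam::8YgPRpMB",  # fine-tune from the Models section
    messages=[
        {
            "role": "system",
            "content": build_system_prompt(
                location="Seattle",            # placeholder
                weather="41F and raining",     # placeholder
                goals=["ship the bestie bot", "keep up the daily habits"],  # placeholders
            ),
        },
        {"role": "user", "content": "how was your day?"},
    ],
)
print(resp.choices[0].message.content)
```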
Models
Here are some models I fine-tuned, in case you get access to them:
- Bestie-1d-raw-2020+
  - ftjob-qyOyRWqpuakIhQdulCSi60Ui
  - I’m guessing performance gets weird on facts as they change over 5-year blocks
- Bestie-7d-raw-2020+
  - Having a hard time getting this to pass validation
- Bestie-7d-raw-2020-sampled+
  - ftjob-5V0Pkd9YtiAp4FsnPctJ29F1
  - Having a hard time getting this to pass validation
- Bestie-1d-raw-full
  - No cleanups, just raw data
  - ft:gpt-3.5-turbo-1106:idvorkinteam::8YgPRpMB
  - Didn’t have a validation set, so not sure what that did
Data Prep
- EASY: Merge consecutive lines by the same person within 5 minutes (sketch below).
- HARD: Decide when you have a request/response vs. not.
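A minimal sketch of the easy step, assuming messages have already been parsed into (timestamp, sender, text) records:

```python
# Sketch: merge consecutive messages from the same sender that arrive within
# 5 minutes of each other, so one "turn" becomes one training line.
from dataclasses import dataclass
from datetime import datetime, timedelta

@dataclass
class Msg:
    time: datetime
    sender: str
    text: str

def merge_turns(messages, window=timedelta(minutes=5)):
    merged: list[Msg] = []
    for m in messages:
        prev = merged[-1] if merged else None
        if prev and prev.sender == m.sender and m.time - prev.time <= window:
            prev.text += "\n" + m.text
            prev.time = m.time  # keep extending the window from the latest message
        else:
            merged.append(Msg(m.time, m.sender, m.text))
    return merged
```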
Useful links
Observations
- For hard data tasks, I can use GPT to do the work, e.g. splitting chat logs into distinct conversations (sketch after this list)
- Can fine-tune on daily or weekly batches. Daily sounds good, but a few problems:
  - Conversations that span the end of the day get broken up
  - You have overhead for every training sample; going from daily to weekly I went from 10M to 4M
- Need to pay attention to stay under the token limit
- U+FFFC (ufffc) is the object replacement character Apple sends in place of an image
- A tuning run is ~$50
- I suspect the conversation gets weird if there’s too much history, as people change. Not knowing when something happened (a decay) in the training data is a problem, so I can address it by limiting to the latest history
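A hedged sketch of the “use GPT for the hard parts” idea, here asking the model to mark conversation boundaries; the prompt wording and JSON output format are my assumptions, not a tested pipeline:

```python
# Sketch: ask GPT to segment a day's worth of merged turns into distinct
# conversations by returning the indices where a new conversation starts.
import json

from openai import OpenAI

client = OpenAI()

def split_into_conversations(turns):
    """turns: list of 'Sender: text' strings for one day (assumed format)."""
    numbered = "\n".join(f"{i}: {t}" for i, t in enumerate(turns))
    resp = client.chat.completions.create(
        model="gpt-4-1106-preview",  # supports JSON mode
        response_format={"type": "json_object"},
        messages=[
            {
                "role": "system",
                "content": "Given a numbered chat log, return JSON like "
                '{"starts": [0, 17, ...]} listing the line numbers where a '
                "new, distinct conversation begins.",
            },
            {"role": "user", "content": numbered},
        ],
    )
    starts = json.loads(resp.choices[0].message.content)["starts"]
    return [turns[a:b] for a, b in zip(starts, starts[1:] + [len(turns)])]
```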
Coaching/Mentoring
Not exactly related, but let me merge some of these concepts in.
Habit Tracking
I have daily and weekly habits. I use an app called Streaks for that, but I could probably have an agent do this, especially tracking which ones I’ve done or not done. Let’s try that (a sketch of the tool hookup is below).
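A minimal sketch of what that agent hookup could look like, using OpenAI function calling; the `mark_habit` tool, its schema, and the habit names are hypothetical, and the actual storage is left as a stub:

```python
# Sketch: expose a habit-tracking function the agent can call when I say
# "did my pushups" in chat.
from openai import OpenAI

client = OpenAI()

TOOLS = [
    {
        "type": "function",
        "function": {
            "name": "mark_habit",  # hypothetical tool
            "description": "Record whether a daily/weekly habit was done today.",
            "parameters": {
                "type": "object",
                "properties": {
                    "habit": {"type": "string", "description": "e.g. 'pushups', 'journal'"},
                    "done": {"type": "boolean"},
                },
                "required": ["habit", "done"],
            },
        },
    }
]

resp = client.chat.completions.create(
    model="gpt-3.5-turbo-1106",
    messages=[{"role": "user", "content": "just finished my pushups"}],
    tools=TOOLS,
)
call = resp.choices[0].message.tool_calls[0]
print(call.function.name, call.function.arguments)  # feed this into the habit store
```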
Embodiment
Callable Agent Platforms
VAPI - ties transcription (S2T), TTS, the agent, and function calls together to let you call your agent. That’s certainly where I’ll end up.
Discord Bot
Having the interface be Discord is great. Easy and fun.
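A minimal sketch of that hookup with discord.py; wiring in the fine-tuned model name from above is my assumption, and a real bot would filter channels and keep conversation history:

```python
# Sketch: a Discord bot that forwards messages to the fine-tuned bestie model
# and posts the reply back in the channel.
import os

import discord
from openai import OpenAI

openai_client = OpenAI()
intents = discord.Intents.default()
intents.message_content = True  # requires the Message Content intent in the dev portal
bot = discord.Client(intents=intents)

@bot.event
async def on_message(message: discord.Message):
    if message.author == bot.user:
        return  # don't reply to ourselves
    resp = openai_client.chat.completions.create(
        model="ft:gpt-3.5-turbo-1106:idvorkinteam::8YgPRpMB",  # fine-tune from above
        messages=[
            {"role": "system", "content": "You are Igor's bestie."},
            {"role": "user", "content": message.content},
        ],
    )
    await message.channel.send(resp.choices[0].message.content)

bot.run(os.environ["DISCORD_BOT_TOKEN"])
```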
Voice of our friends
Well, that was super easy: ElevenLabs has perfect instant voice clones, and what’s coolest is you can clone into multiple languages, so you can hear yourself talking in lots of languages. Let me give a sample.
This took me about 30 minutes to set up - crazy!!
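Roughly what the hookup looks like, hitting the ElevenLabs text-to-speech REST endpoint; the voice ID is a placeholder for the cloned voice, and the request shape is my best guess from their docs, so double-check it before relying on it:

```python
# Sketch: synthesize a line in the cloned voice via the ElevenLabs TTS endpoint.
import os

import requests

VOICE_ID = "YOUR_CLONED_VOICE_ID"  # placeholder for the instant-clone voice

resp = requests.post(
    f"https://api.elevenlabs.io/v1/text-to-speech/{VOICE_ID}",
    headers={"xi-api-key": os.environ["ELEVENLABS_API_KEY"]},
    json={
        "text": "Bonjour! Weird hearing myself in French, right?",
        "model_id": "eleven_multilingual_v2",  # multilingual model for cross-language cloning
    },
)
resp.raise_for_status()
with open("sample.mp3", "wb") as f:
    f.write(resp.content)
```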
Video of our friends
Haven’t looked at this yet, but for sure it’s coming. You can probably make life easier by starting with still images and avatars.
Appendix
Tooling learnings
- Pre-commit - Nicer version of husky (which was always kind of flaky)
- Ruff - Nicer version of Black. Written in Rust, so much faster; also supports fixing some simple stuff, and has nvim support
- Path from pathlib - Lets you use Path.home() vs os.path.expanduser(), and a type-safe path parameter avoids sending in strings by accident (God Bless Typing). Tiny example below.
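A tiny illustration of that last point; the file name is made up:

```python
# Sketch: pathlib vs os.path, and a typed signature that rejects accidental strings.
import os
from pathlib import Path

old_way = os.path.expanduser("~/chat_exports/messages.json")  # just a string
new_way = Path.home() / "chat_exports" / "messages.json"      # a real Path object

def load_export(path: Path) -> str:
    # Type checkers flag load_export("~/oops.json") since it's a str, not a Path.
    return path.read_text()
```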