These feels like an ‘in two or four years’ maybe idea. Its a logical idea but it feels like way to early. Could be me and I hope I’m wrong but there’s a lot more ‘tricky’ parts than just weeding out bad training data.

Reply to this note

Please Login to reply.

Discussion

You’re right; it will take more than just data prep and likely more than 4 years to mature.

For example we need an architecture that allows for versioning of components, auditability, benchmarking for regression testing, etc. It also needs verifiable outputs so we can prove the output was generated without tampering.

And there are supplementary components like vector data stores so that users can store context for long running tasks.