AI Model Evaluations
The cluster focuses on discussions about evaluating, comparing, and using various AI/ML models, including debates on their quality, accuracy, usability for specific tasks, and the need for transparency or improvements.
Activity Over Time
Top Contributors
Keywords
Sample Comments
If it’s based on models, it’s only as good as the model
Curious, what would have been a better model?
Hi, so what are you using the models for?
I have nothing to gain from spending time testing models for you because whatever I pick will just seem like cherry picking to you, and it doesn't matter to me whether or not you agree on the usability of these models. They work for me, and that's all that matters to me. Try a a few completions instead of a question. Or don't
Honestly we can, I haven't prompted it enough what do you want to use the model for?
so the takeaway is basically, don't run a model if you don't know where it came from
What is going to happen? Please publish your models so we can run them ourselves.
Not sure it's possible. The more accurate the models get, the buggier they are.
what's the most effective model you've seen?
I'm not seeing the equivalence. Isn't the announcement here to let you run any model?