bilouba@jlai.lu to Privacy@lemmy.ml • I'm tired of LLM bullshitting. So I fixed it. · 3 months ago
Very impressive! Do you have a benchmark to test the reliability? A paper would be an awesome contribution to the science.