Jump to content

Super-Tiny Language Models Controversy


Recommended Posts

Just posted a short LinkedIn article debunking some claims that "super-tiny" language models have "comparable" performance with respect to much larger models. Turns out if you actually look in the paper the performance of the super-tiny model is at or below random guess probability for several datasets.... 
First I thought this must be the secondary reporting, but now I am looking more at the original authors who published a pre-print with weak results and contradicting strong claims. What is the process of asking for a revised version, does anyone here know? Contact the authors or Arxiv? 
Here is the link to the post: Problems with Super-Tiny Language Models - LinkedIn Article

  • Like 1
Link to comment
Share on other sites

Join the conversation

You can post now and register later. If you have an account, sign in now to post with your account.

Guest
Reply to this topic...

×   Pasted as rich text.   Paste as plain text instead

  Only 75 emoji are allowed.

×   Your link has been automatically embedded.   Display as a link instead

×   Your previous content has been restored.   Clear editor

×   You cannot paste images directly. Upload or insert images from URL.

×
×
  • Create New...