Meta quietly releases Llama 2 Lengthy AI mannequin

September 29, 2023

77

[ad_1]

VentureBeat presents: AI Unleashed – An unique govt occasion for enterprise knowledge leaders. Community and be taught with business friends. Study Extra

Meta Platforms confirmed off a bevy of latest AI options for its consumer-facing companies Fb, Instagram and WhatsApp at its annual Meta Join convention in Menlo Park, California, this week.

However the largest information from Mark Zuckerberg’s firm might have really come within the type of a pc science paper printed with out fanfare by Meta researchers on the open entry and non-peer reviewed web site arXiv.org.

The paper introduces Llama 2 Lengthy, a brand new AI mannequin primarily based on Meta’s open supply Llama 2 launched in the summertime, however that has undergone “continuous pretraining from Llama 2 with longer coaching sequences and on a dataset the place lengthy texts are upsampled,” in keeping with the researcher-authors of the paper.

Because of this, Meta’s newly elongated AI mannequin outperforms a few of the main competitors in producing responses to lengthy (greater character rely) person prompts, together with OpenAI’s GPT-3.5 Turbo with 16,000-character context window, in addition to Claude 2 with its 100,000-character context window.