Category: LM News

  • Lion EvoLved Sign Momentum: The New Optimizer Discovered by Google Brain

    โ€”

    by

    in

    ๐Ÿ“Œ According to the authors of the paper, a suitable learning rate for Lion is typically 3-10 times lower than that used with Adam(w). Since the effective weight decay is lr * ฮป, the value of the decoupled weight decay ฮป used for Lion is 3-10 times larger than that used with Adam(w) to maintain similar strength. ๐Ÿ“Œ The…

  • The Future of Smartphones: Betting Big on sLM

    โ€”

    by

    in

    Mobile device manufacturers are optimistic about the prospects for the use of artificial intelligence (AI) in smartphones. Companies like Qualcomm and MediaTek have launched smartphone chipsets that have enough muscle for processing AI applications. Previously, many AI applications on devices were partially processed in the cloud and then offloaded to the phone. However, cloud-based models…

  • Apple Open Sources Large Models for Mobile Devices!

    โ€”

    by

    in

    Apple has released an artificial intelligence (AI) model called OpenELM ( Open Efficient Language Model ), along with its code, weights, data sets, and training processes. Like Google, Samsung and Microsoft, which are focusing on developing generative AI models on both desktop and mobile devices, Apple has also joined this trend. This marks the birth of a new family…

  • OpenBioLLM-70B and 8B: Outperforms GPT-4, Gemini, Meditron-70B, Med-PaLM-1 and Med-PaLM-2 in the medical domain

    โ€”

    by

    in

    The developers of this model created a custom and diverse dataset, collaborating with medical experts to ensure the highest quality. The dataset covers over 3,000 healthcare topics and 10+ medical subjects. The outstanding performance of OpenBioLLM-70B is evident on 9 diverse biomedical datasets, achieving an impressive average of 86.06% despite having fewer parameters than GPT-4…

Translate ยป