AbstractDL | Telegram Webview: abstractDL/169 -

Telegram Group & Telegram Channel

Chain-of-Thought: дайте GPT поразмышлять перед ответом! (by Google)

Большинство промптов для zero-shot нацелены на немедленное получение ответа, но оказалось, если дать языковой модели «поразмышлять вслух» над задачей, то вероятность правильного решения значительно повышается.

Теперь это кажется чертовски логичным! Ведь требовать ответ сразу без возможности подумать это слишком жестоко даже для людей 😅

Добавление простого «Lets think step-by-step» промпта позволило языковой модели PaLM обойти человека на 10 из 23 задач Big-Bench! Думаю, что chain-of-thought подход теперь должен стать общепринятым.

А ещё Google выпустил мультиязычную модель Flan-T5-xxl, которая дополнительно затюнена под этот промпт + лосс из UL2.

P.S. На скриншоте результат генерации для GPT-j.

Статья

www.group-telegram.com/us/abstractDL.com/169

5.5K viewsedited Nov 1, 2022 at 19:08

group-telegram.com/abstractDL/169

Create: 2022-11-01
Last Update: 2025-02-11 14:12:14

Chain-of-Thought: дайте GPT поразмышлять перед ответом! (by Google)

Большинство промптов для zero-shot нацелены на немедленное получение ответа, но оказалось, если дать языковой модели «поразмышлять вслух» над задачей, то вероятность правильного решения значительно повышается.

Теперь это кажется чертовски логичным! Ведь требовать ответ сразу без возможности подумать это слишком жестоко даже для людей 😅

Добавление простого «Lets think step-by-step» промпта позволило языковой модели PaLM обойти человека на 10 из 23 задач Big-Bench! Думаю, что chain-of-thought подход теперь должен стать общепринятым.

А ещё Google выпустил мультиязычную модель Flan-T5-xxl, которая дополнительно затюнена под этот промпт + лосс из UL2.

P.S. На скриншоте результат генерации для GPT-j.

Статья

BY AbstractDL

Share with your friend now:
group-telegram.com/abstractDL/169

Open in Telegram

Telegram | DID YOU KNOW?

Date: 2025-02-11|

Given the pro-privacy stance of the platform, it’s taken as a given that it’ll be used for a number of reasons, not all of them good. And Telegram has been attached to a fair few scandals related to terrorism, sexual exploitation and crime. Back in 2015, Vox described Telegram as “ISIS’ app of choice,” saying that the platform’s real use is the ability to use channels to distribute material to large groups at once. Telegram has acted to remove public channels affiliated with terrorism, but Pavel Durov reiterated that he had no business snooping on private conversations. During the operations, Sebi officials seized various records and documents, including 34 mobile phones, six laptops, four desktops, four tablets, two hard drive disks and one pen drive from the custody of these persons. This ability to mix the public and the private, as well as the ability to use bots to engage with users has proved to be problematic. In early 2021, a database selling phone numbers pulled from Facebook was selling numbers for $20 per lookup. Similarly, security researchers found a network of deepfake bots on the platform that were generating images of people submitted by users to create non-consensual imagery, some of which involved children. At this point, however, Durov had already been working on Telegram with his brother, and further planned a mobile-first social network with an explicit focus on anti-censorship. Later in April, he told TechCrunch that he had left Russia and had “no plans to go back,” saying that the nation was currently “incompatible with internet business at the moment.” He added later that he was looking for a country that matched his libertarian ideals to base his next startup. On Telegram’s website, it says that Pavel Durov “supports Telegram financially and ideologically while Nikolai (Duvov)’s input is technological.” Currently, the Telegram team is based in Dubai, having moved around from Berlin, London and Singapore after departing Russia. Meanwhile, the company which owns Telegram is registered in the British Virgin Islands.
from us

Telegram AbstractDL
FROM American