Alibaba Group releases Qwen, 14B parameter LLM

nikola-jurkovic

Alibaba Group releases Qwen, 14B parameter LLM

post by Nikola Jurkovic (nikolaisalreadytaken) · 2023-09-28T00:12:03.653Z · LW · GW · 1 comments

This is a link post for https://qianwen-res.oss-cn-beijing.aliyuncs.com/QWEN_TECHNICAL_REPORT.pdf

1 comment

Some highlights from the technical report (github repo here):

Qwen beats every other LLM of a similar size on a wide variety of benchmarks.

Qwen's overall benchmark performance is somewhere between Llama 2 and GPT 3.5

1 comments

Comments sorted by top scores.

comment by Davidmanheim · 2023-09-29T05:22:26.093Z · LW(p) · GW(p)

My comment from Twitter: "Alibaba's release of Qwen-14B without any ethical evaluation, reporting of training data sources, evaluation of misuse potential, red-teaming, or anything else resembling best practice for SOTA models - and the lack of discussion of this fact - is extremely disappointing."

Alibaba Group releases Qwen, 14B parameter LLM

Contents

1 comments