Alibaba Group releases Qwen, 14B parameter LLM

post by nikola (nikolaisalreadytaken) · 2023-09-28T00:12:03.653Z · LW · GW · 1 comment

This is a link post for https://qianwen-res.oss-cn-beijing.aliyuncs.com/QWEN_TECHNICAL_REPORT.pdf

Some highlights from the technical report (GitHub repo here):

Qwen beats every other LLM of a similar size on a wide variety of benchmarks.
Qwen's overall benchmark performance is somewhere between Llama 2 and GPT-3.5.

1 comment

comment by Davidmanheim · 2023-09-29T05:22:26.093Z · LW(p) · GW(p)

My comment from Twitter: "Alibaba's release of Qwen-14B without any ethical evaluation, reporting of training data sources, evaluation of misuse potential, red-teaming, or anything else resembling best practice for SOTA models - and the lack of discussion of this fact - is extremely disappointing."