
issue/1117: metax support flash-attn#1119

Open
Ceng23333 wants to merge 4 commits into main from metax_fla

Conversation

@Ceng23333
Collaborator

@Ceng23333 Ceng23333 requested review from a team, Ziminli, kilinchange, voltjia and wooway777 April 3, 2026 04:34
Comment thread xmake.lua Outdated
Comment thread include/infinicore/adaptor/aten_adaptor.hpp Outdated

void run(void *planned_meta) {
#ifdef ENABLE_FLASH_ATTN
#ifdef ENABLE_NVIDIA_API
Collaborator


The bodies of these two ifs are identical, aren't they?

Collaborator


They do indeed look identical.

Collaborator


Yiqun just said that QY also needs this, so can we just merge the branches and drop the check? Or do all three platforms need to be listed explicitly so the other platforms are unaffected?

#include <stdexcept>

#ifdef ENABLE_FLASH_ATTN
#if defined(ENABLE_NVIDIA_API) || defined(ENABLE_METAX_API)
Collaborator


Should QY be added here as well? Or is this if unnecessary?

Collaborator Author


Judging from the original implementation, QY doesn't need this.

Collaborator


If it isn't added, it's needed by default, because QY takes the same path as NV by default.

@wooway777 wooway777 requested a review from qinyiqun April 16, 2026 01:25
Comment thread src/infinicore/ops/multi_head_attention_varlen/mha_varlen_flashattn.cc Outdated

void run(void *planned_meta) {
#ifdef ENABLE_FLASH_ATTN
#ifdef ENABLE_NVIDIA_API
Collaborator


They do indeed look identical.


void run(void *planned_meta) {
#ifdef ENABLE_FLASH_ATTN
#ifdef ENABLE_NVIDIA_API
Collaborator


Yiqun just said that QY also needs this, so can we just merge the branches and drop the check? Or do all three platforms need to be listed explicitly so the other platforms are unaffected?

auto out_tensor = infinicore::adaptor::to_aten_tensor(p->out);
// Paged KV caches must be contiguous for flash-attn; avoid extra copies for q/metadata when already dense.
auto out_at = infinicore::adaptor::to_aten_tensor(p->out);
const bool out_need_copy_back = !out_at.is_contiguous();
Collaborator


Aren't these two contiguous calls unnecessary on NVIDIA itself?
Also, this should be changed to call our own contiguous first and then convert to an aten tensor.

auto v_cache = infinicore::adaptor::to_aten_tensor(p->v_cache);
#elif defined(ENABLE_QY_API)
#elif defined(ENABLE_QY_API) || defined(ENABLE_METAX_API)
auto k_cache = infinicore::adaptor::to_aten_tensor(p->k_cache).contiguous();
Collaborator


Call our own contiguous first, then convert to an aten tensor.

VarlenFlashPrepared t;
// Varlen flash-attn: keep k/v contiguous for dense/paged layout; avoid extra copies for q/metadata when already dense.
t.q = infinicore::adaptor::to_aten_tensor(p->q);
t.k = infinicore::adaptor::to_aten_tensor(p->k).contiguous();
Collaborator


Call our own contiguous first, then convert to an aten tensor.

@Ceng23333 Ceng23333 requested a review from wooway777 April 22, 2026 02:35
Signed-off-by: Ceng23333 <441651826@qq.com>
