Skip to content

support shard embeding#70

Open
qingshui wants to merge 468 commits intohutuxian:paddleboxfrom
qingshui:paddlebox
Open

support shard embeding#70
qingshui wants to merge 468 commits intohutuxian:paddleboxfrom
qingshui:paddlebox

Conversation

@qingshui
Copy link
Copy Markdown

PR types

PR changes

Describe

qingshui and others added 30 commits October 22, 2021 15:52
fix ins bug, add mean logloss gpu op
add gpu sample memory pool
use diff thres during pull sparse
humingqing and others added 30 commits December 18, 2023 19:15
add fill zero in fused_seqpool_cvm
add fused seq tensor && support transpose batch fc weight
* fused_seqpool_cvm_with_conv support filter by threshold

* add fill zero in fused_seqpool_cvm

* add fused seq tensor && support transpose batch fc weight

---------

Co-authored-by: mojingcj <ChengJing_dhu@163.com>
Co-authored-by: jiaoxuewu <jiaoxuewu@163.com>
Co-authored-by: yuandong1998 <1377526365@qq.com>
Co-authored-by: shangzhongbin <shangzhongbin@baidu.com>
* fused_seqpool_cvm_with_conv support filter by threshold

* add fill zero in fused_seqpool_cvm

* add fused seq tensor && support transpose batch fc weight

---------

Co-authored-by: mojingcj <ChengJing_dhu@163.com>
Co-authored-by: jiaoxuewu <jiaoxuewu@163.com>
Co-authored-by: yuandong1998 <1377526365@qq.com>
Co-authored-by: shangzhongbin <shangzhongbin@baidu.com>
fix fused query seq tensor compare case
* fused_seqpool_cvm_with_conv support filter by threshold

* add fill zero in fused_seqpool_cvm

* add fused seq tensor && support transpose batch fc weight

* fix fused query seq tensor compare case

---------

Co-authored-by: mojingcj <ChengJing_dhu@163.com>
Co-authored-by: jiaoxuewu <jiaoxuewu@163.com>
Co-authored-by: yuandong1998 <1377526365@qq.com>
Co-authored-by: shangzhongbin <shangzhongbin@baidu.com>
…u thread num, fused_seqpool_cvm gpu memory alloc optimize
修复dump core问题,优化大数据写磁盘内存会超问题改成分段写入,优化fused_seqpool_cvm concat性能,优化fused_seqpool_cvm显存分配以及连续访问提升性能单op H800机型提升60倍整体提升25%
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.