The Doubao Large Model team proposes UltraMem, a sparse model architecture
On February 12, the ByteDance Doubao Large Model Foundation team announced UltraMem, a sparse model architecture that decouples computation from parameters and resolves the memory-access bottleneck of inference while preserving model quality. According to the team, the architecture effectively addresses the high memory-access cost of MoE (Mixture of Experts) inference: inference is 2-6x faster than with the MoE architecture, and inference cost can be reduced by up to 83%.
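The core idea of "decoupling computation from parameters" can be illustrated with a sparse memory-layer sketch: the model holds a very large table of value vectors (where most parameters live), but each token reads only a handful of them, so per-token compute and memory traffic stay small as the parameter count grows. The names, sizes, and retrieval scheme below are illustrative assumptions, not the actual UltraMem design.

```python
import numpy as np

rng = np.random.default_rng(0)

d_model = 64          # hidden size (assumed for illustration)
num_values = 100_000  # large value table: most parameters live here
top_k = 8             # value rows actually read per token

# Hypothetical parameter tables; a real design would structure the keys
# (e.g. product keys) so that scoring does not touch every entry.
keys = rng.standard_normal((num_values, d_model)).astype(np.float32)
values = rng.standard_normal((num_values, d_model)).astype(np.float32)

def memory_layer(x: np.ndarray) -> np.ndarray:
    """Sparse lookup: parameter count scales with num_values, but the
    value-table memory access per token scales only with top_k."""
    scores = keys @ x                               # score every key (simplified)
    idx = np.argpartition(scores, -top_k)[-top_k:]  # indices of top_k keys
    w = np.exp(scores[idx] - scores[idx].max())     # softmax over top_k scores
    w /= w.sum()
    return w @ values[idx]                          # read only top_k value rows

x = rng.standard_normal(d_model).astype(np.float32)
out = memory_layer(x)
print(out.shape)  # (64,)
```

The sketch shows why inference memory access stays low: however large `num_values` becomes, only `top_k` rows of the value table are fetched per token, in contrast to MoE, where each activated expert pulls in a full feed-forward block.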