Showing 120 of 120on this page. Filters & sort apply to loaded results; URL updates for sharing.120 of 120 on this page
GitHub - NormXU/Consistent-DynamicNTKRoPE: An Experiment on Dynamic NTK ...
Inconsistent Rotation Base for Dynamic NTK Scaling RoPE · Issue #25104 ...
Trying to apply Dynamic NTK RoPE scaling into exllama. · Issue #126 ...
Experiments on Consistent Rotation Base for Dynamic NTK RoPE : r/LocalLLaMA
vLLM Qwen-style dynamic NTK ROPE kernel · Issue #693 · QwenLM/Qwen · GitHub
支持多长输入 TurboMind supports Qwen-7B, dynamic NTK-RoPE scaling and dynamic ...
Kinematic and Dynamic Modelling for a Class of Hybrid Robots Composed ...
initial (left) and final (right) layerwise NTK for layer 2, the second ...
DYNAMIC DUO-NTK QUINTEZ and Lil D - YouTube
大模型长度扩展:直接外推, PI, NTK-aware, NTK-by-parts, Dynamic NTK, ALiBi, YaRN, S2 ...
Track Your NTK Academic Group Order Status - AfterShip
Dynamic modeling of LTR arm system. | Download Scientific Diagram
Dynamic vs. Static - What's the Difference? | This vs. That
A Dynamic Model for Continuous Lowering Analysis of Deep-Sea Equipment ...
dynamic - Codesandbox
Figure 1 from Modeling of flexible non-linear dynamic links in Nano ...
Dynamic Modeling of Planar Multi-Link Flexible Manipulators
NTK
Dynamic Reflection | GANREF
A Multi-Layer, Multi-Robot Control Architecture for Long-Range, Dynamic ...
[논문 리뷰] DLO: Dynamic Layer Operation for Efficient Vertical Scaling of LLMs
[Question] NTK rope in decoding phase ? · Issue #571 · InternLM ...
Dr.LLM: Dynamic Layer Routing in LLMs Neat technique to reduce ...
Opus Evo – Tactical Optimization of Dynamic Scenarios | Systecon Group ...
Dynamic DataFrame filtering in Streamlit | by Oleksandr Arsentiev ...
Frame structure diagram for dynamic link adaptation operation ...
NTK - Honing, grinding, hard turning, cnc-turning and more
RoPE外推优化——支持192K上下文长度
RoPE外推优化——支持192K上下文长度 - 知乎
探秘Transformer系列之(23)--- 长度外推 - 知乎
如何修改大模型的位置编码 --以LLama为例_llama3 修改位置-CSDN博客
GitHub - Bowen-n/vllm-dynamic-ntk: This version implements Qwen's ...
[AI算法] 什么事RoPE scaling
智源FlagAttention:面向多种训练芯片的大模型高性能Triton算子集 - 知乎
从Rope到ALiBi、PI、NTK-Aware、Dynamic Scaling、NTK-by-parts、Yarn长度外推方案详解 - 知乎
【手撕 YaRN】LLM 训短推长,一直外推一直爽!OpenAI / DeepSeek / Qwen 严选位置编码 - 知乎
Inductive Positions in Transformers | The Gradient
DynamicNTKRoPE算法 - 知乎
大模型外推 | 外推方法 - 知乎
位置编码之路:SIN->ALiBi->RoPE ->PI->NTK->YARN - 知乎
深度学习基础:Neural Tangent Kernel - 知乎
scaling-rope/README.md at main · OpenLMLab/scaling-rope · GitHub
Neural Tangent Kernel (NTK)基础推导 - Gearlesskai - 博客园
理解Neural Tangent Kernel(NTK) - 知乎
Quickstart Guide — InternEvo 0.3.0 documentation
Comparison of NLP positional encoding schemes : r/MLQuestions
Summary post for higher context sizes for this week. For context up to ...
LLM学习笔记-长度外推技术 - 老张哈哈哈 - 博客园
大規模言語モデル(LLM)の進化と主要技術
Extending the RoPE | EleutherAI Blog
#llmam | Tao Jin
Qwen-7B推理过程详解 - 知乎
Long-Context下LLM模型架构全面介绍_long context-CSDN博客
大模型结构基础(二):Positional Encodings 的升级 - 知乎
万字长文梳理 LLM 中的长文本问题-CSDN博客
Building an LLM Stack, Part 1: Implementing Encoders and Decoders ...
제미나이(Gemini) 3.0 동적뷰(Dynamic View) 기능 사용 방법 : 네이버 블로그
【LLM】相关技术总结 | Xinyao
LLM上下文窗口突破200万!无需架构变化+复杂微调,轻松扩展8倍 | 人人都是产品经理
【手撕LLM-NTK RoPE】长文本“高频外推、低频内插“从衰减性视角理解 - 知乎
从0开始实现LLM:5、长上下文优化(代码篇)YaRN/CLEX/LongLoRA/LM-Infinite/StreamingLLM - 知乎
LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens|AI Paper ...
FY26 CIP p.2 - 2/1/25 (part 2 of 2)
大模型处理长上下文方法一览,零基础入门到精通,收藏这篇就够了~_大模型 长上下文-CSDN博客
RoPE外推的缩放法则 —— 尝试外推RoPE至1M上下文 - 知乎
LLM系列 | 26:阿里千问Qwen模型解读、本地部署 - 知乎
从0开始实现LLM:4、长上下文优化(理论篇) - 知乎
Papers Explained 363: UltraLong. This work introduces an efficient ...
LLM之位置编码算法总结 - 知乎
A+B Industrial Tools Company - DE_Tungaloy-NTK_Promotion - Pagina 1
LLM技术:ICL Principle(持续更新) - 知乎
ROPE及各种变体-代码解读_rope代码-CSDN博客
LLM Inference Acceleration: GPU Optimization for Attention in the ...
Understanding Transformers & the Architecture of LLMs
Research Projects - Vinoth
Modular LLM Architectures | AI Tutorial | Next Electronics
01解读技术报告初识书生.浦语2——internlm2系列 | 柠檬CC
Webb Cracks Case of Inflated Exoplanet - NASA Science
Transformer为什么需要“位置编码”?_transformer位置编码-CSDN博客
LongRoPE2: Near-Lossless LLM Context Window Scaling · HF Daily Paper ...
20B的体量,70B的性能,书生·浦语InternLM-20B带领开源大模型进入新时代 - 上海人工智能实验室
【手撕LLM-NTK RoPE】长文本“高频外推、低频内插“从衰减性视角理解_ntk外推-CSDN博客
NTKプライベートツアー (2026) - All You SHOULD Know Before Going (with Reviews)
Norquinal/Qwen-7B-reupload · Hugging Face