ROM4AI

2026-05-25 #research #ai

AI Hardware Weekly Digest: WorldKV for Video World Models, Gated DeltaNet-2 Linear Attention, and HRM-Text Efficient Pretraining

kv-cache transformer ai-accelerator

0.0 (0 votes)

2026-05-24 #research #ai

AI Hardware Weekly Digest: Anthropic-Microsoft Maia 200 Deal Talks, LlamaWeb Browser Inference, and InnerQ Hardware-Aware Quantization

llm-inference ai-accelerator edge-ai

0.0 (0 votes)

2026-05-23 #research #ai

AI Hardware Weekly Digest: Runtime-Certified Quantized Attention, BrainChip AKD1500 Mass Production, and Qwen3.7-Max 1M Context

kv-cache llm-inference neuromorphic

0.0 (0 votes)

2026-05-22 #research #ai

AI Hardware Weekly Digest: Exciton-Polariton All-Optical Computing, Samsung Strike Deal, and Custom AI ASIC Landscape

photonic ai-accelerator memory-system

0.0 (0 votes)

2026-05-21 #research #ai

AI Hardware Weekly Digest: TurboQuant 4.6× KV Cache Compression, RePlaid Continuous Diffusion Scaling, and CSIRO Vetra Edge AI

kv-cache llm-inference diffusion-model

0.0 (0 votes)

2026-05-20 #research #ai

AI Hardware Weekly Digest: TriAxialKV Mixed-Precision Quantization, KVDrive Multi-Tier Cache, and NVIDIA $106B Earnings

kv-cache llm-inference ai-accelerator

0.0 (0 votes)

2026-05-19 #research #ai

AI Hardware Weekly Digest: SANA-WM Efficient World Model, Samsung HBM for Mobile, and Google I/O 2026

world-model ai-accelerator memory-system

0.0 (0 votes)

2026-05-19 #research #ai

AI Hardware Weekly Digest: CXMT 719% Revenue Surge, Memristor CIM Breakthrough, and Google I/O 2026

memory-system ai-accelerator neuromorphic

0.0 (0 votes)

2026-05-16 #research #ai

AI Hardware Weekly Digest: KV-RM Static-Graph Serving, Quantization Security Risks, and Samsung Strike Threat

kv-cache quantization llm-inference

0.0 (0 votes)

2026-05-15 #research #ai

AI Hardware Weekly Digest: World Action Models, Multi-Agent Coordination, and Cerebras IPO Pricing

world-model robotics neuromorphic

0.0 (0 votes)

2026-05-14 #research #ai

AI Hardware Weekly Digest: FibQuant KV Cache, KV-Fold Recurrence, and Jensen Huang's China Trip

AI Hardware Weekly Digest: FibQuant KV Cache, KV-Fold Recurrence, and Jensen Huang’s China Trip

kv-cache quantization llm-inference

0.0 (0 votes)

2026-05-13 #research #ai

AI Hardware Weekly Digest: int4 KV Cache Beats fp16 on Apple Silicon, Federation of Experts, and Cerebras IPO

kv-cache quantization llm-inference

0.0 (0 votes)

2026-05-12 #research #ai

AI Hardware Weekly Digest: LaProx KV Cache, SpikingBrain, Cola DLM, and Intel's Neuromorphic Bet

AI Hardware Weekly Digest: LaProx KV Cache, SpikingBrain, Cola DLM, and Intel’s Neuromorphic Bet

kv-cache neuromorphic diffusion-model

0.0 (0 votes)

2026-05-11 #research #ai

AI 硬件研究周报（2026.05.11）：EA-WM 事件感知生成世界模型、RecursiveMAS 递归多智能体系统、机器人世界模型综述

world-model robotics ai-accelerator

0.0 (0 votes)

2026-05-10 #research #ai

AI 硬件研究周报（2026.05.10）：GYAN 神经符号语言模型、Embody4D 4D 世界模型、PV-VAE 预测性视频生成、ParoQuant 旋转量化

neural-symbolic world-model robotics

0.0 (0 votes)

2026-05-07 #research #ai

AI 硬件研究周报（2026.05.07）：OpenAI 机器人硬件分拆上市、Broadcom 10GW 定制加速器、Flow Matching ODE 求解器硬件优化

ai-accelerator robotics training

0.0 (0 votes)

2026-05-06 #research #ai

AI 硬件研究周报（2026.05.06）：NEURON 神经符号临床系统、无 DRAM AI 推理芯片（Fractile）、WindowQuant VLM KV Cache 量化、SNN 无反向传播学习

neural-symbolic ai-accelerator kv-cache

0.0 (0 votes)

2026-05-05 #research #ai

AI 硬件研究周报（2026.05.05）：视觉生成五层范式演进、Dual-Blade 边缘 KV Cache 卸载、RISC-V 成为 AI 硬件开放基础

world-model edge-ai kv-cache

0.0 (0 votes)

2026-05-04 #research #ai

AI 硬件研究周报（2026.05.04）：KV Cache 三维优化（DepthKV/PolyKV/CacheFlow）、HBM-PIM 张量加速、World-R1 几何一致性世界模型

kv-cache memory-system ai-accelerator

0.0 (0 votes)

2026-05-03 #research #ai

AI 硬件研究周报（2026.05.03）：图世界模型统一范式、EdgeSpike 超低功耗 SNN 框架、HfO₂ 忆阻突触降低 70% 能耗

world-model neuromorphic edge-ai

0.0 (0 votes)

2026-05-02 #research #ai

AI 硬件研究周报（2026.05.02）：普林斯顿3D生物混合神经芯片、腾讯HY-Embodied具身模型、LLM递归自我改进的数学不可能性证明

neuromorphic robotics world-model

0.0 (0 votes)

2026-05-01 #research #ai

AI 硬件研究周报（2026.05.01）：具身 AI 的 3D 生成综述、脉冲神经元逻辑电路、Motubrain 世界动作模型

robotics world-model neuromorphic

0.0 (0 votes)

2026-05-01 #research #ai

Agentic Harness Engineering: 可观测性驱动的编码智能体 Harness 自动进化

llm-inference agent-systems world-model

0.0 (0 votes)

2026-04-30 #research #ai

AI 硬件研究周报（2026.04.30）：代理世界模型分类学、行为克隆缩放定律、信号折叠神经形态硬件

world-model robotics neuromorphic

0.0 (0 votes)

2026-04-29 #research #ai

AI 硬件研究周报（2026.04.29）：LingBot-Map 流式 3D 重建、DeepSeek V4 混合注意力架构、MOMO 机器人技能学习

world-model robotics llm-inference

0.0 (0 votes)

2026-04-28 #research #ai

AI 硬件研究周报（2026.04.28）：概率计算处理器、信号折叠神经形态硬件、低功耗计算机视觉挑战

neuromorphic low-power edge-ai

0.0 (0 votes)

2026-04-26 #research #ai

AI 硬件研究周报（2026.04.26）：边缘 LLM 推理的 KV Cache 优化、CPU-GPU 混合注意力、跨数据中心 Prefill 服务

llm-inference kv-cache edge-ai

0.0 (0 votes)

2026-04-25 #ai-accelerator #chiplet

Design Conductor: AI Agent 12小时自主设计1.5GHz RISC-V CPU

ai-accelerator chiplet llm-inference

0.0 (0 votes)

2026-04-25 #research #ai

AI 硬件研究周报（2026.04.18-04.25）：世界模型用于机器人训练、分子忆阻器神经形态硬件、NSF NeuroAI 路线图

world-model robotics neuromorphic

0.0 (0 votes)

2026-04-24 #research #ai

AI 硬件研究周报（2026.04.18-04.24）：LLM 生成硬件的表示瓶颈、概率 Ising 机并行加速、KV Cache 神经垃圾回收

ai-accelerator llm-inference kv-cache

0.0 (0 votes)

2026-04-23 #paper #neuro-symbolic-ai

Hardware-Efficient Neuro-Symbolic Networks with Exp-Minus-Log Operator

原文: arXiv:2604.13871 核心贡献: 提出 DNN-EML 架构，使用单一硬件可实现的 Sheffer 算子实现神经符号网络

neuro-symbolic hardware fpga

0.0 (0 votes)

2026-04-23 #paper #neuro-symbolic-ai

The Price Is Not Right: Neuro-Symbolic AI Outperforms VLAs with 100x Lower Energy

原文: arXiv:2602.19260 作者: Timothy Duggan, Pierrick Lorang, Hong Lu, Matthias Scheutz 机构: Tufts University 核心贡献: 神经符号方法在结构化长视野操作任务上超越 VLA，能耗降低 100 倍

neuro-symbolic embodied-ai vla

0.0 (0 votes)

2026-04-22 #ai-hardware #llm-inference

Switch-Centric In-Network Architecture for Accelerating LLM Inference

tensor-parallelism in-network-computing all-reduce

0.0 (0 votes)

2026-04-22 #neuromorphic-computing #ai-hardware

Neuromorphic Computing for Low-Power Artificial Intelligence

spiking-neural-networks edge-computing energy-efficiency

0.0 (0 votes)

2026-04-22 #embodied-ai #vision-language-models

HY-Embodied-0.5: Embodied Foundation Models for Real-World Agents

embodied-ai vla robot-manipulation

0.0 (0 votes)

2026-04-22 #ai-accelerator #chiplet

EPAC: The Last Dance - 欧洲RISC-V HPC加速器芯片的全栈实践

ai-accelerator chiplet risc-v

0.0 (0 votes)

2026-04-22 #ai-accelerator #llm-inference

CUTEv2: 面向多样化CPU架构的统一可配置矩阵扩展

ai-accelerator llm-inference quantization

0.0 (0 votes)

2026-04-14 #weekly-digest #ai-hardware

AI Hardware Weekly Digest - April 14, 2026

ai-chip embodied-ai world-model

0.0 (0 votes)

2026-04-13 #neural-symbolic #robotics

Build on Priors: 视觉-语言引导的神经符号模仿学习实现数据高效的机器人操作

neural-symbolic robotics world-model

0.0 (0 votes)

2026-04-08 #agent #llm

EvoSkills: 通过协同进化验证实现智能体技能的自我进化

agent-skills co-evolution surrogate-verification

0.0 (0 votes)

2026-04-08 #ai-accelerator #memory-system

AI 硬件加速前沿：从 3D 堆叠内存到 LLM 解码优化

3d-memory processing-in-memory llm-decoding

0.0 (0 votes)

2026-04-07 #paper #quantization

MicroScopiQ: 通过异常值感知微缩放量化加速基础模型

原文: arXiv:2411.05282 | PDF 会议: ISCA 2025, Tokyo, Japan 作者: Akshat Ramachandran, Souvik Kundu, Tushar Krishna 机构: Georgia Institute of Technology, Intel La...

ai-accelerator quantization pruning

0.0 (0 votes)

2026-04-07 #llm-inference #ai-accelerator

TIE Scheduler: Uncertainty-Aware Output Length Prediction for Efficient LLM Inference Scheduling

llm-inference ai-accelerator memory-system

0.0 (0 votes)

2026-04-07 #embodied-ai #robotics

SOLE-R1: Video-Language Reasoning as the Sole Reward for On-Robot Reinforcement Learning

embodied-ai robotics world-model

0.0 (0 votes)

2026-04-07 #neuromorphic #ai-accelerator

Integer-State Dynamics of Quantized Spiking Neural Networks for Efficient Hardware Acceleration

neuromorphic ai-accelerator low-power

0.0 (0 votes)

2026-04-05 #paper #llm-inference

SALS: 潜在空间稀疏注意力实现 KV Cache 压缩

原文链接: arXiv:2510.24273 PDF

llm-inference kv-cache sparsity

0.0 (0 votes)

2026-04-04 #world-model #diffusion-model

Video Generation Models as World Models: Efficient Paradigms, Architectures and Algorithms Survey

world-model diffusion-model edge-ai

0.0 (0 votes)

2026-04-04 #ai-accelerator #llm-inference

GPU-FPGA Heterogeneous Systems for Disaggregated LLM Inference: Memory Processing Pipeline Acceleration

ai-accelerator llm-inference memory-system

0.0 (0 votes)

2026-04-04 #world-model #robotics

EVA: Aligning Video World Models with Executable Robot Actions via Inverse Dynamics Rewards

world-model robotics embodied-ai

0.0 (0 votes)

2026-04-03 #neuromorphic #ai-accelerator

SPINIC: Programmable Superconducting Neuron with In-Memory Computation for Ultra-Efficient Neuromorphic Computing

neuromorphic ai-accelerator low-power

0.0 (0 votes)

2026-04-03 #world-model #robotics

RoboStereo: Dual-Tower 4D Embodied World Models for Unified Policy Optimization

world-model robotics embodied-ai

0.0 (0 votes)

2026-04-03 #research #ai

GPU-FPGA 异构系统加速 LLM 推理中的内存处理

ai-accelerator llm-inference edge-ai

0.0 (0 votes)

2026-04-03 #neural-symbolic #ai-accelerator

AS2: Attention-Based Soft Answer Sets for End-to-End Differentiable Neuro-Soft-Symbolic Reasoning

neural-symbolic llm-inference reasoning

0.0 (0 votes)

2026-04-02 #paper #diffusion-llm

Beyond GEMM-Centric NPUs: 高效扩散 LLM 采样架构

原文: arXiv:2601.20706 | PDF 作者: Binglei Lou, Haoran Wu, Yao Lai, et al. (Imperial College London, University of Cambridge) 核心贡献: 针对扩散 LLM 采样优化 NPU 架构，提出 d-...

ai-accelerator diffusion-model llm-inference

0.0 (0 votes)

2026-04-02 #paper #heterogeneous-computing

异构计算：AI Agent 推理的未来关键

原文: arXiv:2601.22001 | PDF 作者: Aaron Zhao (Imperial College London), Junyi Liu (Microsoft Research) 核心贡献: 提出系统级异构计算是 AI Agent 推理的关键，识别”内存容量墙”问题

ai-accelerator heterogeneous-computing ai-agent

0.0 (0 votes)

2026-04-02 #paper #kernel-generation

KernelCraft: 面向新兴硬件的Agentic底层内核生成基准测试

原文: arXiv:2603.08721 | PDF 作者: Jiayi Nie, Haoran Wu, et al. (University of Cambridge, Imperial College London, AMD, University of Edinburgh) 核心贡献: 首个评估 LL...

ai-accelerator llm-inference kernel-generation

0.0 (0 votes)

2026-04-02 #paper #vision-transformer

EQ-ViT: Algorithm-Hardware Co-Design for Real-Time Vision Transformer Acceleration on Versal ACAP

原文: EQ-ViT: Algorithm-Hardware Co-Design for End-to-End Acceleration of Real-Time Vision Transformer Inference on Versal ACAP Architecture 会议: ESWEEK 2024...

ai-accelerator algorithm-hardware-co-design transformer

0.0 (0 votes)

2026-04-02 #research #neuromorphic-computing

Neuromorphic Computing Roadmap: Scaling Brain-Inspired AI to Production

neuromorphic scaling roadmap

0.0 (0 votes)

2026-04-02 #research #edge-ai

MediaTek Genio Pro: 50+ TOPS Edge AI Chip for Robotics and Embodied Intelligence

edge-ai robotics npu

0.0 (0 votes)

2026-04-02 #research #neuromorphic-computing

Innatera Synfire: Unifying the Neuromorphic Ecosystem for Edge AI

neuromorphic spiking-neural-networks edge-ai

0.0 (0 votes)

2026-04-02 #research #ai-accelerator

Google TurboQuant: 6x KV Cache Compression with Near-Optimal Distortion Rate

TurboQuant: Online Vector Quantization with Near-Optimal Distortion Rate

kv-cache quantization memory-efficiency

0.0 (0 votes)

2026-04-01 #paper #hardware-verification

UCV: 通过软件原生优化普及和加速硬件验证

论文: Democratizing and Accelerating Hardware Verification with Software-Native Optimization 会议: ISCA 2026 核心贡献: UnityChip Verification (UCV) - 软件原生硬件验证平台

ai-accelerator chiplet edge-ai

0.0 (0 votes)

2026-04-01 #research #ai-accelerator

VMXDOTP: RISC-V Vector ISA Extension for Efficient Microscaling (MX) Format Acceleration

risc-v quantization ai-accelerator

0.0 (0 votes)

2026-04-01 #research #ai-accelerator

UCV: 软件原生硬件验证平台 - ISCA 2026

UCV: 通过软件原生优化实现硬件验证的民主化与加速

ai-accelerator chiplet edge-ai

0.0 (0 votes)

2026-04-01 #research #neuromorphic

Programmable Superconducting Neuron for Ultra-Efficient Neuromorphic Computing

neuromorphic superconducting ai-accelerator

0.0 (0 votes)

2026-04-01 #research #ai-accelerator

ReNN-RV: Run-time PE Reconfiguration for DNN Inference Acceleration with Custom RISC-V ISA

risc-v ai-accelerator edge-ai

0.0 (0 votes)

2026-04-01 #research #world-model

LeWorldModel: Stable End-to-End JEPA World Models from Pixels

world-model jepa embodied-ai

0.0 (0 votes)

2026-04-01 #research #ai-accelerator

Helios: Hardware-Software Co-design for 3D-DRAM-based LLM Serving Accelerator

memory-system llm-inference ai-accelerator

0.0 (0 votes)

2026-04-01 #research #ai-accelerator

ChatNeuroSim: LLM Agent Framework for Automated CIM Accelerator Deployment

ChatNeuroSim: LLM Agent Framework for Automated CIM Accelerator Deployment and Optimization

memory-system ai-accelerator llm-inference

0.0 (0 votes)

2026-04-01 #research #ai-accelerator

AA-DiT: Algorithm-Architecture Co-Design for Diffusion Transformer Acceleration

diffusion-model ai-accelerator transformer

0.0 (0 votes)

2026-03-31 #research #photonic-computing

PRISM: Photonic Similarity Engine for KV Cache Block Selection in Long-Context LLM Inference

transformer llm-inference ai-accelerator

0.0 (0 votes)

2026-03-31 #research #neuromorphic-computing

PdNeuRAM: Forming-Free Multi-Bit ReRAM for Energy-Efficient Neuromorphic Computing

PdNeuRAM: Forming-Free, Multi-bit Pd/HfO₂ ReRAM for Energy-Efficient Neuromorphic Computing

edge-ai neuromorphic low-power

0.0 (0 votes)

2026-03-31 #research #neuro-symbolic-ai

Neuro-Symbolic AI Survey: Task-Directed Advances in the Black-Box Era

Neuro-Symbolic Artificial Intelligence: A Task-Directed Survey in the Black-Box Models Era

edge-ai transformer low-power

0.0 (0 votes)

2026-03-31 #research #embodied-ai

LeWorldModel: Stable End-to-End JEPA from Pixels for Embodied AI

LeWorldModel: Stable End-to-End Joint-Embedding Predictive Architecture from Pixels

ai-accelerator training robotics

0.0 (0 votes)

2026-03-31 #research #ai

Hummingbird+: Advancing FPGA-based LLM Deployment from Research Prototype to Edge Product

ai-accelerator llm-inference edge-ai

0.0 (0 votes)

2026-03-31 #research #ai

Hummingbird: A Smaller and Faster Large Language Model Accelerator on Embedded FPGA

ai-accelerator llm-inference edge-ai

0.0 (0 votes)

2026-03-30 #research #vision-transformer

ME-ViT: Memory-Efficient FPGA Accelerator for Vision Transformers

ME-ViT: A Single-Load Memory-Efficient FPGA Accelerator for Vision Transformers

transformer multi-modal llm-inference

0.0 (0 votes)

2026-03-30 #daily-roundup #neuromorphic

Daily Research: Neuromorphic Computing & Spiking Neural Networks

🔍 Today’s Research Focus

sparsity edge-ai neuromorphic

0.0 (0 votes)

2026-03-30 #daily-roundup #llm

Daily Research Roundup: LLM Hardware Acceleration & World Models

🔍 Today’s Research Focus

edge-ai transformer low-power

0.0 (0 votes)

2026-03-26 #research #ai

Understanding Bottlenecks for Efficiently Serving LLM Inference With KV Offloading

transformer llm-inference ai-accelerator

0.0 (0 votes)

2026-03-26 #research #ai

Speculating Experts Accelerates Inference for Mixture-of-Experts: 通过专家预取加速 MoE 推理

transformer llm-inference ai-accelerator

0.0 (0 votes)

2026-03-26 #robotics #neuro-symbolic

The Price Is Not Right: Neuro-Symbolic Methods Outperform VLAs on Structured Long-Horizon Manipulation Tasks

multi-modal ai-accelerator robotics

0.0 (0 votes)

2026-03-25 #research #ai

VLA-Perf: VLA 推理性能全景分析——NVIDIA 首个系统性研究

robotics multi-modal world-model

0.0 (0 votes)

2026-03-25 #research #ai

DS2SC-Agent: 从数据手册到 SystemC 模型的多智能体自动化生成流水线

chiplet llm-inference transformer

0.0 (0 votes)

2026-03-24 #llm #compression

ZipServ: Fast and Memory-Efficient LLM Inference with Hardware-Aware Lossless Compression

原文链接: arXiv PDF

memory-system llm-inference transformer

0.0 (0 votes)

2026-03-24 #accelerator #isa

MINISA: Minimal Instruction Set Architecture for Next-gen Reconfigurable Inference Accelerator

原文链接: arXiv PDF

llm-inference ai-accelerator edge-ai

0.0 (0 votes)

2026-03-23 #research #ai

Large Video Planner: 基于视频生成的通用机器人控制新范式

transformer multi-modal llm-inference

0.0 (0 votes)

2026-03-23 #research #ai

Large Video Planner: 用视频生成实现通用机器人控制

transformer multi-modal llm-inference

0.0 (0 votes)

2026-03-23 #research #ai

History-Guided Video Diffusion: 用历史引导实现超长视频生成

multi-modal diffusion-model llm-inference

0.0 (0 votes)

2026-03-23 #engineering #ai

模型够聪明之后，工程师该做什么：Harness Engineering 实战指南

memory-system ai-accelerator llm-inference

0.0 (0 votes)

2026-03-23 #research #ai

Design Conductor: AI 自主构建 1.5GHz RISC-V CPU 的突破性进展

llm-inference transformer ai-accelerator

0.0 (0 votes)

2026-03-20 #research #ai

ZipServ: 硬件感知的无损压缩加速 LLM 推理

ZipServ: Fast and Memory-Efficient LLM Inference with Hardware-Aware Lossless Compression

memory-system llm-inference transformer

0.0 (0 votes)

2026-03-17 #research #ai

Mozart: Modularized and Efficient MoE Training on 3.5D Wafer-Scale Chiplet Architectures

edge-ai transformer llm-inference

0.0 (0 votes)

2026-03-17 #research #ai

HyperOffload: 图驱动的分层内存管理让大模型突破显存限制

memory-system llm-inference transformer

0.0 (0 votes)

2026-03-16 #research #ai

PointWorld: Scaling 3D World Models for In-The-Wild Robotic Manipulation

transformer multi-modal llm-inference

0.0 (0 votes)

2026-03-16 #research #ai

Orion: 苹果神经引擎 (ANE) 上的 LLM 训练与推理系统

Orion: Characterizing and Programming Apple’s Neural Engine for LLM Training and Inference

edge-ai transformer llm-inference

0.0 (0 votes)

2026-03-16 #research #ai

History-Guided Video Diffusion: 用历史引导实现超长视频生成

multi-modal diffusion-model llm-inference

0.0 (0 votes)

2026-03-16 #research #ai

GOMA: 通过解析建模实现空间加速器的几何最优映射

GOMA: Geometrically Optimal Mapping via Analytical Modeling for Spatial Accelerators

llm-inference transformer ai-accelerator

0.0 (0 votes)

2026-03-14 #research #ai

Synthesis-in-the-Loop Evaluation of LLMs for RTL Generation: Quality, Reliability, and Failure Modes

transformer llm-inference ai-accelerator

0.0 (0 votes)

2026-03-13 #research #ai

TOM: 三元只读存储器加速器赋能边缘智能大模型

edge-ai transformer quantization

0.0 (0 votes)

2026-03-13 #research #ai

SNAP-V: 面向小型脉冲神经网络的可配置神经形态 RISC-V SoC

low-power edge-ai neuromorphic

0.0 (0 votes)

2026-03-13 #research #ai

ROMA: 基于只读存储器的 QLoRA 边缘设备 LLM 加速器

edge-ai transformer low-power

0.0 (0 votes)

2026-03-13 #research #ai

MedBayes-Lite: 临床 Transformer 的轻量级贝叶斯不确定性量化框架

llm-inference transformer ai-accelerator

0.0 (0 votes)

2026-03-13 #research #ai

LLM 推理硬件的挑战与研究方向：内存与互连是核心瓶颈

edge-ai transformer low-power

0.0 (0 votes)

2026-03-13 #research #ai

LEGOSim: 多芯片异构集成的统一并行仿真框架

chiplet ai-accelerator llm-inference

0.0 (0 votes)

2026-03-11 #llm #accelerator

Taalas: 模型专用硬件 - 将AI模型转化为硅芯片

memory-system edge-ai llm-inference

0.0 (0 votes)

2026-03-11 #llm #accelerator

Taalas: Model-Specialized Hardware - Turning AI Models into Silicon

edge-ai transformer quantization

0.0 (0 votes)

2026-03-11 #llm #accelerator

ROMA: 基于ROM的QLoRA边缘设备LLM加速器

edge-ai transformer low-power

0.0 (0 votes)

2026-03-11 #neural-symbolic #ai-hardware

Neural-Symbolic AI Hardware: Unifying Pattern Learning and Logic

Why this direction matters

ai-accelerator robotics world-model

0.0 (0 votes)

2026-03-11 #llm #accelerator

Hardwired LLM Accelerators: From Programmable Kernels to Fixed-Flow Inference

Motivation

edge-ai transformer quantization

0.0 (0 votes)

2026-03-11 #diffusion #accelerator

Diffusion Model Accelerators: Efficient Sampling Beyond Brute-Force Denoising

Problem framing

transformer multi-modal llm-inference

0.0 (0 votes)

2026-03-11 #chiplet #3d-integration

3D Chiplet Systems for AI: Bandwidth-Centric Compute Integration

Why chiplets for AI now

edge-ai transformer llm-inference

0.0 (0 votes)

2025-05-22 #research #ai-accelerator

HSCO-Bench: 首个端到端硬件软件协同设计基准测试

HSCO-Bench: An Agent-Driven End-to-End Hardware-Software Co-design Benchmark for Systems-on-Chip

ai-accelerator llm-inference benchmark

0.0 (0 votes)

2025-05-22 #research #llm-inference

CPPL: 面向 LLM 的电路提示编程语言

CPPL: A Circuit Prompt Programming Language

llm-inference ai-accelerator benchmark

0.0 (0 votes)

2025-04-28 #research #ai-accelerator

DeepStack: 分布式3D堆叠AI加速器的设计空间探索框架

ai-accelerator chiplet llm-inference

0.0 (0 votes)

2025-04-23 #research #neural-symbolic

A Scalable Approach to Probabilistic Neuro-Symbolic Robustness Verification

neural-symbolic autonomous-driving robotics

0.0 (0 votes)

2025-04-07 #research #ai

MicroScopiQ: 通过异常值感知微缩放量化加速基础模型

quantization ai-accelerator llm-inference

0.0 (0 votes)