Tag: LLM
All the articles with the tag "LLM".
Amazon EC2 P6 Instances: Powering AI with NVIDIA Blackwell GPUs
Published: at 10:24 AM
Amazon's upcoming EC2 P6 instances, powered by NVIDIA Blackwell GPUs, aim to accelerate AI innovation by providing enhanced performance and cost-effectiveness for LLM training and inference on AWS.
NVIDIA Showcases Local LLM Power on RTX with LM Studio and Llama.cpp, Previews Blackwell Architecture
Published: at 02:13 PM
NVIDIA promotes running LLMs locally on RTX GPUs using tools like LM Studio and Llama.cpp, emphasizing privacy, low latency, and customization benefits. The article also teases the enhanced AI capabilities of the upcoming Blackwell architecture.
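For readers curious about the local-inference workflow the article describes, here is a minimal sketch using the llama-cpp-python bindings for Llama.cpp. The model file name and prompt are illustrative placeholders rather than anything from the article, and full GPU offload assumes a CUDA-enabled build of the library.

```python
# Minimal sketch: local LLM inference with llama.cpp via llama-cpp-python.
# The GGUF path and prompt are placeholders, not taken from the article.
from llama_cpp import Llama

llm = Llama(
    model_path="./llama-3.1-8b-instruct-Q4_K_M.gguf",  # any local GGUF model file
    n_gpu_layers=-1,  # offload all layers to the GPU (requires a CUDA build)
    n_ctx=4096,       # context window size
)

out = llm("Explain why running an LLM locally helps with privacy.", max_tokens=128)
print(out["choices"][0]["text"])
```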
Fujitsu and Supermicro Deepen AI Computing Partnership
Published: at 04:32 AM
Fujitsu and Supermicro are expanding their AI collaboration to deliver integrated hardware and software solutions optimized for generative AI and LLMs, aiming to accelerate AI adoption by enterprises.
Microsoft's BitNet: A Revolution in LLMs with Near-Lossless Compression and 1-bit Transformers
Published: at 07:05 PM
Microsoft's BitNet is a 1-bit Transformer LLM that achieves performance comparable to FP16 models while being significantly smaller and more energy-efficient, making AI more widely accessible.
NVIDIA Optimizations Significantly Accelerate Meta's Llama 4 AI Model
Published: at 02:57 PM
NVIDIA has announced significant speed optimizations for Meta's upcoming Llama 4 large language model, delivering faster training and inference and highlighting the close collaboration between hardware and AI model developers.
Chinese Start-Up Develops AI Framework to Reduce Dependence on NVIDIA GPUs
Published: at 10:10 AM
A new AI framework from a Tsinghua University-affiliated team offers significant performance improvements and reduced GPU dependency, challenging NVIDIA's market position in AI hardware, especially within the Chinese market.
M3 Ultra Chip Handles DeepSeek R1 Model with 671 Billion Parameters
Published: at 01:09 PM
The Apple M3 Ultra chip successfully runs the DeepSeek R1 model with 671 billion parameters, showcasing its powerful unified memory architecture and efficiency for local AI processing.
Foxconn Introduces 'FoxBrain' AI Model to Enhance Manufacturing and Supply Chain Management
Published: at 11:26 AM
Foxconn's 'FoxBrain' LLM, built on Meta's Llama 3.1 and trained with NVIDIA's H100 GPUs, aims to enhance manufacturing and supply chain management, marking a significant advancement in AI applications for the company.