Author: Aswin Ak

Aswin Ak
205 POSTS0 COMMENTS
Aswin AK is a consulting intern at MarkTechPost. He is pursuing his Dual Degree at the Indian Institute of Technology, Kharagpur. He is passionate about data science and machine learning, bringing a strong academic background and hands-on experience in solving real-life cross-domain challenges.

Hugging Face Releases OlympicCoder: A Series of Open Reasoning AI Models that can Solve Olympiad-Level Programming Problems

In the realm of competitive programming, both human participants and artificial intelligence systems encounter a set of unique challenges. Many existing code generation models...

From Genes to Genius: Evolving Large Language Models with Nature’s Blueprint

Large language models (LLMs) have transformed artificial intelligence with their superior performance on various tasks, including natural language understanding and complex reasoning. However, adapting...

This AI Paper from Google Unveils an AI System that Masters Disease Management and Medication Reasoning Better than Ever

Applying large language models (LLMs) in clinical disease management has numerous critical challenges. Although the models have been effective in diagnostic reasoning, their application...

Rethinking MoE Architectures: A Measured Look at the Chain-of-Experts Approach

Large language models have significantly advanced our understanding of artificial intelligence, yet scaling these models efficiently remains challenging. Traditional Mixture-of-Experts (MoE) architectures activate only...

MedHELM: A Comprehensive Healthcare Benchmark to Evaluate Language Models on Real-World Clinical Tasks Using Real Electronic Health Records

Large Language Models (LLMs) are widely used in medicine, facilitating diagnostic decision-making, patient sorting, clinical reporting, and medical research workflows. Though they are exceedingly...

Tencent AI Lab Introduces Unsupervised Prefix Fine-Tuning (UPFT): An Efficient Method that Trains Models on only the First 8-32 Tokens of Single Self-Generated Solutions

Unleashing a more efficient approach to fine-tuning reasoning in large language models, recent work by researchers at Tencent AI Lab and The Chinese University...

Elevating AI Reasoning: The Art of Sampling for Learnability in LLM Training

Reinforcement learning (RL) has been a core component in training large language models (LLMs) to perform tasks that involve reasoning, particularly mathematical problem-solving. A...

Microsoft AI Releases Phi-4-multimodal and Phi-4-mini: The Newest Models in Microsoft’s Phi Family of Small Language Models (SLMs)

In today’s rapidly evolving technological landscape, developers and organizations often grapple with a series of practical challenges. One of the most significant hurdles is...

Monte Carlo Tree Diffusion: A Scalable AI Framework for Long-Horizon Planning

Diffusion models are promising in long-horizon planning by generating complex trajectories through iterative denoising. However, their ability to improve performance through more computation at...

Hume Introduces Octave TTS: A New Text-to-Speech Model that Creates Custom AI Voices with Tailored Emotions

In the rapidly evolving field of digital communication, traditional text-to-speech (TTS) systems have often struggled to capture the full range of human emotion and...

Microsoft Researchers Introduces BioEmu-1: A Deep Learning Model that can Generate Thousands of Protein Structures Per Hour on a Single GPU

Proteins are the essential component behind nearly all biological processes, from catalyzing reactions to transmitting signals within cells. While advances like AlphaFold have transformed...

What are AI Agents? Demystifying Autonomous Software with a Human Touch

In today's digital landscape, technology continues to advance at a steady pace. One development that has steadily gained attention is the concept of the...