Author: Nikhil

Nikhil
423 POSTS0 COMMENTS
Nikhil is an intern consultant at Marktechpost. He is pursuing an integrated dual degree in Materials at the Indian Institute of Technology, Kharagpur. Nikhil is an AI/ML enthusiast who is always researching applications in fields like biomaterials and biomedical science. With a strong background in Material Science, he is exploring new advancements and creating opportunities to contribute.

Uploading Datasets to Hugging Face: A Step-by-Step Guide

Part 1: Uploading a Dataset to Hugging Face Hub Introduction This part of the tutorial walks you through the process of uploading a custom dataset to...

OpenAI Introduces o3 and o4-mini: Progressing Towards Agentic AI with Enhanced Multimodal Reasoning

​Today, OpenAI introduced two new reasoning models—OpenAI o3 and o4-mini—marking a significant advancement in integrating multimodal inputs into AI reasoning processes.​ OpenAI o3: Advanced Reasoning...

MIT Researchers Introduce DISCIPL: A Self-Steering Framework Using Planner and Follower Language Models for Efficient Constrained Generation and Reasoning

Language models predict sequences of words based on vast datasets and are increasingly expected to reason and perform complex linguistic manipulations. Yet, despite their...

From Logic to Confusion: MIT Researchers Show How Simple Prompt Tweaks Derail LLM Reasoning

Large language models are increasingly used to solve math problems that mimic real-world reasoning tasks. These models are tested for their ability to answer...

Reflection Begins in Pre-Training: Essential AI Researchers Demonstrate Early Emergence of Reflective Reasoning in LLMs Using Adversarial Datasets

What sets large language models (LLMs) apart from traditional methods is their emerging capacity to reflect—recognizing when something in their response doesn’t align with...

Transformers Gain Robust Multidimensional Positional Understanding: University of Manchester Researchers Introduce a Unified Lie Algebra Framework for N-Dimensional Rotary Position Embedding (RoPE)

Transformers have emerged as foundational tools in machine learning, underpinning models that operate on sequential and structured data. One critical challenge in this setup...

Foundation Models No Longer Need Prompts or Labels: EPFL Researchers Introduce a Joint Inference Framework for Fully Unsupervised Adaptation Using Fine-Tuning and In-Context Learning

Foundation models, often massive neural networks trained on extensive text and image data, have significantly shifted how artificial intelligence systems handle language and vision...

Reasoning Models Know When They’re Right: NYU Researchers Introduce a Hidden-State Probe That Enables Efficient Self-Verification and Reduces Token Usage by 24%

Artificial intelligence systems have made significant strides in simulating human-style reasoning, particularly mathematics and logic. These models don't just generate answers—they walk through a...

Step by Step Guide on Converting Text to High-Quality Audio Using an Open Source TTS Model on Hugging Face: Including Detailed Audio File Analysis...

In this tutorial, we demonstrate a complete end-to-end solution to convert text into audio using an open-source text-to-speech (TTS) model available on Hugging Face....

This AI Paper from Salesforce Introduces VLM2VEC and MMEB: A Contrastive Framework and Benchmark for Universal Multimodal Embeddings

Multimodal embeddings combine visual and textual data into a single representational space, enabling systems to understand and relate images and language meaningfully. These embeddings...

Complete Guide: Working with CSV/Excel Files and EDA in Python

This hands-on tutorial will walk you through the entire process of working with CSV/Excel files and conducting exploratory data analysis (EDA) in Python. We'll...

Google Releases Agent Development Kit (ADK): An Open-Source AI Framework Integrated with Gemini to Build, Manage, Evaluate and Deploy Multi Agents

Google has released the Agent Development Kit (ADK), an open-source framework aimed at making it easier for developers to build, manage, and deploy multi-agent...