Author: Tanya Malhotra

Tanya Malhotra
471 POSTS0 COMMENTS
Tanya Malhotra is a final year undergrad from the University of Petroleum & Energy Studies, Dehradun, pursuing BTech in Computer Science Engineering with a specialization in Artificial Intelligence and Machine Learning. She is a Data Science enthusiast with good analytical and critical thinking, along with an ardent interest in acquiring new skills, leading groups, and managing work in an organized manner.

IBM Researchers Introduce ST-WebAgentBench: A New AI Benchmark for Evaluating Safety and Trustworthiness in Web Agents

Large Language Model (LLM)--based online agents have significantly advanced in recent times, resulting in unique designs and new benchmarks that show notable improvements in...

How Large Language Models (LLMs) can Perform Multiple, Computationally Distinct In-Context Learning (ICL) Tasks Simultaneously

Large Language Models (LLMs) have demonstrated remarkable proficiency in In-Context Learning (ICL), which is a technique that teaches them to complete tasks using just...

Meet TurtleBench: A Unique AI Evaluation System for Evaluating Top Language Models via Real World Yes/No Puzzles

The need for efficient and trustworthy techniques to assess the performance of Large Language Models (LLMs) is increasing as these models are incorporated into...

Embodied Agent Interface: An AI Framework for Benchmarking Large Language Models (LLMs) for Embodied Decision Making

Large Language Models (LLMs) need to be evaluated within the framework of embodied decision-making, i.e., the capacity to carry out activities in either digital...

GORAM: A Graph-Oriented Data Structure that Enables Efficient Ego-Centric Queries on Federated Graphs with Strong Privacy Guarantees

Ego-centric searches are essential in many applications, from financial fraud detection to social network research, because they concentrate on a single vertex and its...

Google Cloud and Stanford Researchers Propose CHASE-SQL: An AI Framework for Multi-Path Reasoning and Preference Optimized Candidate Selection in Text-to-SQL

An essential bridge connecting human language and structured query languages (SQL) is text-to-SQL. With its help, users can convert their queries in normal language...

Refining Classifier-Free Guidance (CFG): Adaptive Projected Guidance for High-Quality Image Generation Without Oversaturation

Classifier-Free Guiding, or CFG, is a major factor in enhancing picture generation quality and guaranteeing that the output closely matches the input circumstances in...

Generative World Models for Enhanced Multi-Agent Decision-Making

Recent developments in generative models have paved the way for innovations in chatbots and picture production, among other areas. These models have demonstrated remarkable...

Enhancing Time-Series Analysis in Multimodal Models through Visual Representations for Richer Insights and Cost Efficiency

Multimodal foundation models, like GPT-4 and Gemini, are effective tools for a variety of applications because they can handle data formats other than text,...

Evaluating the Planning Capabilities of Large Language Models: Feasibility, Optimality, and Generalizability in OpenAI’s o1 Model

New developments in Large Language Models (LLMs) have shown how well these models perform sophisticated reasoning tasks like coding, language comprehension, and math problem-solving....

Optimizing Long-Context Processing with Role-RL: A Reinforcement Learning Framework for Efficient Large Language Model Deployment

Training Large Language Models (LLMs) that can handle long-context processing is still a difficult task because of data sparsity constraints, implementation complexity, and training...

Compositional Hardness in Large Language Models (LLMs): A Probabilistic Approach to Code Generation

A popular method when employing Large Language Models (LLMs) for complicated analytical tasks, such as code generation, is to attempt to solve the full...