Search Results for author: Kevin Zhu

Found 34 papers, 6 papers with code

FAIRE: Assessing Racial and Gender Bias in AI-Driven Resume Evaluations

1 code implementation2 Apr 2025 Athena Wen, Tanush Patil, Ansh Saxena, Yicheng Fu, Sean O'Brien, Kevin Zhu

In an era where AI-driven hiring is transforming recruitment practices, concerns about fairness and bias have become increasingly important.

Fairness

CLEAR: Contrasting Textual Feedback with Experts and Amateurs for Reasoning

no code implementations24 Mar 2025 Andrew Rufail, Daniel Kim, Sean O'Brien, Kevin Zhu

We introduce CLEAR (Contrasting Textual Feedback with Experts and Amateurs for Reasoning), a novel approach to language model reasoning that leverages the strengths of a larger (expert) model and smaller (amateur) model.

Language Modeling Language Modelling +1

EnDive: A Cross-Dialect Benchmark for Fairness and Performance in Large Language Models

no code implementations25 Feb 2025 Abhay Gupta, Jacob Cheung, Philip Meng, Shayan Sayyed, Austen Liao, Kevin Zhu, Sean O'Brien

The diversity of human language, shaped by social, cultural, and regional influences, presents significant challenges for natural language processing (NLP) systems.

Diversity Fairness +2

Pause-Tuning for Long-Context Comprehension: A Lightweight Approach to LLM Attention Recalibration

no code implementations1 Feb 2025 James Begin, Namit Agrawal, Eshan Singh, Yicheng Fu, Sean O'Brien, Vasu Sharma, Kevin Zhu

LLMs have demonstrated remarkable proficiency in understanding tasks but continue to struggle with long-context comprehension, particularly with content located in the middle of extensive inputs.

Enhancing Knowledge Distillation for LLMs with Response-Priming Prompting

1 code implementation18 Dec 2024 Vijay Goyal, Mustafa Khan, Aprameya Tirupati, Harveer Saini, Michael Lam, Kevin Zhu

We find that Ground Truth prompting results in a 55\% performance increase on GSM8K for a distilled Llama 3. 1 8B Instruct compared to the same model distilled without prompting.

GSM8K Knowledge Distillation +1

Pragmatic Metacognitive Prompting Improves LLM Performance on Sarcasm Detection

no code implementations4 Dec 2024 Joshua Lee, Wyatt Fong, Alexander Le, Sur Shah, Kevin Han, Kevin Zhu

Sarcasm detection is a significant challenge in sentiment analysis due to the nuanced and context-dependent nature of verbiage.

Sarcasm Detection Sentiment Analysis

Turkey's Earthquakes: Damage Prediction and Feature Significance Using A Multivariate Analysis

no code implementations29 Oct 2024 Shrey Shah, Alex Lin, Scott Lin, Josh Patel, Michael Lam, Kevin Zhu

Accurate damage prediction is crucial for disaster preparedness and response strategies, particularly given the frequent earthquakes in Turkey.

Introducing MAPO: Momentum-Aided Gradient Descent Prompt Optimization

no code implementations25 Oct 2024 Anthony Cui, Pranav Nandyalam, Ethan Cheung, Kevin Zhu

Momentum-Aided Prompt Optimization (MAPO) enhances the efficiency and efficacy of prompt optimization for Large Language Models (LLMs).

Prompt Engineering

A Debate-Driven Experiment on LLM Hallucinations and Accuracy

no code implementations25 Oct 2024 Ray Li, Tanishka Bagade, Kevin Martinez, Flora Yasmin, Grant Ayala, Michael Lam, Kevin Zhu

Large language models (LLMs) have achieved a degree of success in generating coherent and contextually relevant text, yet they remain prone to a significant challenge known as hallucination: producing information that is not substantiated by the input or external knowledge.

Fact Checking Hallucination +2

ChunkRAG: Novel LLM-Chunk Filtering Method for RAG Systems

no code implementations25 Oct 2024 Ishneet Sukhvinder Singh, Ritvik Aggarwal, Ibrahim Allahverdiyev, Muhammad Taha, Aslihan Akalin, Kevin Zhu, Sean O'Brien

Retrieval-Augmented Generation (RAG) systems using large language models (LLMs) often generate inaccurate responses due to the retrieval of irrelevant or loosely related information.

Chunking Fact Checking +3

Medical Imaging Complexity and its Effects on GAN Performance

1 code implementation23 Oct 2024 William Cagas, Chan Ko, Blake Hsiao, Shryuk Grandhi, Rishi Bhattacharya, Kevin Zhu, Michael Lam

The proliferation of machine learning models in diverse clinical applications has led to a growing need for high-fidelity, medical image training data.

Image Generation

Spiking Neural Networks as a Controller for Emergent Swarm Agents

no code implementations21 Oct 2024 Kevin Zhu, Connor Mattson, Shay Snyder, Ricardo Vega, Daniel S. Brown, Maryam Parsa, Cameron Nowzari

To do this, we use evolutionary algorithms to evolve not only the parameters (the weights, biases, and delays) of a spiking neural network, but also its structure.

Evolutionary Algorithms

QIANets: Quantum-Integrated Adaptive Networks for Reduced Latency and Improved Inference Times in CNN Models

1 code implementation14 Oct 2024 Zhumazhan Balapanov, Vanessa Matvei, Olivia Holmberg, Edward Magongo, Jonathan Pei, Kevin Zhu

Convolutional neural networks (CNNs) have made significant advances in computer vision tasks, yet their high inference times and latency often limit real-world applicability.

Model Compression Tensor Decomposition

Semantic Self-Consistency: Enhancing Language Model Reasoning via Semantic Weighting

no code implementations10 Oct 2024 Tim Knappe, Ryan Li, Ayush Chauhan, Kaylee Chhua, Kevin Zhu, Sean O'Brien

Standard methods based on this framework aggregate the final decisions of these rationales but fail to utilize the semantic information detailed in the step-by-step reasoning paths.

Language Modeling Language Modelling

Fine-Tuning Language Models for Ethical Ambiguity: A Comparative Study of Alignment with Human Responses

no code implementations10 Oct 2024 Pranav Senthilkumar, Visshwa Balasubramanian, Prisha Jain, Aneesa Maity, Jonathan Lu, Kevin Zhu

Significant improvements were observed after fine-tuning, with notable enhancements in both cross-entropy and Dirichlet scores, particularly in the latter.

Moral Scenarios

NusaMT-7B: Machine Translation for Low-Resource Indonesian Languages with Large Language Models

no code implementations10 Oct 2024 William Tan, Kevin Zhu

Large Language Models (LLMs) have demonstrated exceptional promise in translation tasks for high-resource languages.

Machine Translation NMT +2

Time-series Crime Prediction Across the United States Based on Socioeconomic and Political Factors

no code implementations1 Sep 2024 Patricia Dao, Jashmitha Sappa, Saanvi Terala, Tyson Wong, Michael Lam, Kevin Zhu

Traditional crime prediction techniques are slow and inefficient when generating predictions as crime increases rapidly \cite{r15}.

Crime Prediction Prediction +1

AAVENUE: Detecting LLM Biases on NLU Tasks in AAVE via a Novel Benchmark

no code implementations27 Aug 2024 Abhay Gupta, Philip Meng, Ece Yurtseven, Sean O'Brien, Kevin Zhu

Detecting biases in natural language understanding (NLU) for African American Vernacular English (AAVE) is crucial to developing inclusive natural language processing (NLP) systems.

Language Modeling Language Modelling +2

Intraoperative Glioma Segmentation with YOLO + SAM for Improved Accuracy in Tumor Resection

no code implementations27 Aug 2024 Samir Kassam, Angelo Markham, Katie Vo, Yashas Revanakara, Michael Lam, Kevin Zhu

Preoperative Magnetic Resonance Imaging (MRI) images are often ineffective during surgery due to factors such as brain shift, which alters the position of brain structures and tumors.

Brain Tumor Segmentation Segmentation +1

Enhancing Depression Diagnosis with Chain-of-Thought Prompting

no code implementations26 Aug 2024 Elysia Shi, Adithri Manda, London Chowdhury, Runeema Arun, Kevin Zhu, Michael Lam

When using AI to detect signs of depressive disorder, AI models habitually draw preemptive conclusions.

Improving Water Quality Time-Series Prediction in Hong Kong using Sentinel-2 MSI Data and Google Earth Engine Cloud Computing

no code implementations26 Aug 2024 Rohin Sood, Kevin Zhu

Effective water quality monitoring in coastal regions is crucial due to the progressive deterioration caused by pollution and human activities.

Cloud Computing Time Series +1

Chain-of-Thought Augmentation with Logit Contrast for Enhanced Reasoning in Language Models

no code implementations4 Jul 2024 Jay Shim, Grant Kruttschnitt, Alyssa Ma, Daniel Kim, Benjamin Chek, Athul Anand, Kevin Zhu, Sean O'Brien

Rapidly increasing model scales coupled with steering methods such as chain-of-thought prompting have led to drastic improvements in language model reasoning.

Language Modeling Language Modelling

Improving LLM Abilities in Idiomatic Translation

no code implementations3 Jul 2024 Sundesh Donthi, Maximilian Spencer, Om Patel, Joon Doh, Eid Rodan, Kevin Zhu, Sean O'Brien

Our research performs translations using two methods: The first method employs the SentenceTransformers model to semantically generate cosine similarity scores between the meanings of the original and target language idioms, selecting the best idiom (Cosine Similarity method).

Translation

Atom-by-atom protein generation and beyond with language models

no code implementations16 Aug 2023 Daniel Flam-Shepherd, Kevin Zhu, Alán Aspuru-Guzik

However, they are constrained to generate proteins with only the set of amino acids represented in their vocabulary.

FenceNet: Fine-grained Footwork Recognition in Fencing

no code implementations20 Apr 2022 Kevin Zhu, Alexander Wong, John McPhee

FenceNet takes 2D pose data as input and classifies actions using a skeleton-based action recognition approach that incorporates temporal convolutional networks to capture temporal information.

Action Recognition Feature Engineering +1

Keeping it Simple: Language Models can learn Complex Molecular Distributions

1 code implementation6 Dec 2021 Daniel Flam-Shepherd, Kevin Zhu, Alán Aspuru-Guzik

In this work, we investigate the capacity of simple language models to learn distributions of molecules.

Language Modelling

Understanding Childhood Vulnerability in The City of Surrey

no code implementations25 Mar 2019 Cody Griffith, Varoon Mathur, Catherine Lin, Kevin Zhu

Understanding the community conditions that best support universal access and improved childhood outcomes allows ultimately to improve decision-making in the areas of planning and investment across the early stages of childhood development.

Decision Making

Cannot find the paper you are looking for? You can Submit a new open access paper.