Paper-Conference

CLUE: Conflict-guided Localization for LLM Unlearning Framework

The LLM unlearning aims to eliminate the influence of undesirable data without affecting causally unrelated information. This process typically involves using a forget set to …

hang-chen-jiaying-zhu-xinyu-yang-wenya-wang

• Jan 26, 2026 • 1 min read

Large Language Models

Skill Path: Unveiling Language Skills from Circuit Graphs

Circuit graph discovery has emerged as a fundamental approach to elucidating the skill mechanistic of language models. Despite the output faithfulness of circuit graphs, they …

hang-chen-xinyu-yang-jiaying-zhu-wenya-wang

• Jan 1, 2026 • 1 min read

Large Language Models

Rethinking Circuit Completeness in Language Models: AND, OR, and ADDER Gates

Circuit discovery has gradually become one of the prominent methods for mechanistic interpretability, and research on circuit completeness has also garnered increasing attention. …

hang-chen-jiaying-zhu-xinyu-yang-wenya-wang

• Dec 15, 2025 • 1 min read

Large Language Models

Quantifying Semantic Emergence in Language Models

Large language models (LLMs) are widely recognized for their exceptional capacity to capture semantics meaning. Yet, there remains no established metric to quantify this …

hang-chen-xinyu-yang-jiaying-zhu-wenya-wang

• Jul 1, 2025 • 1 min read

Large Language Models

Debiasing the Fine-Grained Classification Task in LLMs with Bias-Aware PEFT

Fine-grained classification via LLMs is susceptible to more complex label biases compared to traditional classification tasks. Existing bias mitigation strategies, such as …

daiying-zhao-xinyu-yang-hang-chen

• Jul 1, 2025 • 1 min read

Sentiment Analysis

How to enhance causal discrimination of utterances: A case on affective reasoning

Our investigation into the Affective Reasoning in Conversation (ARC) task highlights the challenge of causal discrimination. Almost all existing models, including large language …

hang-chen-xinyu-yang-jing-luo-wenjing-zhu

• Dec 1, 2023 • 1 min read

No results found

Paper-Conference

CLUE: Conflict-guided Localization for LLM Unlearning Framework

Skill Path: Unveiling Language Skills from Circuit Graphs

Rethinking Circuit Completeness in Language Models: AND, OR, and ADDER Gates

Quantifying Semantic Emergence in Language Models

Debiasing the Fine-Grained Classification Task in LLMs with Bias-Aware PEFT

How to enhance causal discrimination of utterances: A case on affective reasoning