Research Intern - AIISC, University of South Carolina
• Developed the Alignment Quality Index (AQI), a contrastive, layer-attentive metric leveraging CKA-driven introspection to detect alignment-relevant representations beyond over-smoothed final embeddings. • Mitigated final-layer over-smoothing by optimizing layerwise weights to maximize inter-class separation in latent space between safe and unsafe completions - Work submitted to EMNLP’25