Data & Adaptive Intelligence Systems Lab
DAIS Lab, Korea University
Research Area
We are interested in broad topics in Data Science (DS) and Artificial Intelligence (AI). We identify real-world challenges with significant practical impacts and address them through DS/AI methodologies by leveraging {Evolving, (Un)structured data} × {Foundation models} × {Human-curated knowledge}.
Evolving Data
- Anomaly & drift detection KDD24, WWW24, KDD22, SIGMOD21, KDD20, VLDB19
- Time-series analysis WWW26, KDD25, NeurIPS24, WWW24, ICML23, ICLR22, MiLeTS19
- Streaming text & event processing KDD26, SIGIR23, WWW23, MiLeTS19, ICDE19
(Un)structured Data
- Graph & network mining SIGMOD26, WSDM26, CIKM24
- Spatio-temporal networks KBS25, KDD25, ICWSM25, TITS23, ICDM22
- Tabular data understanding ICML26, EMNLP25, SIGIR25
Foundation Models
- Pretraining & feature engineering WSDM26, EMNLP25, SIGIR25
- Prompt tuning & continual learning KDD26, SIGIR26, ICML24
- Retrieval & augmentation KDD26, ICML26, ACL26, SIGIR26
Human-curated Knowledge
- Taxonomy & topic discovery NC26, ACL26, SIGIR26, ACL23, EMNLP22, WWW22
- Societal & behavioral analysis TCSS25, ICDM22, AAAI22, CHI16
- Weak-supervision & summarization EMNLP23, WWW23
Publications
-
Can Structural Cues Save LLMs? Evaluating Language Models in Massive Document Streams
KDD26 | ACM SIGKDD Conference on Knowledge Discovery and Data Mining, August 2026
-
CREAM: Continual Retrieval on Dynamic Streaming Corpora with Adaptive Soft Memory
KDD26 | ACM SIGKDD Conference on Knowledge Discovery and Data Mining, August 2026
-
Segment-driven Structural Induction and Semantic Alignment for Heterogeneous Tabular Representation
ICML26 | International Conference on Machine Learning, July 2026
-
Breaking the Reference Bottleneck via Learning to Rewrite Conversational Queries without Gold Reference
Passages
ICML26 | International Conference on Machine Learning, July 2026To appear
-
MUDY: Multi-Granular Dynamic Candidate Contextualization for Unsupervised Keyphrase Extraction
SIGIR26 | ACM SIGIR Conference on Research and Development in Information Retrieval, July 2026
-
SPRINT: Scalable and Predictive Intent Refinement for LLM-Enhanced Session-based Recommendation
SIGIR26 | ACM SIGIR Conference on Research and Development in Information Retrieval, July 2026
-
Why These Documents? Explainable Generative Retrieval with Hierarchical Category Paths
ACL26 (Findings) | Annual Meeting of the Association for Computational Linguistics, July 2026
-
Back to the Future: Look-ahead Augmentation and Parallel Self-Refinement for Time Series Forecasting
WWW26 (Short) | ACM The Web Conference, June 2026
-
AI-Driven Text Mining of the Female Reproductive System: Enabling Multiscale Biomedical Modeling and Personalized
Medicine
Nano Convergence, May 2026 (SCI(E), IF: 11)
-
LMSC: Local Sketch Modularity Optimisation for Size-Constrained Community Search in Networks
SIGMOD26 | ACM Conference on Management of Data, May 2026
-
Metadata Meets LLMs: Constructing Knowledge-Rich Citation Networks with CoT-Enhanced Representations
WSDM26 (Short) | ACM Conference on Web Search and Data Mining, February 2026
-
Sequence-aware Adaptive Graph Convolutional Recurrent Networks for Traffic Forecasting
KBS25 | Knowledge-Based Systems, November 2025
-
Multi-level Diagnosis and Evaluation for Robust Tabular Feature Engineering with Large Language Models
EMNLP25 (Findings) | Conference on Empirical Methods in Natural Language Processing, November 2025
-
Bi-Modal Learning for Networked Time Series
KDD25 | ACM SIGKDD Conference on Knowledge Discovery and Data Mining, August 2025
-
An Analysis of City Image by Exploiting Social Media: Toward a Deeper Understanding of Multiple Characteristics
and Their Temporal Changes using Machine Learning
TCSS25 | IEEE Transactions on Computational Social Systems, July 2025 (SCI(E), IF: 4.9)
-
HAETAE: In-domain Table Pretraining with Header Anchoring
SIGIR25 (Short) | ACM SIGIR Conference on Research and Development in Information Retrieval, July 2025
-
Mobility Networked Time-Series Forecasting Benchmark Datasets
ICWSM25 | AAAI International Conference on Web and Social Media, June 2025
-
Exploiting Representation Curvature for Boundary Detection in Time Series
NeurIPS24 | Conference on Neural Information Processing Systems, December 2024
-
Flexi-clique: Exploring Flexible and Sub-linear Clique Structures
CIKM24 (Short) | ACM Conference on Information and Knowledge Management, October 2024
-
Online Drift Detection with Maximum Concept Discrepancy
KDD24 | ACM SIGKDD Conference on Knowledge Discovery and Data Mining, August 2024
-
One Size Fits All for Semantic Shifts: Adaptive Prompt Tuning for Continual Learning
ICML24 | International Conference on Machine Learning, July 2024
-
Breaking the Time-Frequency Granularity Discrepancy in Time-Series Anomaly Detection
WWW24 | ACM The Web Conference, May 2024
-
MEGClass: Text Classification with Extremely Weak Supervision via Mutually-Enhancing Text Granularities
EMNLP23 (Findings) | Conference on Empirical Methods in Natural Language Processing, December 2023
-
DynaMiTE: Discovering Explosive Topic Evolutions with User Guidance
ACL23 (Findings) | Annual Meeting of the Association for Computational Linguistics, July 2023
-
Context Consistency Regularization for Label Sparsity in Time Series
ICML23 | International Conference on Machine Learning, July 2023
-
Unsupervised Story Discovery from Continuous News Streams via Scalable Thematic Embedding
SIGIR23 | ACM SIGIR Conference on Research and Development in Information Retrieval, July 2023
-
PDSum: Prototype-driven Continuous Summarization of Evolving Multi-document Sets Stream
WWW23 | ACM The Web Conference, April 2023
-
SCStory: Self-supervised and Continual Online Story Discovery
WWW23 | ACM The Web Conference, April 2023
-
MG-TAR: Multi-view Graph Convolutional Networks for Traffic Accident Risk Prediction
TITS23 | IEEE Transactions on Intelligent Transportation Systems, 2023 (SCI(E), IF: 8.4)
-
Topic Taxonomy Expansion via Hierarchy-Aware Topic Phrase Generation
EMNLP22 (Findings) | Conference on Empirical Methods in Natural Language Processing, December 2022
-
Multi-view POI-level Cellular Trajectory Reconstruction for Digital Contact Tracing of Infectious Diseases
ICDM22 (Short) | IEEE International Conference on Data Mining, November 2022
-
Adaptive Model Pooling for Online Deep Anomaly Detection from a Complex Evolving Data Stream
KDD22 | ACM SIGKDD Conference on Knowledge Discovery and Data Mining, August 2022
-
Coherence-based Label Propagation over Time Series for Accelerated Active Learning
ICLR22 | International Conference on Learning Representations, April 2022
-
TaxoCom: Topic Taxonomy Completion with Hierarchical Discovery of Novel Topic Clusters
WWW22 | ACM The Web Conference, April 2022
-
COVID-EENet: Predicting Fine-Grained Impact of COVID-19 on Local Economies
AAAI22 | AAAI Conference on Artificial Intelligence, February 2022
-
Multiple Dynamic Outlier-Detection from a Data Stream by Exploiting Duality of Data and Queries
SIGMOD21 | ACM Conference on Management of Data, June 2021
-
Ultrafast Local Outlier Detection from a Data Stream with Stationary Region Skipping
KDD20 | ACM SIGKDD Conference on Knowledge Discovery and Data Mining, August 2020
-
NETS: Extremely Fast Outlier Detection from a Data Stream via Set-Based Processing
VLDB19 | International Conference on Very Large Data Bases, August 2019
-
MLAT: Metric Learning for kNN in Streaming Time Series
MiLeTS19 (KDD Workshop) | Workshop on Mining and Learning from Time Series, August 2019
-
CEP-Wizard: Automatic Deployment of Distributed Complex Event Processing
ICDE19 (Demo) | IEEE International Conference on Data Engineering, April 2019
-
Social or Financial Goals? Comparative Analysis of User Behaviors in Couchsurfing and Airbnb
CHI16 (Late-Breaking Work) | ACM Conference on Human Factors in Computing Systems, May 2016