Who am I?
Highly motivated graduate with expertise in statistics and quantitative psychology. Have a keen interest in modeling (e.g., adaptive methods, drift diffusion model, item response theory, and knowledge space theory). Research assistant with five years of experience in analyzing behavioral and fMRI data.
Precision and Robustness in Adaptive Testing
Dissertation: An IRT Analysis within an Online Convex Optimization Framework
Posted on June 1, 2025
| 1 minutes
| 106 words
| Hau-Hung Yang
My dissertation addresses the finite-sample performance of Computerized Adaptive Testing (CAT) algorithms by incorporating the Online Convex Optimization (OCO) framework. It focuses on two complementary themes. On the precision front, I demonstrate that classical information-maximization methods constitute no-regret algorithms under the OCO framework, and I construct finite-sample confidence intervals with anytime-valid properties using martingale concentration inequalities and confidence sequence theory. On the robustness front, I adapt the Upper Confidence Bound (UCB) algorithm from multi-armed bandit literature to design stability-oriented item selection strategies that account for the exploration-exploitation trade-off, addressing premature commitment bias from early-stage estimation error and performance degradation from uncertain item parameters during online calibration.
[Read More]
Labor Market Sorting in Taiwan
Wage inequality and firm-worker matching
Posted on March 1, 2025
| 1 minutes
| 67 words
| Hau-Hung Yang
Using comprehensive administrative data from Taiwan’s Ministry of Finance, we analyze wage inequality trends in Taiwan through the lens of the Abowd-Kramarz-Margolis (AKM) model. This framework allows us to decompose wages into worker fixed effects and firm fixed effects, revealing the extent to which the sorting of workers across firms contributes to overall wage dispersion. This paper has been accepted for publication in the Taiwan Economic Review.
Dog, Cat, Baby
Posted on September 1, 2024
| 2 minutes
| 223 words
| Hau-Hung Yang
In most advanced economies, economic prosperity has paradoxically been accompanied by a sustained decline in fertility rates. This pattern is also evident in Taiwan. At the same time, Taiwan has seen a steady increase in both the number of pet-owning households and the average expenditure per pet. These opposing trends give rise to a central research question: are pets substitutes for children, or do they function as complements?
Some scholars and observers have posited that pet ownership may foster the desire to establish a family and enter into marriage, potentially leading to higher fertility rates.
[Read More]
Household tax optimization
Posted on September 1, 2024
| 1 minutes
| 162 words
| Hau-Hung Yang
This study aims to examine the economic decision-making behavior within households, with a particular focus on whether individuals engage in intra-household communication to achieve decisions that are Pareto efficient—that is, outcomes in which no individual can be made better off without making another worse off. The research contrasts two behavioral scenarios: one in which household members coordinate and negotiate with the objective of maximizing collective welfare, and another in which individuals act independently, pursuing unilateral utility maximization without mutual consultation.
[Read More]
Extension of traditional random utility model
Posted on May 1, 2023
| 0 minutes
| 0 words
| Hau-Hung Yang
Evaluation of SE in IRT
methodology issue
Posted on March 1, 2023
| 1 minutes
| 104 words
| Hau-Hung Yang
We propose a precise and practical way to calculate standard errors of parameters in the Rasch model. Because the information function is the asymptotic variance of the maximum likelihood estimator (MLE), it may be far from the variance of MLE in a finite sample. We advocate the use of a plugin estimator of empirical variance rather than the information function to calculate the standard error. In the study, we bring forth a novel technique to simplify the computation involved in estimating the empirical variance, which lowers the cost of the estimation.
[Read More]
College preferences of high school student in Taiwan
Panel data, incomplete data
Posted on November 1, 2022
| 1 minutes
| 120 words
| Hau-Hung Yang
We look into high school students’ college preferences. In order to achieve this, we use the empirical data, students’ decision behavior throughout the application process for schools, and estimate the parameters of the utility model by maximum likelihood estimation. Due to the fact that every student considers a variety of options when making a decision, the empirical data is nonidentical, which complicates the likelihood function. In our research, we need to estimate more than 2300 parameters from the large and nonidentical samples (240000 students).
[Read More]
Identifiability of SEM
Posted on October 1, 2022
| 2 minutes
| 227 words
| Hau-Hung Yang
The family of polychoric models (PM) considers ordinal data as categorization of latent multivariate normal variables. Such framework is commonly used to study the association between ordinal variables, often leading to the polychoric correlation model (PCM). Moreover, PM subsumes several psychometric models, such as the graded response model (Samejima, 1968; 1997). However, the property of identifiability of PM has not been addressed in the literature. To make the issue more complicated, the normality assumption underlying PM has been challenged recently; researchers have suggested that the latent variables underlying PM could be generalized to elliptical distributions.
[Read More]
Adaptive Testing under item response theory
Application to Online platform, PaGamO
Posted on September 1, 2022
| 1 minutes
| 107 words
| Hau-Hung Yang
In this study, we work together with the educational platform PaGamO to create a workable E-Learning system. The purpose of this study is to enhance rural kids’ academic performance and support teachers in assessing their students’ progress. In the platform, the problems answered by students are randomly selected from a large item bank that contains more than one million items. And students only respond to a small proportion of items. As a result, the data is sparse, but the sample size is large (more than 100 thousand students).
[Read More]