关键词:概率;统计数据;算法;大型数据集
摘 要:The increasing size of our datasets|and perhaps more importantly, the increasing complexity of the underlying distributions that we hope to understand|are exposing issues that seem to demand computational consideration. In this dissertation, we apply the computational perspective to three basic statistical questions which underlie and abstract several of the challenges encountered in the analysis of today's large datasets.