-
4181.ADAM:云规模计算的基因组格式及处理模式
[信息传输、软件和信息技术服务业] [2014-03-18]
Current genomics data formats and processing pipelines are not designed to scale well to large datasets. The current Sequence/Binary Alignment/Map (SAM/BAM) formats were intended for single node processing. There have been attempts to adapt BAM to distributed computing environments, but they see limited scalability past eight nodes. Additionally, due to the lack of an explicit data schema, there are well known incompatibilities between libraries that implement SAM/BAM/Variant Call Format (VCF) data access. To address these problems, we introduce ADAM, a set of formats, APIs, and processing stage implementations for genomic data.
关键词:ADAM;基因组学;云计算;数据存储
-
4182.修改及补充现有图表以提升可用性技术研究
[信息传输、软件和信息技术服务业] [2014-03-18]
In order to modify charts, we need access to the locations of the marks in the chart and the underlying data. We present algorithms that automatically extract this information from raster bar and pie charts that obey some common assumptions. Using a corpus of images drawn from the web, these algorithms successfully extract marks from 79% of bar charts and 62% of pie charts, and from these charts they successfully extract the data from 71% of bar charts and 64% of pie charts. We then present an application that uses the extracted marks and data to present a gallery of redesigns. Next, we tackle the problem of customizing existing visualizations to best support a viewer’s goal. We introduce graphical overlays—visual elements that are layered onto charts to facilitate a larger set of chart reading tasks.
关键词:可视化;众包;图表;文本;认知模型
-
4183.社区蜂窝网络
[信息传输、软件和信息技术服务业] [2014-03-18]
To bring network connectivity to these remaining hundreds of millions of people, we introduce the concept of community cellular networks: small-scale, locally operated networks independent from traditional telecommunication firms. In support of these networks, we also introduce virtual coverage, a novel power saving mechanism in GSM networks.
关键词:蜂窝网络;虚拟覆盖;网络设置;社区通信
-
4184.数据交换问题:算法和复杂度
[信息传输、软件和信息技术服务业] [2014-03-18]
In this thesis we study the data exchange problem where a set of users is interested in gaining access to a common file, but where each has only partial knowledge about it as sideinformation.
关键词:数据交换;线性分组模型;广义关联;扩展
-
4185.通过傅立叶透镜观察图结构数据
[信息传输、软件和信息技术服务业] [2014-03-18]
In this work, we start by reviewing the notion of a Graph Fourier Transform (GFT), which has been defined in the literature for graph signals. We examine the spatial and spectral features of circulant graphs, which accommodate linear shift-invariant operations. We describe fundamental operations such as shifting, sampling, graph-reconnection and linear filtering for signals on circulant graphs and derive associated sampling and graph-reconnection theorems. We also develop wavelet filter bank structures for multi resolution analysis of large-scale graphs. We present a method to decompose an arbitrary graph into a linear combination of circulant graphs.
关键词:傅里叶透镜;图结构数据;循环图;数据处理;GFT
-
4186.参数恒定的专家数目在基于对数损耗的在线学习中极大极小最优问题
[信息传输、软件和信息技术服务业] [2014-03-18]
We focus on the minimax regret, which is the regret of the strategy with the minimum of the worst-case regret over outcome sequences.
关键词:在线学习;最大似然估计;在线预测;极大极小最优
-
4187.高生产率、高效率和可移植的并行计算的框架设计
[信息传输、软件和信息技术服务业] [2014-03-18]
In this dissertation, we present a software environment that aims to bridge the implementation gap and enable application writers to productively utilize parallel hardware by reusing the work of efficiency programmers.
关键词:并行计算;PyCASP;框架
-
4188.视觉偏差校正的光场显示计算
[信息传输、软件和信息技术服务业] [2014-03-18]
In this work, we introduce a new computation based aberration-correcting light field display: by incorporating the persons own optical aberration into the computation, we alter the content shown on the display, such that he or she will be able to see it in sharp focus without wearing eyewear. We analyze the image formation models; through the retinal light field projection, we find it is possible to compensate for the optical blurring on the target image by prefiltering with the inverse blur.
关键词:视觉偏差;偏差校正;光场
-
4189.PULSE:稀疏信号估计中基于剥离的超低复杂度算法
[信息传输、软件和信息技术服务业] [2014-03-18]
In this thesis, we consider the problem of computing a sparse Discrete-Fourier-Transform of a high-dimensional signal from its timedomain samples, as a representative example of compressed-sensing problems. We use this problem to investigate the tradeo between the number of measurements, noise robustness, and the computational complexity of the recovery algorithm in compressed sensing problems.
关键词:稀疏信号;算法;压缩传感;离散傅立叶变换
-
4190.高速存取自动点击与搜索劫持点击欺诈模块
[信息传输、软件和信息技术服务业] [2014-03-18]
In this report, we fill in some of these gaps by analyzing the “auto-clicking” and “search-hijacking” modules that drive most of ZeroAccess’s revenue creation. Using a combination of code analysis and empirical measurement, we document the distinct command and control protocols used by each module, the infrastructure they use, and how they operate to defraud online advertisers.
关键词:网络广告;点击欺诈;恶意软件;僵尸网络;自动点击组件;搜索劫持组件