July 2007

Jul
26

Feature of the Week: Automatic Detection of Scrapeable Content

Related posts:Automatic visual inspection and defect detection on Variable Data Prints...Skin-Sensitive Automatic Color Correction...Detection of Textured Areas in Images Using a Disorganization Indicator Based on Component Counts...Object Detection in Video Streams Using Staggered Sampling...PaperDiff: A Script Independent Automatic Method for Finding The Text Differences Between Two Document Images...Network Worm Detection using Markov’s and Cantelli’s Inequalities...

Jul
24

Offline/Realtime Traffic Classification Using Semi-Supervised Learning

HPL-2007-121 Offline/Realtime Traffic Classification Using Semi-Supervised Learning - Erman, Jeffrey; Mahanti, Anirban; Arlitt, Martin; Cohen, Ira; Williamson, Carey
Keyword(s): traffic classification; semi-supervised learning; clustering
Abstract: Identifying and categorizing network traffic by application type is challenging because of the continued evolution of applications, especially of those with a desire to be undetectable. The diminished effectiveness of port-based identification and the overheads of deep packet inspection approaches m ...
Full Report Related posts:On semi-supervised learning and sparsity...On semi-supervised learning and sparsity...A Framework based on Semi-Supervised Clustering for Discovering Unique Writing Styles...A Novel Traffic Analysis for Identifying Search Fields in the Long Tail of Web Sites...BNS Scaling: An Improved Representation over TF.IDF for SVM Text Classification...Quantifying Counts, Costs, and Trends Accurately via Machine Learning...

Jul
24

YouTube Traffic Characterization: A View From the Edge

HPL-2007-119 YouTube Traffic Characterization: A View From the Edge - Gill, Phillipa, Arlitt, Martin; Li, Zongpeng; Mahanti, Anirban
Keyword(s): traffic characterization; YouTube; Web 2.0; caching
Abstract: This paper presents a traffic characterization study of the popular video sharing service, YouTube. Over a three month period we observed almost 25 million transactions between users on an edge network and YouTube, including more than 600,000 video downloads. We also monitored the globally popular v ...
Full Report Related posts:Offline/Realtime Traffic Classification Using Semi-Supervised Learning...Dynamic characterization of a large Web graph...Sum Rate Characterization of Joint Multiple Cell-Site Processing...Characterization of Noise in Digital Photographs for Image Processing...EtherApe...Force-Directed Edge Bundling for Graph Visualization...

Jul
24

Capacity Management and Demand Prediction for Next Generation Data Centers

HPL-2007-116 Capacity Management and Demand Prediction for Next Generation Data Centers - Gmach, Daniel; Rolia, Jerry; Cherkasova, Ludmila; Kemper, Alfons
Keyword(s): capacity management; next generation data centers; performance models; measurements; workload analysis; automation; enterprise applications; shared resource pools
Abstract: Advances in server, network, and storage virtualization are enabling the creation of resource pools of servers that permit multiple application workloads to share each server in the pool. This paper proposes and evaluates aspects of a capacity management process for automating the efficient use of s ...
Full Report Related posts:Water Efficiency Management in Datacenters (Part I): Introducing a water usage metric based on available energy consumption...Capacity and Performance Overhead in Dynamic Resource Allocation to Virtual Containers...JustRunIt: Experiment-Based Management of Virtualized Data Centers...vManage: Loosely Coupled Platform and Virtualization Management in Data Centers...Profiling Sustainability of Data Centers...A Dollar from 15 Cents: Cross-Platform Management for Internet Services...

Jul
24

“Merolyn the Phone”: A study of Bluetooth naming practices

HPL-2007-115 "Merolyn the Phone": A study of Bluetooth naming practices - Kindberg, Tim; Jones, Timothy
Keyword(s): bluetooth; electronic identity; naming; mobile phones
Abstract: This paper reports the results of an in-depth study of Bluetooth naming practices which took place in the UK in August 2006. There is a significant culture of giving Bluetooth names to mobile phones in the UK, and this paper's main contribution is to provide an account of those Bluetooth naming prac ...
Full Report Related posts:I, Me and My Phone: Identity and Personalization using Mobile Devices...Color naming: color scientists do it between Munsell Sheets of Color...Color naming: color scientists use Munsell Sheets of Color...“My iPod is my Pacifier”: An Investigation on the Everyday Practices of Mobile Video Consumption...Workshop: Tinkering, Tailoring, & Mashing: The Social and Collaborative Practices of the Read-Write Web...Nearby on Your Phone...

Jul
24

Mediascapes: Context-Aware Multimedia Experiences

HPL-2007-113 Mediascapes: Context-Aware Multimedia Experiences - Stenton, S. Philip; Wee, Susie; Hull, Richard; Goddi, Patrick M.; Reid, Josephine E.; Clayton, Ben J.C.; Melamed, Tom J.
Keyword(s): No keywords available.
Abstract: No abstract available. ...
Full Report Related posts:Chromatic Association: Context-Aware Device Association...Context-Aware Privacy Design Pattern Selection...Multimedia Experience on Web-Connected CE Devices...Using GPS to Attach Real World Coordinates to Maps...On Identity-Aware Devices: Putting Users in Control across Federated Services...A Search Engine Index for Multimedia Content...

Jul
24

Using GPS to Attach Real World Coordinates to Maps

HPL-2007-112 Using GPS to Attach Real World Coordinates to Maps - Melamed, Tom; Clayton, Ben
Keyword(s): GPS; locative media; context sensitive; mediascape; mscape; map; coordinate
Abstract: This paper discusses the requirements for map images such that they are suitable for the construction of location-based services and applications such as mediascapes. We then detail a specific class of maps that satisfy many of these requirements but may lack coordinate information. We show that fin ...
Full Report Related posts:A Real-Time Expectation Maximization Algorithm for Acquiring Multi-Planar Maps of Indoor Environments with Mobile Robots...Viral marketing in the real world...Flickr In The Real World – Instant Fave!...The Geotaggers’ World Atlas...Typographic Links...Paris Transportation Maps...

Jul
24

Endless Documents: a Publication as a Continual Function

HPL-2007-111 Endless Documents: a Publication as a Continual Function - Lumley, John; Gimson, Roger; Rees, Owen
Keyword(s): XML; XSLT; SVG; document construction; functional programming
Abstract: Variable data documents can be considered as functions of their bindings to values. The Document Description Framework (DDF) treats documents in this manner, using XSLT semantics to describe document functionality and a variety of related mechanisms to support layout, reference and so forth. But the ...
Full Report Related posts:Endless Documents: a Publication as a Continual Function...Cascaded Dynamic Templates for Active Documents...A Semantic Wiki for Continual Collaborative Information Management...On the rate distortion function of Bernoulli Gaussian sequences...Xebece...From XML Inclusions to XML Transclusions...

Jul
24

Ingestion Pipeline for RDF

HPL-2007-110 Ingestion Pipeline for RDF - Bhatia, Nipun; Seaborne, Andy
Keyword(s): ingestion pipeline; validation of RDF; inferencing; large RDF datasets
Abstract: In this report we present the design and implementation of an ingestion pipeline for RDF Datasets. Our definition of ingestion subsumes: validation and inferencing. The design proposed performs these tasks without loading the data in-memory. There are several reasoners and Lint like validators avail ...
Full Report Related posts:Denoising Scheme for Realistic Digital Photos from Unknown Sources...Characterization of Noise in Digital Photographs for Image Processing...

Jul
24

A Feature based on Encoding the Relative Position of a Point in the Character for Online Handwritten Character Recognition

HPL-2007-109 A Feature based on Encoding the Relative Position of a Point in the Character for Online Handwritten Character Recognition - Mandalapu, Dinesh; Murali Krishna, Sridhar
Keyword(s): shape contexts; features; handwriting recognition
Abstract: Feature extraction is a very important step in the process of character recognition. The features extracted from the character should encode the local, global and the structural characteristics of the character shape. In this paper we propose a new feature for recognition of online handwritten chara ...
Full Report Related posts:A Framework for Adaptation of the Active-DTW Classifier for Online Handwritten Character Recognition...A Skew-tolerant Strategy and Confidence Measure for k-NN Classification of Online Handwritten Characters...A Framework based on Semi-Supervised Clustering for Discovering Unique Writing Styles...Elastic Matching of Online Handwritten Tamil and Telugu Scripts Using Local Features...Feature of the Week: Character encoding of imported files...Machine Recognition of Online Handwritten Devanagari Characters...