Bo Sheng

PhD Candidate

Department of Computer Science
College of William and Mary

Home Research Publications Contact Links

My major research work resides in sensor networks and RFID systems, targeting efficiency and security problems. Here briefly describes some research projects that I have been involved in.

  • Data Storage in Sensor Networks
  • Security and Privacy in Sensor Networks
  • Data Mining in Sensor Networks
  • Security and Privacy in RFID Systems
  • Data Retrieval in RFID Systems
  • Other Research
  • Data Storage in Sensor Networks

    A sensor network often generates a large amount of raw data, among which only a small portion is desired as the reply to a user's query. A fundamental issue in this query/reply model is how to store numerous raw data and efficiently respond to a user's query. My research proposed a novel idea of deploying powerful sensors with large storage capacity, called storage nodes, to support in-network storage mechanism, where storage nodes are responsible for holding the raw data from nearby regular sensors and replying the query diffused from the sink. We thoroughly studied the storage placement problem on a tree topology and excogitated the best strategy for deploying storage nodes [MobiHoc '06]. We further investigated a derivative problem that allows a sensor network to reconstruct its topology after storage nodes are deployed. We published an approximation algorithm for this problem in [WASA '07].

    Related publications:
  • [MobiHoc '06] Data Storage Placement in Sensor Networks, Bo Sheng, Qun Li and Weizhen Mao. 7th ACM International Symposium on Mobile Ad Hoc Networking & Computing, Florence, Italy, May 22-25, 2006.
  • [WASA '07] An Approximation Algorithm for Data Storage Placement in Sensor Networks, Bo Sheng, Chiu C. Tan, Qun Li and Weizhen Mao. International Conference on Wireless Algorithms, Systems and Applications, Chicago, IL, August 1-3, 2007.
  • Security and Privacy in Sensor Networks

    My research in this area includes two projects. My first work [INFOCOM '08] considered the security and privacy threats in the in-network storage model presented above, especially when storage nodes are compromised and behave maliciously. We first designed a privacy-preserving storage protocol that only discloses partial data information to storage nodes. This information helps process the raw data for queries, but knowing this information from a compromised storage node is not sufficient for an adversary to breach the privacy. Additionally, we proposed a verifiable query protocol that enables the application to detect a false reply manipulated by a malicious storage node. My second project, a collaborative effort, is to implement generic public key primitives on sensors. Our implementation is based on Elliptic Curve Cryptography and exhaustively optimized for sensors to reduce the executing time. Based on it, we proposed an access control protocol for sensor networks [ICDCS '08]. Please refer to my colleague's web page for more details (WM-ECC).

    Related publications:
  • [INFOCOM '08] Verifiable Privacy-Preserving Range Query in Two-Tiered Sensor Networks, Bo Sheng and Qun Li. 27th Conference on Computer Communications, Phoenix, AZ, April 15-17, 2008.
  • [ICDCS '08] Comparing Symmetric-key and Public-key Based Schemes in Sensor Networks: A Case Study of User Access Control, Haodong Wang, Bo Sheng, Chiu C. Tan and Qun Li. 28th International Conference on Distributed Computing Systems, Beijing, China, June 17-20, 2008.
  • Data Mining in Sensor Networks

    Mining the large data repository generated by a sensor network for useful information is crucial and challenging. In a simple solution, the data collected by sensors can be transmitted to the sink for data mining analysis. This solution, however, consumes too much energy because the data volume can be extremely large. A good method should require little data transmission, but still achieve information extraction from the large amount of data distributed over the network. My work studied a classical data mining problem of outlier detection. An outlier represents a data that is very different from the others. The definition is not solely dependent on data values, but related to the distribution of other data in the pool. Outlier detection represents a category of complicated data mining queries that are difficult for a sensor network to efficiently respond, because these queries inherently require a lot of energy-expensive data exchanges among sensors. We proposed a novel multi-round filtering scheme based on data histogram technique. Our scheme can accurately find all outlier data with significantly reduced energy cost. This work has been published in [MobiHoc '07], and its methodology can also be extended to other similar data mining problems.

    Related publications:
  • [MobiHoc '07] Outlier Detection in Sensor Networks, Bo Sheng, Qun Li, Weizhen Mao and Wen Jin. 8th ACM International Symposium on Mobile Ad Hoc Networking & Computing, Montreal, Canada, September 9-14, 2007.
  • Security and Privacy in RFID Systems

    In an RFID system, passive RFID tags are weak hardware with no micro-controller or battery. The existing security mechanisms for wireless networks can hardly be implemented on them. Our work first investigated basic security mechanisms and proposed a mutual authentication scheme between an RFID reader and an RFID tag. Different from previous research, our solution is more flexible without involving a third party server [PerCom '07]. Furthermore, we considered another special security problem of detecting missing products in inventory control applications. Our solutions enable a server to detect the missing products even with untrusted RFID readers [ICDCS '08].

    Related publications:
  • [PerCom '07] Serverless Search and Authentication Protocols for RFID, Chiu C. Tan, Bo Sheng and Qun Li. 5th Annual IEEE Conference on Pervasive Computing and Communications, White Plains, NY, March 19-23, 2007.
  • [ICDCS '08] How to Monitor for Missing RFID Tags, Chiu C. Tan, Bo Sheng and Qun Li. 28th International Conference on Distributed Computing Systems, Beijing, China, June 17-20, 2008.
  • Data Retrieval in RFID Systems

    When scanning a large volume of RFID tags, signal collision is ineluctable and seriously hinders the time efficiency. In addition, some RFID applications may not require collecting all RFID data. In current approaches, however, scanning all RFID tags is the first step for all applications, followed by respective data analysis. My work first studied a typical data mining query of finding popular categories of products among numerous RFID tags. We designed randomized algorithms with group testing technique that efficiently achieve the goal without scanning all RFID tags [MobiHoc '08]. Another on-going project is about continuous scans in an RFID system. When multiple scanning processes at different locations or time points are necessary for a task, it is commonly seen that adjacent scans contain duplicated RFID tags. We proposed efficient approaches to launching continuous scans without collecting redundant RFID tags.

    Related publications:
  • [MobiHoc '08] Finding Popular Categories for RFID Tags, Bo Sheng, Chiu C. Tan, Qun Li and Weizhen Mao. 9th ACM International Symposium on Mobile Ad Hoc Networking & Computing, Hong Kong SAR, China, May 26-30, 2008.
  • Other Research

    Search on Small Devices: We believe physical objects will be digitally annotated in a pervasive computing environment and the annotation information will be hosted at embedded devices. People can electronically search the physical world in such an environment. Searching the data stored on small devices is the building brick in this system and different from searching data on computers. First, small devices are usually equipped with flash memory instead of disk as the storage media. Second, small devices have limited RAM space for buffering data. Considering these two factors, my colleague and We implemented a textual search protocol specifically designed for small devices. Our search protocol regulates the format for the stored data and special data structures for maintaining keywords. Our protocol also includes a RAM-efficient search algorithm that can rank the search results based on the typical information retrieval metrics. Our experiments on sensors illustrate an impressive search performance. This work has been published in [Pervasive '08].

    BGP Security: Border Gateway Protocol is a core component of the Internet, and maintains the information about reachability among autonomous systems (AS). The original BGP is vulnerable to various attacks that modify the AS information contained in BGP messages. My colleague and I proposed new protection schemes based on key-chained signatures. We implemented our solutions and evaluated them by real experiments. The results indicate that our solutions are feasible for incremental deployment, and efficient in term of processing speed, storage, and bandwidth cost. This work has been published in [IWQoS '07].

    Related publications:
  • [Pervasive '08] Microsearch: When Search Engines Meet Small Devices, Chiu C. Tan, Bo Sheng, Haodong Wang and Qun Li. 6th International Conference on Pervasive Computing, Sydney, Australia, May 19-22, 2008.
  • [IWQoS '07] Securing BGP through Keychain-based Signatures, Heng Yin, Bo Sheng, Haining Wang and Jianping Pan. 15th IEEE International Workshop on Quality of Service, Chicago, IL, June 21-22, 2007.