2011 Agenda Abstracts

Best of FAST

A Study of Practical Deduplication

Bill Bolosky, Member, Microsoft Research

Abstract

We collected file system content data from 857 desktop computers at Microsoft over a span of four weeks. We analyzed the data to determine the relative efficacy of data deduplication, particularly considering whole-file versus block-level elimination of redundancy. We found that whole-file deduplication achieves about three quarters of the space savings of the most aggressive block-level deduplication for storage of live file systems, and 87% of the savings for backup images. We also studied file fragmentation, finding that it is not prevalent, and updated prior file system metadata studies, finding that the distribution of file sizes continues to skew toward very large unstructured files.
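
To make the comparison concrete, here is a minimal sketch of the two strategies (not the paper's methodology; it assumes SHA-1 content hashes and one fixed 4 KB block size, while the study explores more aggressive chunking):

    import hashlib

    BLOCK_SIZE = 4096  # one fixed chunk size; the study explores others

    def dedup_ratios(paths):
        """Fraction of bytes kept under whole-file vs. fixed-block dedup."""
        file_seen, block_seen = set(), set()
        total = file_kept = block_kept = 0
        for path in paths:
            with open(path, 'rb') as f:
                data = f.read()
            total += len(data)
            digest = hashlib.sha1(data).digest()
            if digest not in file_seen:          # whole-file: keep first copy only
                file_seen.add(digest)
                file_kept += len(data)
            for i in range(0, len(data), BLOCK_SIZE):
                block = data[i:i + BLOCK_SIZE]
                h = hashlib.sha1(block).digest()
                if h not in block_seen:          # block-level: dedupe each block
                    block_seen.add(h)
                    block_kept += len(block)
        return file_kept / total, block_kept / total

The lower the two returned ratios, the greater the space savings; the gap between them is what the study quantifies.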

 


Emulating Goliath Storage Systems with David

Leo Prasath Arulraj, Software Development Engineer, Amazon

Abstract

Benchmarking file and storage systems on large file-system images is important, but difficult and often infeasible. Typically, running benchmarks on such large disk setups is a frequent source of frustration for file-system evaluators; the scale alone acts as a strong deterrent against using larger albeit realistic benchmarks. To address this problem, we develop David: a system that makes it practical to run large benchmarks using the modest storage and memory capacities readily available on most computers. David creates a "compressed" version of the original file-system image by omitting all file data and laying out metadata more efficiently; an online storage model determines the runtime of the benchmark workload on the original uncompressed image. David works under any file system, as demonstrated in this paper with ext3 and btrfs. We find that David reduces storage requirements by orders of magnitude; David is able to emulate a 1 TB target workload using only an 80 GB available disk, while still modeling the actual runtime accurately. David can also emulate newer or faster devices, e.g., we show how David can effectively emulate a multi-disk RAID using a limited amount of memory.
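
The central trick is easy to sketch. In this toy model (all interfaces are hypothetical), metadata writes are persisted compactly on the small real disk, data writes are discarded, and every request still passes through a model of the target device so the benchmark's runtime can be predicted:

    class CompressedDisk:
        """Toy version of David's idea: keep metadata, drop data, model timing."""

        def __init__(self, backing, model, is_metadata_block):
            self.backing = backing                # small real disk (e.g., 80 GB)
            self.model = model                    # service-time model of the 1 TB target
            self.is_metadata = is_metadata_block  # file-system-specific block classifier
            self.remap = {}                       # sparse LBA -> compact location
            self.predicted_runtime = 0.0

        def write(self, lba, data):
            # Every request is timed as if it ran on the original device.
            self.predicted_runtime += self.model.service_time('write', lba, len(data))
            if self.is_metadata(lba):
                loc = self.remap.setdefault(lba, len(self.remap))
                self.backing.write(loc, data)     # metadata kept, laid out compactly
            # Data blocks are dropped: their content never touches the small disk.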

Birds of a Feather

Hiring CIFS Talent

Chris Hertel (ubiqx)

Abstract

SMB/CIFS/SMB2 is the most widely used network file system on Earth, but the engineering talent pool is small, and those who are familiar with protocol internals are scattered around the world instead of being conveniently located in Silicon Valley.

This BoF will be an open forum to discuss how companies can find and hire SMB/CIFS/SMB2 developers, and how they help current employees and new hires learn to handle these difficult protocols. The BoF will also be an opportunity for SMB/CIFS/SMB2 engineers to discuss the marketplace and future opportunities.

 


Distributed Content Caching

Chris Hertel (ubiqx)

Abstract

Is PeerDist (BranchCache) the de-facto standard for Distributed Content Caching? In this BoF, we will discuss the barriers to wide-scale adoption and deployment in a multi-platform world. Technical aspects including design, implementation, and testing issues may also be covered.

 

CIFS/SMB/SMB2

SAS Standards and Technology Update

Harry Mason, Director Industry Marketing, NetApp

Marty Czekalski, Vice President SCSI Trade Association; Interface & Emerging Architecture Program Manager, Seagate

Abstract

SAS has become the backbone of enterprise storage deployments. SAS has rapidly evolved by adding new features, capabilities, and performance enhancements. This talk will include an up-to-the-minute recap of the latest additions to the SAS standard and roadmaps. It will focus on enhanced connectivity solutions, MultiLink SAS, the status of 12Gb/s SAS development, and a new transport investigation, SOP (SCSI over PCIe).

Learning Objectives

  • Attendees will learn how SAS will grow and thrive, in part, because of the Advanced Connectivity Roadmap, which offers a solid connectivity scheme based on the versatile Mini-SAS HD connector in addition to SAS Connectivity Management support.
  • Attendees will learn that MultiLink SAS improves how slot-oriented Solid State Drive (SSD) devices can be configured to boost I/O performance. When running at 12Gb/s, a single slot will be capable of providing up to 96Gb/s of bandwidth (4 lanes × 12Gb/s × 2 directions for full duplex).
  • The latest development status and design guidelines for 12Gb/s SAS will be revealed.
  • Attendees will learn the details of the standardization activity and architecture for SCSI over PCIe (SOP).

File Systems and Thin Provisioning

Frederick Knight, Standards Technologist, NetApp

Abstract

New operations to manage thin provisioning have recently been added to or updated in the ATA and SCSI standards. This session will explain these capabilities and their storage APIs so that file system developers and application developers will gain an understanding of how and when to use these new features to enhance the efficient use of their storage subsystems.
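
As a rough illustration of the concept (a toy model, not the actual command set), a thin-provisioned LUN allocates physical blocks only when logical blocks are first written, and the new unmap-style operations (SCSI UNMAP, ATA TRIM) let a file system hand dead blocks back to the shared pool:

    class ThinLun:
        """Toy thin-provisioned LUN: allocate on write, reclaim on unmap."""

        def __init__(self, pool_blocks):
            self.free = set(range(pool_blocks))   # shared physical pool
            self.map = {}                         # logical LBA -> physical block

        def write(self, lba):
            if lba not in self.map:               # allocate-on-write
                self.map[lba] = self.free.pop()   # raises if the pool is exhausted

        def unmap(self, lba_start, count):
            # What the new commands let the file system say: "these LBAs are dead."
            for lba in range(lba_start, lba_start + count):
                phys = self.map.pop(lba, None)
                if phys is not None:
                    self.free.add(phys)           # space becomes reclaimable again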

Learning Objectives

  • Understand Thin Provisioning concepts
  • Understand ATA Trim capabilities and commands
  • Understand SCSI Provisioning Management capabilities and commands

Storage Data Movement Offload

Frederick Knight, Standards Technologist, NetApp

Abstract

SCSI operations that allow the host to offload data movement into the storage device have recently been updated and enhanced to provide new capabilities. This session will explore these new capabilities and how they are being used today. File system and application developers will gain an understanding of how these features operate and how they can be used to improve performance.
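
One mechanism in this space is token-based copying, where the host exchanges a small opaque token instead of moving the data itself. The sketch below is a toy simulation of that flow, not the real CDB layout (the actual SCSI commands are POPULATE TOKEN and WRITE USING TOKEN):

    import uuid

    class Array:
        """Toy storage device that copies data internally via tokens."""

        def __init__(self):
            self.blocks, self.tokens = {}, {}

        def populate_token(self, extents):
            token = uuid.uuid4().bytes            # opaque representation of the data
            self.tokens[token] = [self.blocks[lba] for lba in extents]
            return token

        def write_using_token(self, token, dest_lba):
            for i, data in enumerate(self.tokens[token]):
                self.blocks[dest_lba + i] = data  # copy happens inside the device

    array = Array()
    array.blocks = {0: b'a', 1: b'b'}
    tok = array.populate_token([0, 1])            # host receives a token, not data
    array.write_using_token(tok, 100)             # host redirects the token to new LBAs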

Learning Objectives

  • Understand what copy offload is
  • Understand how copy offload operates
  • Understand the SCSI commands used to perform copy offload

Data Integrity from Application to Storage

William Martin, Engineer Consultant, Emulex

Abstract

Data integrity failures in high-visibility applications have prompted vendors to add data integrity mechanisms to databases, file systems, and storage devices. The Data Integrity model being developed by SNIA's Data Integrity Technical Working Group (DITWG) presents a model of how data can be protected from the application to the storage device. The Data Integrity model defines the building blocks used in operating systems for protecting data and providing true end-to-end data integrity protection. This model utilizes the "Protection Information" feature (also known as DIF) defined in the SCSI block device command set (SBC) standard as one of the forms of protection provided in the complete stack. However, additional interfaces are necessary to extend this protection all the way to the application.
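
For reference, the Protection Information field carried with each 512-byte logical block is 8 bytes: a 2-byte guard tag (a CRC-16 of the block, generator polynomial 0x8BB7), a 2-byte application tag, and a 4-byte reference tag. A small sketch of computing it:

    def t10_dif_crc16(data: bytes) -> int:
        """CRC-16 used for the T10 DIF guard tag (polynomial 0x8BB7, init 0)."""
        crc = 0
        for byte in data:
            crc ^= byte << 8
            for _ in range(8):
                if crc & 0x8000:
                    crc = ((crc << 1) ^ 0x8BB7) & 0xFFFF
                else:
                    crc = (crc << 1) & 0xFFFF
        return crc

    def protection_information(block: bytes, ref_tag: int, app_tag: int = 0) -> bytes:
        """Build the 8-byte PI field appended to a 512-byte logical block."""
        assert len(block) == 512
        guard = t10_dif_crc16(block)
        return (guard.to_bytes(2, 'big') + app_tag.to_bytes(2, 'big')
                + ref_tag.to_bytes(4, 'big'))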

Learning Objectives

  • What are the risks to data integrity?
  • Where do those risks arise?
  • How does the SNIA Data Integrity Model protect against them?

SMB 2.2:  Bigger.  Faster.  Scalier - (Parts 1 and 2)

David Kruse, Principal Development Lead, Microsoft
Mathew George, Sr. Software Development Engineer, Microsoft

Abstract

This session comprises two parts that will take a detailed look at new extensions to the SMB 2 protocol. These new developments target improving file server availability and client-server performance. The sessions will give you an overview of what is new in SMB 2.2, and then examine in detail specific areas of the protocol.

SMB 2.2 Multichannel adds new levels of network scalability and support for modern interconnects. The 'persistent handles' feature adds fault tolerance and continuous availability to the protocol.

We will also discuss auxiliary protocols which live side by side with the SMB 2.2 protocol to provide end-to-end reliability and manageability.




Advancements in Backup to Support Application Storage on a File Server

Molly Brown, Principal Development Lead, Microsoft

Abstract

There are many compelling reasons for server applications, such as Hyper-V, to store their data on a file share, but this cannot be done if it compromises the application’s data backup and recovery strategy. This session will describe the new protocol, MS-FSRVP, designed for Windows Server 8 that allows an application server to drive the required coordination with a file server so that existing backup and recovery strategies continue to work as the application data moves from local to remote storage.

Learning Objectives

  • Understand the unique problems in the backup space when application data is stored on a file server
  • Understand the design goals for the new MS-FSRVP protocol
  • Get a detailed overview of the MS-FSRVP protocol

SMB 2.2 over RDMA

Thomas Talpey, Software Architect, Microsoft
Greg Kramer, Ph.D., Software Development Engineer, Microsoft

Abstract

A new protocol, SMB2 Direct, has been created which allows SMB 2.2 to operate over Remote Direct Memory Access (RDMA) transports such as iWARP, InfiniBand, and RoCE. This layering enables significant increases in performance for all SMB 2.2 file-based workloads and dramatically broadens the applicability of SMB 2.2. The presentation will outline the goals and motivations for the new approach, and will take a deep dive into the SMB2 Direct protocol itself, including early performance results.


SMB2 -  Advancements for WAN

Molly Brown, Principal Development Lead, Microsoft
Mathew George, Sr. Software Development Engineer, Microsoft

Abstract

This session covers advancements in SMB2 file service deployments over a Wide Area Network (WAN). This includes discussions on enhancements for SMB2 file servers and SMB2 file clients in these scenarios, such as metadata optimizations and identification of potential file caching opportunities.


Accelerating SMB2

Mark Rabinovich, R&D Manager, Visuality Systems

Abstract

Global networks introduce a significant challenge when it comes to CIFS traffic, making it virtually unbearable for the end user. SMB2 is no less chatty than SMB; hence, it too can be accelerated. We will show how to improve SMB/SMB2 traffic using various acceleration techniques. This presentation emphasizes the challenges introduced by SMB2 in comparison with SMB. I will share our CIFS acceleration experience and performance statistics for accelerated WANs carrying SMB2 traffic.

Learning Objectives

  • SMB2 traffic in WANs
  • SMB2 acceleration methods
  • SMB2 acceleration statistics
  • SMB2 acceleration examples

Samba Status Report

Volker Lendecke, Samba Team / SerNet

Abstract

Samba is a rapidly evolving project that forms part of the basis for many NAS vendors' products. This talk will give an overview of the current development of Samba.
   
Learning Objectives

  • Clustering: The current status of the active-active clustered CIFS server built from Samba and CTDB will be presented.
  • SMB2: With Samba 3.6, SMB2 is a fully supported protocol in Samba.
  • Printer support: The printer subsystem has been overhauled. The reasons for this overhaul and its implications will be presented.
  • Samba/AD: Samba 4 strives to become an AD-compatible domain controller, while Samba 3 is a solid file and print server. Different ways to merge the two on the road to Samba 4.0 will be presented.

A CIFS Geek in Exile:  What I did on my Holiday

Christopher Hertel, Storage Architect and CIFS Geek, ubiqx Consulting, Inc.

Abstract

BranchCache is a distributed caching system implemented by Windows SMB2 servers. BITS, according to at least one Microsoft blog, is the "Earth’s most widely used file transfer service". This presentation covers a new Open Source implementation of both BITS and BranchCache.

Learning Objectives

  • The BITS protocol and how it relates to SMB/CIFS/SMB2.
  • BranchCache and how it relates to SMB/CIFS/SMB2 and BITS.

CTDB Status - Clustered Samba Growing Up

Michael Adam, Senior Software Engineer, Samba Team / SerNet

Abstract

CTDB is a highly specialized clustered database and management layer sitting between Samba and a cluster file system. It makes it possible to create scalable CIFS/NFS clusters on Linux. An early self-contained implementation of all-active service clustering, CTDB is now slowly finding its way into the Linux distributions as a managed resource of the Linux cluster stack. The talk first recalls the problems Samba faces when running on a clustered file system, along with the design and history of the CTDB software. It then gives an overview of the past year's bigger changes in CTDB, especially transaction handling and vacuuming, describes the various modes in which CTDB can be run, and shows how major Linux distributions are starting to integrate CTDB into their cluster products.

Learning Objectives

  • Learn about clustered CIFS services with CTDB and Samba
  • Learn about new developments in CTDB / Clustered Samba
  • Learn about integration of CTDB into the Linux distributions

Experiences in Clustering CIFS for IBM Scale Out Network Attached Storage

Dr. Jens-Peter Akelbein, IBM Germany, Research and Development

Abstract

Clustering the CIFS and SMB2 protocols enables managing large-scale data in a single namespace while scaling access bandwidth as well. IBM SONAS uses clustering across various nodes while scaling capacity independently via a second tier of nodes. Experience with clustering CIFS, including the underlying clustered file system, over the past years has led to improvements in performance and stability. Utilizing SMB2 as the protocol leads to improvements beyond CIFS capabilities. Compared to traditional active-passive configurations, larger clusters provide active-active configurations allowing flexible maintenance and management. This talk gives insight into performance challenges resolved in applying clustered CIFS across different installations and workloads, including improvements already made or currently being applied to Samba and CTDB for use in an enterprise product.

Learning Objectives

  • Clustering and performance, how to achieve both
  • Requirements that clustered NAS places on clustered file systems
  • Requirements that clustered NAS places on protocol implementations
  • Benefits of clustering for managing Scale Out NAS devices

Hidden Gems in the NAS Protocols

James Cain, Principal Software Architect, Quantel Limited

Abstract

Having spent the last few years implementing SMB and SMB2 servers, the presenter has discovered that there are parts of these protocols that seem to offer untapped semantic richness. This session will propose theories and demonstrate practical working examples that test them. Examples will include: dynamically offering different representations of the same resource, extracting provenance from running software, and avoiding NAS head state to support dynamic failover in a clustered file system. All these examples will be built up from theoretical principles and demonstrated with working prototypes.

Learning Objectives

  • Understanding untapped NAS semantics
  • Exploiting COTS operating systems for innovative uses
  • Exploring how NAS installations can scale

Through the Looking Glass: Debugging CIFS/SMB/SMB2

Robert Randall, Senior Software Architect, Micron Technologies, Inc.

Christopher Hertel, Storage Architect, and CIFS Geek, ubiqx Consulting, Inc.

Abstract

While protocol suite tests are quite useful, there are other ways to understand how Windows is interacting with your SMB client or server. Built into the kernel of Windows is a treasure trove of telemetry which provides a rich context and clear complaints when redirectors are interacting with another end point. Watch as the treasure is revealed through simple examples that demonstrate the value of knowing what the Windows kernel can tell you about how your client or server is behaving. Leave with step-by-step instructions on how to use these valuable tools. The tools are freely available from Microsoft's web site.

Learning Objectives

  • Understand the tools required for Windows kernel debugging
  • Learn the steps required to turn on SMB/SMB2 redirector telemetry
  • Experience real examples of how the telemetry can help you to better understand your client or server and how it interacts with Windows

Lessons learned implementing a multi-threaded SMB2 server in OneFS

Aravind Velamur Srinivasan, Senior Software Engineer, Isilon Systems, Inc

Abstract

This talk will examine the lessons learned implementing SMB2 in the OneFS operating system, with particular attention to the performance optimizations available to a multi-threaded SMB server implementation; such servers have their own advantages and disadvantages. SMB2 offers a server-side credit mechanism to throttle greedy clients, and different credit algorithms can cause unexpected client behavior in certain scenarios; we’ll examine some common mistakes to avoid. Finally, the talk will compare SMBv1 and SMB2 in OneFS, presenting performance numbers that highlight the inherent benefits of using SMB2 over SMBv1.
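
For a flavor of the design space (a generic sketch, not the OneFS algorithm), a server-side credit manager might top each client up toward a bounded window, letting well-behaved clients pipeline requests while capping greedy ones:

    class CreditManager:
        """Sketch of one plausible SMB2 credit-granting policy."""

        def __init__(self, max_outstanding=128, max_grant=32):
            self.max_outstanding = max_outstanding  # cap on the client's window
            self.max_grant = max_grant              # cap on any single top-up
            self.granted = 1                        # client starts nearly empty

        def on_request(self, credits_requested):
            self.granted -= 1                       # a simple request costs one credit
            # (large reads/writes in SMB 2.1+ consume multiple credits; one assumed here)
            headroom = self.max_outstanding - self.granted
            grant = max(1, min(credits_requested, self.max_grant, headroom))
            self.granted += grant
            return grant                            # sent back in the response header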

Learning Objectives

  • SMB2 implementation in OneFS
  • The SMB2 server credit algorithm and potential pitfalls in its implementation
  • Performance optimizations in a multi-threaded SMB server
  • Performance gains achieved by using SMB2 over SMBv1 in OneFS operating system

Implementing SMB 2.1 In Likewise Storage Services

Gerald Carter, CTO, Likewise Software

Abstract

After completing support for SMB2.0 and MS Vista clients, server implementers must turn their focus to the additional SMB2 protocol features utilized by Windows 7 and Windows 2008 R2 clients.  This session will focus on experiences and knowledge gained from implementing SMB2.1 feature support in the Likewise Storage Services platform.  Topics covered will include protocol dialect negotiation beyond SMB 2.0, concurrent support for Windows 7 leases and legacy oplocks, multi-credit I/O support, and persistent file handles.

Learning Objectives

  • Understand the design and implementation trade-offs involved in managing SMB 2.1 leases
  • Explain the changes to a server’s credit handling required by large I/O requests
  • Discuss support for durable handles vs. resilient handles in SMB2 and 2.1 servers

Thinking Inside the Box: Embedded Active Directory / Storage Appliances Based on Samba

Kai Blin, Embedded Developer, Samba Team 

Abstract

In many SOHO setups, a central storage server or NAS device is already in use. Existing Open Source software makes it very easy to also move the Active Directory domain controller to the same machine, providing easy-to-use user management and file/print services to SOHO customers. This talk will describe a proof-of-concept implementation of an embedded Active Directory DC and SMB/CIFS file/print server for SOHO setups that can be administered using a web interface or existing AD management tools. While the proof-of-concept implementation is limited to about a dozen clients, the same system is usable on more powerful hardware for bigger networks.

Learning Objectives

  • Using Samba to provide AD DC and file/print services
  • Scaling Samba down to embedded system constraints
  • Using a web interface to administer Samba

Moving an Enterprise Database Platform to run on CIFS/SMB/SMB2 File Access Protocols

Kevin Farlee, Storage Engine Program Manager, SQL Server, Microsoft

Abstract

There are many considerations to work through when taking a performance-sensitive enterprise application designed to run against direct-attached or SAN hardware and running it against network-attached storage. The Microsoft SQL Server Storage Engine team faced this problem when redesigning SQL Server to run over the CIFS/SMB/SMB2 protocols to use NAS. I will discuss some of the key issues we tackled, walk you through each stage, and describe how this progressed in real life.

Learning Objectives

  • How are databases different from more typical NAS workloads?
  • Why would we do that? (What makes it worth the engineering investment?)
  • What are the considerations in moving to a new storage platform?
  • Code Changes – What product code had to change?
  • Testing – How does this impact automated test infrastructure?
  • Taking advantage of new capabilities

Cloud

Programming the Cloud

Fleur Dragan, Consultant, EMC

John Kilroy, Principal Software Engineer, EMC

Abstract

This talk will describe how cloud-based applications interact with stored data.  The traditional semantics of file systems are often not applicable or relevant.  We will discuss how language-specific idioms from several common development frameworks are mapped into the stored data abstractions present in the cloud.  We will also outline some of the practical implications of deploying such applications.

Learning Objectives

  • Understand how cloud applications interact with stored data
  • Semantic differences between traditional file systems and cloud stores (blob, NoSQL and key-value)
  • Comparison of language-specific idioms for interacting with cloud stores (Ruby vs Java/Spring vs Python)
  • Practical implications of variations in consistency models, latencies, and geographic distribution
  • Lessons learned from a real-world implementation

CDMI for Cloud IPC

David Slik, Technical Director, Object Storage, NetApp, Inc.

Abstract

In addition to providing storage services to end users, cloud storage systems enable cloud-aware programs to communicate between themselves in a distributed and asynchronous manner by using the cloud as a platform for Inter-Process Communications (IPC). This session discusses ways that CDMI enables cloud IPC, and the various use cases enabled by the use of a cloud in this manner. Special emphasis is given to CDMI Queues, which provide a first-in-first-out storage object, and are valuable for fan-in, fan-out and buffering messages and data exchanged between programs.
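
Because CDMI rides on HTTP, queue-based IPC is easy to sketch with any HTTP client. The request shapes below follow the CDMI 1.0 queue conventions but are illustrative and should be checked against the spec; the endpoint and credentials are hypothetical:

    import requests

    BASE = 'https://cloud.example.com/cdmi'         # hypothetical CDMI endpoint
    HDRS = {'X-CDMI-Specification-Version': '1.0.1'}

    # Create a queue object.
    requests.put(f'{BASE}/myQueue/',
                 headers={**HDRS, 'Content-Type': 'application/cdmi-queue',
                          'Accept': 'application/cdmi-queue'},
                 json={'metadata': {}})

    # Producer: POST appends a value to the back of the queue.
    requests.post(f'{BASE}/myQueue/',
                  headers={**HDRS, 'Content-Type': 'application/cdmi-queue'},
                  json={'value': 'work item 1'})

    # Consumer: read the oldest value, then delete it to complete the dequeue.
    front = requests.get(f'{BASE}/myQueue/?values',
                         headers={**HDRS, 'Accept': 'application/cdmi-queue'}).json()
    requests.delete(f'{BASE}/myQueue/?value', headers=HDRS)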

Learning Objectives

  • Learn how CDMI clouds can be used to author distributed Internet applications
  • Learn about the different use cases for cloud IPC that are enabled by CDMI
  • Learn about CDMI Queues and message passing
  • See a demonstration of how cloud IPC can be used to create distributed web applications

Open Source Droplet library with CDMI support

Giorgio Regni, CTO, Scality

Abstract

A year ago we started the Scality Open Source Program here at SDC by opening our Droplet project, a BSD-licensed cloud storage client library. Droplet has now been extended to support CDMI in addition to the S3 protocol, making it the only cross-cloud-compatible C client library. Developers around the world have contributed and worked on very promising projects, including a cloud migration tool, an incremental backup agent with data deduplication, and virtual machine image targets.

Learning Objectives

  • Access CDMI compatible storage
  • Develop cross cloud applications
  • Migrate data from one cloud to another
  • Integrate Droplet in existing applications

CDMI Federations, Year 2

David Slik, Technical Director, Object Storage, NetApp, Inc.

Abstract

In addition to standardizing client-to-cloud interactions, the SNIA Cloud Data Management Interface (CDMI) standard enables a powerful set of cloud-to-cloud interactions. Federations, being the mechanism by which CDMI clouds establish cloud-to-cloud relationships, provide a powerful multi-vendor and interoperable approach to peering, merging, splitting, migrating, delegating, sharing, and exchange of stored objects. In last year's SDC presentation, the basics of CDMI federation were discussed. For year two, we will review further refinements to making federations interoperable, demonstrate common use cases enabled by federation, and discuss the ongoing work within the SNIA Cloud Storage Technical Working Group to add federation as a formal part of the CDMI 1.1 standard currently under development.

Learning Objectives

  • Learn what CDMI Federations are, and how they help cloud users and providers
  • Learn about common use cases of CDMI Federations
  • Learn about updates to CDMI federation and ongoing standardization efforts
  • See a multi-system demonstration of CDMI Federations in action

CDMI Conformance and Performance Testing

David Slik, Technical Director, Object Storage, NetApp, Inc.

Abstract

In order to deliver on the multi-vendor interoperability promise of the Cloud Data Management Interface (CDMI) standard, conformance testing and performance benchmarking tools are an essential part of the development and user community ecosystem. This session reviews the goals of conformance and performance testing, and provides an overview of the open source confCDMI and perfCDMI tools released by NetApp to assist in the validation and performance characterization of CDMI storage systems.

Learning Objectives

  • Learn about the value and goals of CDMI conformance and performance testing
  • Learn about the confCDMI and perfCDMI tools
  • See a demonstration of the confCDMI and perfCDMI tools

Use of Storage Security in the Cloud

David Dodgson, Software Engineer, Unisys

Abstract

Everyone is concerned with the security of their storage in the cloud; however, security in any particular case depends on what the user is trying to accomplish. Someone storing pictures of their children in the cloud will have a different idea of security than someone who is generating payroll information. Storage security needs to be implemented with an understanding of the different needs of different users. Enterprises will want to use secure private clouds that are customized to their individual security requirements, while individuals will want public clouds to address their needs. The most important security requirements will be those that satisfy the needs of the greatest number of users in a particular cloud environment.

Learning Objectives

  • Describe common use cases for storage in the cloud, from Infrastructure as a Service to Storage as a Service.
  • Compare how different use cases have different security needs.
  • Compile a list of security requirements for storage in clouds, both private and public, to satisfy these needs.
  • Show how these requirements might be implemented.

Authenticating Cloud Storage with Distributed Keys

Jason Resch, Senior Software Engineer, Cleversafe

Abstract

Cloud storage is different from traditional systems. Typically the storage provider is not fully trusted. Passwords are often reused, easy to crack, difficult to remember, and dependent on the availability of the authentication service. Private keys offer a more resilient and secure method, but migrating, using, and maintaining such keys is burdensome. A new technique will be discussed which combines the best of both worlds: the ease of use of passwords with the security properties of keys. Using this method in cloud storage systems, user experience, security, and robustness can be greatly improved.
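
One plausible construction in this spirit (an illustration, not necessarily the scheme presented in the talk): stretch the password into a strong key with a standard KDF, then split that key into shares held by independent servers, so no single server can expose or impersonate the user:

    import hashlib
    import secrets

    def xor_all(chunks):
        out = bytes(len(chunks[0]))
        for c in chunks:
            out = bytes(a ^ b for a, b in zip(out, c))
        return out

    def make_shares(password: str, salt: bytes, n: int = 3):
        """Derive a key from the password and split it into n XOR shares."""
        key = hashlib.pbkdf2_hmac('sha256', password.encode(), salt, 100_000)
        shares = [secrets.token_bytes(len(key)) for _ in range(n - 1)]
        last = bytes(a ^ b for a, b in zip(key, xor_all(shares)))
        return shares + [last]       # store each share on a different server

    def recover_key(shares):
        return xor_all(shares)       # all n shares are required (n-of-n)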

Learning Objectives

  • Unsuitability of usernames and passwords
  • Difficulties in using certificates and keys
  • A new technique: Distributed authentication keys

Resilience at Scale in the Distributed Storage Cloud

Alma Riska, Consultant Software Engineer, EMC

Abstract

The cloud is a diffuse and dynamic place to store both data and applications, unbounded by data centers and traditional IT constraints. However, adequate protection of all this information still requires consideration of fault domains, failure rates, and repair times that are rooted in the same data centers and hardware we attempt to meld into the cloud. This talk will address the key challenges to a truly global data store, using examples from the Atmos cloud-optimized object store. We discuss how flexible replication and coding allow data objects to be distributed, and where automatic decisions are necessary to ensure resiliency at multiple levels.

We will discuss the impact of using a virtualized infrastructure inside the cloud, noting where this does and does not change the resiliency characteristics of the complete system and how it changes the design reasoning compared to purely physical hardware. Automatic placement of data and redundancy across a distributed storage cloud must ensure resiliency at multiple levels, i.e., from a single node to an entire site. System expansion must occur seamlessly without affecting data reliability and availability. All these features together ensure data protection while fully exploiting the geographic dispersion and platform adaptability promised by the cloud.

Learning Objectives

  • Understand replication and coding as used in the cloud
  • Understand the basics of fault domains and resilient design
  • Separate the promise from the reality of cloud storage
  • Understand resilience of virtual vs. physical infrastructure
  • Understand how automated, policy-driven storage increases data resiliency

Changing Requirements for Distributed File Systems in Cloud Storage

Wesley Leggette, Cleversafe, Inc.

Abstract

File systems typically have centralized metadata servers that present performance bottlenecks as concurrent users and system size increase. These are unique challenges for distributed file systems. Cloud storage systems often store large unstructured content, and the streaming write access patterns typical of such systems allow for optimizations that cannot be made in traditional file systems. A new technique that adapts principles from NoSQL and object storage paradigms, using information dispersal for both underlying storage and metadata, provides a viable solution for streaming write access patterns. This technique allows for distributed writes and no single point of failure, scales in both system size and concurrent clients, and limits performance bottlenecks.
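
The dispersal idea itself is compact. Below is a minimal 2-of-3 XOR code for flavor (production systems use general k-of-n erasure codes such as Reed-Solomon): any two slices rebuild the data, and no single node holds the whole object:

    def disperse(data: bytes):
        half = (len(data) + 1) // 2
        a, b = data[:half], data[half:].ljust(half, b'\0')
        parity = bytes(x ^ y for x, y in zip(a, b))
        return a, b, parity                 # store each slice on a different node

    def rebuild(a=None, b=None, parity=None, length=0):
        if a is None:                       # recompute whichever slice is lost
            a = bytes(x ^ y for x, y in zip(b, parity))
        if b is None:
            b = bytes(x ^ y for x, y in zip(a, parity))
        return (a + b)[:length]

    a, b, p = disperse(b'hello world')
    assert rebuild(b=b, parity=p, length=11) == b'hello world'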

Learning Objectives

  • Learn how new access patterns for large content repositories allow for optimizations in file system design.
  • Understand the importance of providing reliability and scalability for both data and metadata.
  • Learn about optimistic concurrency on an underlying dispersed storage substrate, and how it allows effective metadata management without complex distributed transaction systems.
  • Learn how this technique allows for distributed writes.
  • Learn how dispersal allows file system design to be simplified by eliminating the complexity of replication management.

Best Practices in Designing Cloud Storage Based Archival Solution

Jim Rice, Principal Engineer, EMC

Abstract

Cloud storage facilitates the use case of digital archiving over long periods of time by transparently providing scalable storage resources. With the ever-increasing amount of data to be preserved for legal and compliance reasons, cloud storage, when designed correctly, can provide a low-cost solution in a geographically distributed environment. This presentation highlights the key considerations in developing an archive product using REST-based cloud storage. It goes on to highlight the design choices in developing a file-based archiving solution on cloud storage, using EMC Atmos as an example. The aspects covered are security, performance, using vendor-neutral APIs, developing portable applications irrespective of the backend cloud, taking advantage of geographically spread cloud storage nodes, faster searches, and an efficient disaster recovery mechanism.

Learning Objectives

  • Considerations while developing an archival application with cloud storage
  • Security and performance aspects while designing an application for cloud storage
  • Leverage cloud vendor provided capabilities in your application

Tape’s Role in the Cloud

Chris Marsh, Market Development Manager, Spectra Logic

Abstract

There is no doubt cloud storage is having a profound impact on IT and how technologies are deployed and consumed. Tape is the strong, silent partner to the cloud: very much present and in use, but completely transparent to the end-user. Chris will discuss how cloud storage's consumption model is built around ease of use, flexibility, and cost savings, and why tape is one of the most logical and cost-effective tiers for storing data in the cloud, particularly as the cost difference between tape and disk increases as data sets grow. He will review the key benefits of tape, reveal why it is quickly becoming the media of choice for cloud providers, and provide real-world examples of tape's role in the cloud.

Learning Objectives

  • Learn about the two key benefits of tape for cloud storage providers: cost savings (CapEx, OpEx, power and cooling) and multiple media platforms for reliability/continuity.
  • Learn why tape has become a popular choice for cloud providers in relation to disaster recovery scenarios.
  • A review of cloud storage provider best practices and how to utilize the right mix of media to guarantee that protected data is available to be restored when needed, including the need for offline copies.

DATA MANAGEMENT

Long Term Information Retention

Dr. Sam Fineberg, Information Management Chief Technologist, HP Software

Simona Rabinovici-Cohen, Research Staff Member, IBM Research - Haifa

Abstract

As more of the world’s information is digital throughout its entire lifecycle, we are faced with the age-old issues of record keeping and preservation, applied to devices and formats that were never intended to last. Long-term digital information suffers from issues that didn’t exist in the short-term or paper world, such as media and format obsolescence, bit-rot, and loss of metadata. The SNIA Long Term Retention (LTR) TWG has taken the lead on this issue for the storage industry. Working with key stakeholders in the preservation field, the LTR TWG is developing the Self-contained Information Retention Format (SIRF). SIRF is a low-level data format that enables applications to interpret stored data, no matter what application stored it originally. SIRF will be examined in a new European Union integrated research project, called ENSURE – Enabling kNowledge, Sustainability, Usability and Recovery for Economic Value. ENSURE creates a preservation infrastructure for commercial digital information built upon cloud storage and virtualization enablement technologies. It explores issues such as evaluating cost and value for different digital preservation solutions, automation of preservation processes, content-aware long-term data protection, new and changing regulations, and obtaining a scalable, affordable solution by leveraging cloud technology. The presentation will cover use cases, requirements, and the proposed architecture for SIRF, as well as its potential usage in ENSURE storage services.

Learning Objectives

  • Recognize the difficulties in long-term digital preservation, and how they differ from short-term.
  • Learn best practices and techniques to mitigate the risks inherent in long term preservation, and how those can be applied to today’s storage systems.
  • Identify some of the related standards developed within SNIA and other organizations that are relevant for long term digital preservation.
  • Discuss the need, use cases, requirements, and proposed architecture of SIRF.
  • Discuss the latest activities in the development of SIRF.

Open Unified Data Protection and Business Continuity

Dr. Anupam Bhide, CEO/Founder, Calsoft

Abstract

This session presents a unified model for data protection and business continuity in complex enterprise systems. Today's data center administrators are faced with the challenge of managing and protecting complex enterprise systems comprising physical and virtual components, built from heterogeneous hardware and software with complex interconnects. A typical enterprise system consists of:

  • Applications – SAP, Exchange, SharePoint, and more
  • Middleware – databases, web servers
  • Operating systems – Windows, Linux, UNIX, and hypervisors
  • Servers – physical servers, blades, virtualized servers
  • Network – physical and virtual switches, routers, firewalls
  • Storage – SAN, NAS, DAS, cloud

All components in the enterprise system use persistent storage in the form of DAS, NAS, or SAN. The unified model presented and implemented by Calsoft explores an open, independent framework for data protection using storage-level snapshots. The framework uses SMI-S to interact with the various enterprise system components and ensures a consistent state from which to perform data protection and disaster recovery.

Learning Objectives

  • Efficient Storage Provisioning and Storage Management
  • Protect Data from Corruptions and Attacks
  • Integration between applications, middle tier, OS/hypervisors, the physical (server) layer, and storage
  • SNIA SMI-S and a move to build open standards for providing data protection at all levels
  • Addressing the challenges faced today in data protection of complex enterprise systems

A Centralized Data Protection Application for Cross-Vendor Storage Systems

Nishi Gupta, Tata Consultancy Services
Prateek Sinha, Tata Consultancy Services

Abstract

Data centers contain storage products from different vendors to meet the storage needs of multiple servers hosting various applications such as databases and mail servers. These mixed environments pose a challenge for administrators, who must use different Data Protection Applications (DPAs) for different combinations of applications, operating systems, and storage systems. Here we present an approach to resolve this issue: a centralized DPA with a single user interface that uses storage APIs in the backend to perform data protection operations on cross-vendor storage systems. It consists of pluggable modules for different functional areas, for example backup/recovery, archival, compliance, and deduplication. We will share the results of the PoC and the challenges that were faced, and propose the need for standards in storage data protection APIs.

Learning Objectives

  • Understanding the need for standardization in data protection APIs
  • Understanding how the APIs of the built-in backup/recovery, archival, compliance, and similar capabilities of various storage appliances can be leveraged to provide data protection for applications (databases, mail servers, etc.)
  • Understanding the gaps that exist in different vendors' storage APIs in the area of data protection

FCoE Direct End-Node to End-Node (aka FCoE VN2VN)

John Hufferd, Owner, Hufferd Enterprises

Abstract

A new concept has just been accepted for standardization in the Fibre Channel (T11) standards committee: FCoE VN2VN (aka Direct End-Node to End-Node). The FCoE standard, which specifies the encapsulation of Fibre Channel frames into Ethernet frames, is being extended to permit FCoE connections directly between FC/FCoE end-nodes. The tutorial will show the fundamentals of the extended FCoE concept that permit it to operate without FC switches or FCoE switches (aka FCFs), and will describe how it might be exploited in a data center environment.

Learning Objectives

  • The audience will gain a general understanding of the concept of using a Data Center type Ethernet for the transmission of Fibre Channel protocols without the need for an FCoE Forwarder (FCF).
  • The audience will gain an understanding of the benefits of converged I/O and how a Fibre Channel protocol can share an Ethernet network with other Ethernet based protocols and establishes a virtual FCoE link directly between the End-Nodes.
  • The audience will gain an understanding of potential business value and configurations that will be appropriate for gaining maximum value from this Direct End-Node to End-Node capability.

Understanding Primary Storage Optimization Options

Jered Floyd, Chief Technology Officer and Founder, Permabit Technology Corporation

Abstract

Selecting the right primary data optimization technology for your storage platform and integrating it into your existing software can be much less painful than it may at first sound. While compression and data deduplication are both now standard features for backup, few storage vendors have integrated either into primary storage. There are many challenges when trying to leverage technology designed for backup within primary storage, but newer technologies designed specifically for primary storage are much simpler to implement.  Combining deduplication with compression provides additive savings, allowing storage vendors to drive cost savings to their users while still maintaining data safety. In this session, Jered Floyd, CTO of Permabit, will compare the two technologies using real world case studies and will explore which is a better fit for different data types and system architectures.
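
The additive-savings point fits in a few lines: deduplication first removes repeated blocks, then compression shrinks the unique blocks that remain. A minimal sketch:

    import hashlib
    import zlib

    def optimized_size(blocks):
        """Bytes stored after dedupe-then-compress over an iterable of blocks."""
        unique = {}
        for block in blocks:
            digest = hashlib.sha256(block).digest()
            if digest not in unique:                   # dedupe stage
                unique[digest] = zlib.compress(block)  # compress stage
        return sum(len(c) for c in unique.values())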

Learning Objectives

  • Attendees will learn when deduplication and compression are best used together and when they are best used separately
  • Attendees will have a clear understanding of the benefits and drawbacks of the two technologies.
  • Attendees will fully comprehend how to avoid performance degradation and the impact the various data optimization solutions have on their clients’ data.
  • Attendees will leave the session with a clear understanding of how to select the right primary data optimization for their storage platform, having seen real world examples.

ETracker – Track files on your laptop and enhance your storage using email

Uttam Kaushik, Manager, Engineering, EMC

Abstract

As humans we rely heavily on computers for day-to-day activities. This is driving personal computer data growth at a phenomenal pace: from photos to videos, movies to songs, we want to keep it all “online”. To make space, data is burned to DVDs or moved to external storage. This results in multiple copies of data in multiple places, making it difficult to track files when needed. Small office/home office (SOHO) businesses have a pressing need to back up critical data and, from time to time, to archive legal documents with proper tracking of changes made to those documents. This presentation describes an application that can help with versioning, backup, and archiving of files residing on a laptop to a public email system, such as Gmail or Yahoo Mail, while they can still be accessed from the laptop seamlessly. It explains how to leverage Windows NTFS features for managing the life cycle of laptop files.
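
The underlying mechanism is ordinary SMTP with attachments. A hedged sketch of the core operation (the addresses, credentials, and subject-line version tag are all illustrative):

    import os
    import smtplib
    from email.message import EmailMessage

    def archive_to_email(path, version, user, password):
        """Push a versioned copy of a file into an ordinary mailbox."""
        msg = EmailMessage()
        msg['From'] = msg['To'] = user
        msg['Subject'] = f'etracker:{path}:v{version}'   # tag for later lookup
        with open(path, 'rb') as f:
            msg.add_attachment(f.read(), maintype='application',
                               subtype='octet-stream',
                               filename=os.path.basename(path))
        with smtplib.SMTP_SSL('smtp.gmail.com', 465) as server:
            server.login(user, password)
            server.send_message(msg)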

Learning Objectives

  • Leveraging a public email system to back up and archive files on a personal computer
  • How this product differs from other popular tools such as Gmail Drive or Gspace
  • How to use NTFS features and free email to facilitate a low-cost backup/archive solution

/etc

A Case Study: Unique NAS Issues and Solutions at The MathWorks

Ira Cooper, Senior Systems Software Engineer, The MathWorks, Inc

Abstract

The MathWorks is not the first company that comes to mind when one thinks of heavy NAS users. However, our testing environment relies heavily on NAS, and our needs are very different from those of most NAS users. As such, we face a unique set of issues and challenges. This case study will trace the progression of the MathWorks' NAS implementation -- from our start with off-the-shelf vendors to our current homegrown solution. We will detail the decisions we made, why we made them, and what ultimately drove us to develop our own solution. We hope that, by the end of this talk, you will have a better idea of what your clients are thinking and why.


Deep Dive into CIM Client Development with SBLIM

Brian Mason, MTS SW 4, NetApp

Abstract

CIM XML client development can be painful and slow without a good client-side toolkit. Some developers choose to build their own, but why? SBLIM is an open source client-side library that simplifies client creation. It has libraries for C++ and Java and, with a little help, .NET. The presentation covers querying objects, classes, and associations using the programmatic API as well as the WBEM Query Language (WQL); function calls with intrinsic and user-defined methods; and a deep dive into performance-tuning queries. As a bonus, a brief tutorial on how to use the Java version of the library in a .NET environment is included. All examples are done in Java.
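
The talk's examples are in Java against SBLIM's API; for a self-contained taste of the same call patterns (enumeration, WQL queries, association traversal), here is the equivalent in Python using pywbem, an analogous open source CIM client. The host, credentials, and class choices are placeholders:

    import pywbem

    conn = pywbem.WBEMConnection('https://cimom.example.com:5989',
                                 ('user', 'password'),
                                 default_namespace='root/cimv2')

    systems = conn.EnumerateInstances('CIM_ComputerSystem')       # enumerate a class
    disks = conn.ExecQuery('WQL', 'SELECT * FROM CIM_DiskDrive')  # WQL query
    devices = conn.Associators(systems[0].path,                   # follow associations
                               AssocClass='CIM_SystemDevice')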

FIBRE CHANNEL

16GFC Sets the Pace in Storage Networks

Scott Kipp, Senior Technologist, FCIA
Mark Jones, Director Technical Marketing, Emulex

Abstract

Storage area networks based on 16 Gigabit/second Fibre Channel (16GFC) will be deployed in 2011, doubling the throughput of 8GFC SANs. This presentation will give historical perspectives on the seven generations of Fibre Channel and explain how 16GFC differs from the other Fibre Channel speeds. The presentation will investigate applications that need 16GFC and the benefits of using it, and will also discuss the latest developments in 32GFC and the hardware that is driving the need for more speed.

Learning Objectives

  • FCIA Roadmap from 1 Gigabit Fibre Channel (1GFC) to 1 Terabit Fibre Channel (1TFC)
  • Technical aspects of 16GFC
  • Benefits of 16GFC

Fibre Channel over Ethernet (FCoE)

John Hufferd, Owner, Hufferd Enterprises

Abstract

The Fibre Channel (T11.3) standards committee developed a standard called Fibre Channel over Ethernet (FCoE). The FCoE standard specifies the encapsulation of Fibre Channel frames into Ethernet frames and the amalgamation of these technologies into a network fabric that can support Fibre Channel protocols alongside other protocols such as TCP/IP and UDP/IP. A “Direct End-to-End” FCoE variant has been accepted for the next version of the standard. The tutorial will show the fundamentals of these FCoE concepts, describe how they might be exploited in a data center environment, and position FCoE with regard to FC and iSCSI. The requirements that support for FC protocols places on the Ethernet fabric will also be shown.
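
The encapsulation itself is simple to picture: a Fibre Channel frame rides inside an Ethernet frame tagged with the FCoE Ethertype (0x8906). The sketch below shows only that framing idea; the real FCoE header, with its version, SOF, and EOF fields, is omitted:

    import struct

    FCOE_ETHERTYPE = 0x8906

    def encapsulate(dst_mac: bytes, src_mac: bytes, fc_frame: bytes) -> bytes:
        """Toy FCoE encapsulation: Ethernet header + FC frame (FCoE header omitted)."""
        eth_header = struct.pack('!6s6sH', dst_mac, src_mac, FCOE_ETHERTYPE)
        return eth_header + fc_frame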

Learning Objectives

  • The audience will gain a general understanding of the concept of using a Data Center type Ethernet for the transmission of Fibre Channel protocols.
  • The audience will gain an understanding of the benefits of converged I/O and how a Fibre Channel protocol can share an Ethernet network with other Ethernet based protocols.
  • The audience will gain an understanding of potential business value and configurations that will be appropriate for gaining maximum value from this converged I/O capability.

FCoE: The Next Generation

Michael Ko, CTO Office, Huawei Symantec

Abstract

The FCoE standard allows host servers on the Ethernet network to access storage on the Fibre Channel SAN. A bridging element known as an FCoE Forwarder forwards frames between the two dissimilar fabrics. As currently defined, all FCoE control and data frames must pass through the FCoE Forwarder, so the resulting bottleneck is unavoidable when the bridging function is required. This is not the case once Fibre Channel devices are gradually phased out and replaced with FCoE ones. This talk will describe the different ways currently being standardized to resolve the bottleneck problem. In addition, this talk will describe an alternative solution which allows customers to retain the use of the FCoE Forwarder for the control plane but bypass it in the data plane.

Learning Objectives

  • A short introduction to FCoE
  • FCoE Forwarder (FCF) and its functions
  • Future FCoE using Control FCF and FDF
  • Future FCoE without FCF
  • Future FCoE with FCF

Open-FCoE Software Initiator(s) – Architecture, Management and Performance

Prafulla Deuskar, Storage Networking Architect, Intel

Abstract

Open-FCoE software initiators have been released for Windows, Linux, and ESX (in progress). The Open-FCoE stack has been certified through EMC E-Lab and NetApp storage certification. This presentation looks at the architecture of the Open-FCoE software stack on different operating systems, how it interoperates with DCB, and how it plugs into management frameworks. We will also look at the performance of the stack and how it compares with HW initiators, especially on benchmarks which mimic real-world applications like JetStress and SQL-IOSim.

Learning Objectives

  • Understand Open-FCoE architecture
  • Understand how to manage Open-FCoE initiator
  • Compare/Contrast Open-FCoE performance with HW initiator



"The Impossible Takes Longer": Emulating Windows File System Semantics on POSIX

Jeremy Allison, Google

Abstract

Over the years Samba has moved from a thin layer of Windows emulation on top of POSIX to implementing something similar to the Windows "File System Algorithms" layer. If you have to emulate Windows completely on the wire, you need to emulate it completely on top of your platform. As most new storage platforms are Linux-based, learn how Samba manages to create the illusion of Windows on POSIX, and about some of the things that are really impossible to get right.
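
A tiny example of the kind of mismatch involved: Windows names are case-insensitive but case-preserving, while POSIX names are case-sensitive, so a faithful server needs a folded lookup whenever the exact name is absent. A naive sketch (real servers cache aggressively rather than scanning):

    import os

    def ci_lookup(directory: str, wanted: str):
        """Find a directory entry matching 'wanted' case-insensitively."""
        target = wanted.lower()
        for name in os.listdir(directory):   # O(n) scan per miss
            if name.lower() == target:
                return os.path.join(directory, name)
        return None                          # genuinely absent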

Learning Objectives

  • Internals of Windows filesystem semantics
  • Internals of POSIX filesystem semantics
  • Advanced Linux storage technology

Implementing Alternate Data Streams in Likewise Storage Services

Wei Fu, Software Design Engineer, Likewise Software

Gerald Carter, Director of Engineering, Likewise Software

Abstract

Modern SMB/SMB2 clients make use of alternate data streams for a variety of application purposes such as desktop UI enhancements, additional document properties, and location information for files downloaded from untrusted networks. Expectations from end-users and client machines make support for data streams a highly desirable, if not required, feature in today’s storage devices. Likewise-CIFS is the SMB/SMB2 file server component of the Likewise Open project's Active Directory integration effort and is part of the larger Likewise Storage Services platform. This session will present both an architectural overview of the Likewise PVFS driver’s data stream implementation and a case study of the effort required to add stream support to a pre-existing file server.
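
On Windows itself, NTFS exposes alternate data streams through ordinary path syntax ("name:stream"), so plain file I/O is enough to demonstrate the behavior a server must reproduce (run on an NTFS volume):

    import os

    with open('report.txt', 'w') as f:
        f.write('main document content')

    with open('report.txt:author', 'w') as f:   # write an alternate stream
        f.write('written by wei')

    with open('report.txt:author') as f:        # the stream travels with the file
        print(f.read())

    print(os.path.getsize('report.txt'))        # reports the default stream only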

Learning Objectives

  • Understand the semantics and use of alternate data streams in SMB/SMB2 environments
  • Explain the stream and file object model used in the Likewise POSIX File System driver (PVFS)
  • Distinguish between the mechanism and policy of data stream storage on non-stream aware file systems such as Linux’s ext4

FILE SYSTEM

NFSv4 Protocol Development

Tom Haynes, Ph.D., Senior Engineer, NetApp

Abstract

The NFSv4 protocol undergoes a lifecycle of definition and implementation. We'll examine the lifecycle, what goes into the selection of new features, how these features are refined, and the impact these features will have on end users. We'll also look at how implementation experience will feed back into the protocol definition.

Learning Objectives

  • Understand the NFSv4 protocol delivery model
  • Understand how implementation experience impacts the final protocol
  • Understand the new features being delivered

Ceph Distributed Storage

Sage Weil, Co-founder, New Dream Network

Abstract

As the size and performance requirements of storage systems have increased, file system designers have looked to new architectures to facilitate system scalability. Ceph is a distributed object store, network block device, and file system designed for reliability, performance, and scalability from terabytes to exabytes.

Ceph's architecture consists of two main components: an object storage layer, and a distributed file system that is constructed on top of this object store. The object store provides a generic, scalable storage platform with support for snapshots and distributed computation. This storage backend is used to provide a simple network block device (RBD) with thin provisioning and snapshots, or an S3 or Swift compatible RESTful object storage interface.  It also forms the basis for a distributed file system, managed by a distributed metadata server cluster, which similarly provides advanced features like per-directory granularity snapshots, and a recursive accounting feature that provides a convenient view of how much data is stored beneath any directory in the system.
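
For a feel of the object-store layer, here is a hedged sketch using the librados Python binding that ships with Ceph; the pool name and configuration path are assumptions about the local cluster:

    import rados  # Python binding shipped with Ceph

    cluster = rados.Rados(conffile='/etc/ceph/ceph.conf')
    cluster.connect()
    ioctx = cluster.open_ioctx('data')           # assumes a pool named 'data'

    ioctx.write_full('greeting', b'hello ceph')  # store an object
    print(ioctx.read('greeting'))                # read it back
    ioctx.close()
    cluster.shutdown()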

This talk will describe the Ceph architecture and then focus on the current status and future of the project.  This will include a discussion of Ceph's relationship with btrfs, the file system and RBD clients in the Linux kernel, RBD support for virtual block devices in Qemu/KVM and libvirt, and current engineering challenges.


Leveraging Btrfs Transactions

Sage Weil, Co-founder, New Dream Network

Abstract

Btrfs is a relatively new file system for Linux built on top of a copy-on-write btree abstraction. Unlike most other file systems, btrfs stores all metadata (and some data) in the btree, and uses a common transaction commit framework to ensure that the file system image is consistent on disk at all times. The Ceph distributed file system switched to using btrfs as the underlying storage for each object storage node because it could hook into the transaction framework to keep its data and metadata consistent at all times. The mechanism used to do this has evolved over the past few years. This talk will provide a btrfs design overview, including its copy-on-write and writable snapshot features, and then talk more specifically about how btrfs's architecture can be leveraged by applications.
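
The property being leveraged is easiest to see in a toy copy-on-write tree (a sketch, not btrfs's actual btree): an update copies only the path from the changed leaf to the root, so an old root pointer remains a consistent snapshot for free:

    class Node:
        def __init__(self, children=None, value=None):
            self.children = children or {}
            self.value = value

    def cow_insert(root, path, value):
        """Return a NEW root that shares all untouched subtrees with the old one."""
        new = Node(dict(root.children), root.value)
        if not path:
            new.value = value
        else:
            head, rest = path[0], path[1:]
            child = root.children.get(head, Node())
            new.children[head] = cow_insert(child, rest, value)
        return new

    v1 = cow_insert(Node(), ['dir', 'file'], b'old')
    v2 = cow_insert(v1, ['dir', 'file'], b'new')   # v1 is still a consistent snapshot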


RESTful File Systems

Presenter (Pending Confirmed Speaker)

Abstract

Pending


The Design and Evolution of the Apache Hadoop Distributed File System

Dhruba Borthakur, Project Lead, Apache Hadoop DFS, Facebook

Abstract

This talk describes the architecture of the Apache Hadoop Distributed File System (HDFS). It analyzes the evolution of HDFS by discussing why certain design decisions were made, which features are deemed more important than others, and the types of applications that use HDFS. It contends that HDFS has been a creative but disruptive force in the world of general-purpose file systems.
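
One design decision the talk touches on is replica placement. As described in the HDFS literature, the default policy puts the first replica on the writer's node and the remaining two on two different nodes of a single remote rack, trading some rack diversity for write bandwidth. A sketch (assuming the chosen remote rack has at least two nodes):

    import random

    def place_replicas(writer, nodes_by_rack):
        """writer: (rack_id, node); nodes_by_rack: {rack_id: [node, ...]}."""
        local_rack, local_node = writer
        remote_rack = random.choice([r for r in nodes_by_rack if r != local_rack])
        remote_nodes = list(nodes_by_rack[remote_rack])
        second = random.choice(remote_nodes)
        third = random.choice([n for n in remote_nodes if n != second])
        return [local_node, second, third]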


GPFS - Scale-out File Storage

John Palmer, (Title Pending), IBM

Abstract

Parallel file systems, once peculiar to the world of engineering and scientific computing, are expanding their reach into mainstream information technology. Industry analysts predict that within the next few years the overwhelming portion of data will be in files, as opposed to raw block storage, and that the majority of it will be in scale-out systems. This talk will present GPFS, IBM's parallel file system, and discuss how it is broadening its reach from supercomputing to the mainstream commercial world. This entails not only adding new features and supporting new workloads, but also subsuming many of the functions of block storage, such as copy services and disaster recovery.


GPFS-SNC: A Scalable File System for Analytics and Clouds

Prasenjit Sarkar, Computer Science researcher and Master Inventor, IBM Research

Abstract

GPFS-SNC is a scale-out file system that leverages locally attached disks to provide high bandwidth to data-parallel applications. In this talk, I will give an overview of the internals of GPFS-SNC and show examples of its use in MapReduce, data warehousing, and cloud systems.


Linear Tape File System (LTFS)

Dr. David Pease, Senior Technical Staff Member, Manager, IBM Almaden Research Center

Abstract

While there are many financial and practical reasons to prefer tape storage over disk for various applications, the difficulty of using tape in a general way is a major inhibitor to its wider usage. We present a file system that takes advantage of a new generation of tape hardware to provide efficient access to tape using standard, familiar system tools and interfaces. The Linear Tape File System (LTFS) makes using tape as easy, flexible, portable, and intuitive as using other removable and sharable media (such as a USB drive).
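
To illustrate the point about standard interfaces: once an LTFS cartridge is mounted (for example via the LTFS FUSE utility), ordinary file APIs apply unchanged. A minimal sketch with a hypothetical mount point and file name:

    # Sketch: once an LTFS cartridge is mounted, standard file APIs
    # apply; no tape-specific calls needed. Paths are hypothetical.
    import os, shutil

    MOUNT = '/mnt/ltfs'

    # Copy a file onto tape exactly as onto a USB drive.
    shutil.copy('report.pdf', os.path.join(MOUNT, 'report.pdf'))

    # List what is on the cartridge with ordinary directory calls.
    for name in os.listdir(MOUNT):
        print(name, os.path.getsize(os.path.join(MOUNT, name)))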

Learning Objectives

  • Understand new tape hardware capabilities that enable efficient tape file system implementation.
  • Learn what LTFS is and who is behind it.
  • Understand LTFS capabilities and benefits.
  • Learn how to effectively use LTFS in larger storage solutions.

Interoperability Tools for CIFS/SMB/SMB2

Paul Long, Technical Evangelist, Microsoft

Abstract

Interoperating with Windows can be significantly simplified by learning how to use the tools and resources available to you. Tools such as Network Monitor, the protocol documentation and test suites, and Spec Explorer can help identify interoperability problems and assist in testing implementations. Resources such as Microsoft Support for protocol documentation and interoperability plugfests are invaluable for ensuring your software will interoperate properly with the latest version of Windows, in particular CIFS/SMB/SMB2 systems. This presentation will detail the available tools and resources to help you understand what is on offer.

Learning Objectives

  • Using Protocol Test Suites and Spec Explorer to improve interoperability
  • How to use Network Monitor to identify the correct protocol documentation for a trace
  • How to use Event Tracing for Windows (ETW) and NetSh to gather traces

Panel: User mode vs Kernel mode filesystem development

Panelists:
Neal Christiansen, Principal Development Lead, Microsoft
Jeremy Allison, Computer Programmer, Google
Steven Danneman, Senior Software Developer, Isilon Systems
James Cain, Principal Software Architect, Quantel Limited

Abstract

Which is better, and why? There is no single right answer, so a group of experienced developers will discuss the pros and cons and the available interfaces and libraries on different operating systems, with a lively debate about this timely topic.

Audience Participation actively encouraged!


Windows 8: Storage Provisioning and Management

Shiv Rajpal, Principal Development Lead, Microsoft

Abstract

This session presents an architecture and approach for storage provisioning and management that delivers major benefits for scalable deployments using Windows.


A Lightweight Layered Compressed File System with Hardware Acceleration

Shirish Phatak, VP of Technology, Altior, Inc.

Abstract

We will discuss a lightweight layered compressed file system that can be layered over either a Linux or Windows native file store. The compressed file system can achieve 3:1 storage efficiencies using hardware-accelerated data compression while preserving the native file system syntax and semantics. Intelligent application of hardware acceleration enables the compressed file system to run in real time without adversely impacting system throughput. By using sparse file allocation on the native file system, the compressed file system generates no additional metadata.


Windows 8 File System Performance and Reliability Enhancements in NTFS

Neal Christiansen, Principal Development Lead, Microsoft

Abstract

The NTFS file system is a robust, scalable, and performant file system that has been a foundation of Windows for many releases. With Windows 8 we have made even more improvements in these areas.

GREEN

A method to vary the Host interface signaling speeds in a Storage Array driving towards Greener Storage

Dr. M. K. Jibbe, Technical Director, NetApp

Arun Rajendran, Software Engineer, NetApp

Abstract

This paper describes a method to effectively alter the signaling speed of a host interface based on user-definable performance criteria or time-of-day criteria. The end goals are considerable power savings, achieved by dropping the signaling speed to a lower supported rate when our background study indicates the higher rate is not needed; improved component life span, by operating components at nominal speeds; and a move toward greener storage through low-power operation, reduced heat dissipation, and reduced emissions. The method also achieves demand-based host interface bandwidth allocation to balance the throughput requirements of the application.
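
The decision logic can be pictured as a threshold policy; the sketch below is purely illustrative (the supported speeds, headroom factor, and link-management hook are assumptions, not the authors' implementation):

    # Illustrative sketch of threshold-based link speed selection.
    # Speeds, headroom, and the `link` management object are hypothetical.
    SPEEDS_GBPS = [2, 4, 8]          # supported host interface rates

    def pick_speed(observed_gbps: float, headroom: float = 1.25) -> int:
        """Choose the slowest supported rate that still covers demand."""
        for s in SPEEDS_GBPS:
            if observed_gbps * headroom <= s:
                return s
        return SPEEDS_GBPS[-1]

    def renegotiate(link, target: int) -> None:
        """Try the new rate; roll back to the last working speed on failure."""
        previous = link.speed
        if not link.set_speed(target):   # hypothetical HBA management call
            link.set_speed(previous)     # exception path: restore last speed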

Learning Objectives

  • Problems and scope of the analysis and investigation (power consumption differences, power/bandwidth ratio, raw data rate, and effective data rate)
  • How the method makes optimal use of the available raw bandwidth by switching to a lower raw bandwidth when the data rate does not utilize the higher bandwidth initially on offer
  • How performance thresholds are defined based on the performance metrics the array is capable of
  • How the method delivers considerable power savings even when the system is active and performing I/Os (link speed change decision and decision logic)
  • How the method addresses exception conditions and rolls back to the last supported speed if the system cannot perform a link speed change

Vibration Management System for Storage Performance

Gus Malek-Madani, CTO and Founder, Green Platform Corporation

Abstract

Gus Malek-Madani, Founder and CTO, Green Platform Corporation, will share 3 sets of test results that demonstrate how normal levels of data center vibration can degrade IOPS and throughput performance in HDDs by as much as 2/3.  These tests also show how this lost storage performance can be restored by mitigating vibration.

Learning Objectives

  • Understand the Vibration Penalty on Spinning Storage
  • Understand how performance-killing vibration exacerbates the storage bottleneck in data centers
  • Understand the benefits of removing vibration
  • See empirical test results that substantiate the "Vibration Penalty"

Hot Topics

Advanced Format in Legacy Infrastructures – Disruptive or Transparent?
Curtis Stevens, Western Digital

Abstract

Since the launch of Advanced Format (AF) technology on hard disk drives in July 2010, many drives have been shipped and integrated using 512-byte emulation standards (AF 512e). As the industry prepares to introduce AF into long-standing legacy infrastructures, including enterprise systems, additional concerns about incompatibility and data loss have been raised. How real are these concerns? Do AF 512e and its 4K-native derivative (AF 4Kn) have negative implications for legacy infrastructures? Take this opportunity to learn about the disruptive or transparent nature of AF from industry expert Curtis Stevens.
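
Much of the 512e concern reduces to alignment arithmetic: a write that does not cover whole 4 KiB physical sectors forces a read-modify-write cycle inside the drive. An illustrative check:

    # Sketch: does a logical-block write stay aligned to the 4 KiB
    # physical sectors of an AF 512e drive?
    LOGICAL, PHYSICAL = 512, 4096
    SECTORS_PER_PHYS = PHYSICAL // LOGICAL   # 8 logical sectors per physical

    def is_aligned(start_lba: int, count: int) -> bool:
        """True when the I/O covers whole physical sectors only."""
        return start_lba % SECTORS_PER_PHYS == 0 and count % SECTORS_PER_PHYS == 0

    # A partition starting at LBA 63 (a common legacy layout) misaligns
    # every 4 KiB file system block; LBA 2048 (a modern default) is fine.
    print(is_aligned(63, 8))     # False -> read-modify-write penalty
    print(is_aligned(2048, 8))   # True  -> native-speed writes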


Programmable I/O Controllers as Data Center Sensor Networks: Build and Deliver High-Performance Network and Storage Solutions

Sanjeev Datla, Emulex

Shaun Walsh, Emulex

Abstract

Developing and deploying high-performance network and storage applications requires the right tools, right data and the right perspective to bring out maximum optimization. Maintaining that high-performance requires predicting bottlenecks and other performance issues to proactively manage them before they halt your network. Emulex programmable I/O controllers are uniquely positioned at each source and endpoint of data flows in the modern data center, and can provide proactive information to help you prevent issues and reduce downtime through a solution architected from the ground up: OneCommand Vision.

In this session, we will present a brief introduction to the evolution of the next-generation programmable I/O controllers along with a framework of tools and best practices for building, monitoring, managing and deploying host and embedded applications that maximize your network and storage capabilities. We will discuss the key areas that can sap your performance across your application configuration, driver stack, network integration and storage back-end. Additionally, we will present two case studies from our target developer program for storage partners that focus on the tools, tricks and APIs required to make your network and storage applications sing for cloud content delivery, network and storage appliances.


The Role of InfiniBand and Automated Data Tiering in Achieving Extreme Storage Performance

Cynthia Mcguire, Principal Software Engineer, Oracle

Abstract

Extreme storage performance demands a unique and innovative approach to balancing system performance and data integrity. During this presentation, we will discuss the architectural trade-offs and advantages associated with implementing extremely fast storage systems. Included in the presentation will be a discussion of the use of InfiniBand in contemporary storage and system designs, and the associated software mechanisms necessary to take advantage of very low latency protocols.

KEY NOTE AND FEATURED SPEAKERS

Linux File & Storage Systems: Enabling the Latest Storage Hardware in Linux
Ric Wheeler, File System Team Manager and Architect, Red Hat

Abstract

This talk presents an overview of the newest file and storage features developed in the Linux world and then goes into some depth about how those features get from specification to enterprise customers of Linux distributions.

Recent enterprise class versions of Linux have support for relatively new features in the storage world. This talk will give an update on the status of some of these new features, which versions of Linux support them and how best to partner with the Linux community in testing and evaluating these features. In addition, the talk will give an overview of the Linux development process for storage and file systems and a summary of current work.


The Future of File Protocols: SMB 2.2 in the Data Center
Dr. Thomas Pfenning, General Manager, Microsoft
Jim Pinkerton, Partner Architect, File Server Technologies, Microsoft

Abstract

This talk will introduce SMB 2.2, the next version of the SMB2 protocol, which has been significantly re-designed to support the levels of performance, reliability, continuous availability, scale, and features that are required for server applications. The prior focus of SMB/SMB2/CIFS was in providing Information Worker desktops with shared access to their data. This objective continues with SMB 2.2 to further reduce chattiness and improve end-user responsiveness. However, due to the customer demand for large scale shared file storage in the data center, SMB 2.2 includes major new features to address requirements for transparent failover, bandwidth scale, and application consistent backups. This talk will go through the requirements that server application workloads pose for shared file storage, and describe opportunities for the SMB2 eco-system to move forward to provide shared storage for the virtualized data center, business critical databases, web serving, and other server application workloads.


Scalable Table Stores: Tools for Understanding Advanced Key-Value Systems for Hadoop
Garth Gibson, Professor, Carnegie Mellon University, and CTO, Panasas Inc

Abstract

Inspired by Google’s BigTable, a variety of scalable, semi-structured, weak-semantic table stores have been developed and optimized for different priorities such as query speed, ingest speed, availability, and interactivity. As these systems mature, performance benchmarking will advance from measuring the rate of simple workloads to understanding and debugging the performance of advanced features such as ingest speed-up techniques and function shipping filters from client to servers. This talk describes YCSB++, a set of extensions to the Yahoo! Cloud Serving Benchmark (YCSB) to improve performance understanding and debugging of advanced table store features. YCSB++ includes multi-tester coordination for increased load and eventual consistency measurement, multi-phase workloads to quantify the consequences of work deferment and the benefits of anticipatory configuration optimization such as B-tree pre-splitting or bulk loading, and abstract APIs for explicit incorporation of advanced features in benchmark tests. To enhance performance debugging, we customized an existing cluster monitoring tool to gather the internal statistics of YCSB++, table stores, system services like HDFS, and operating systems, and to offer easy post-test correlation and reporting of performance behaviors.
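
As one example of what the extensions measure, eventual-consistency testing amounts to coordinating a writer and a reader and timing how long a new value takes to become visible. A simplified sketch of that measurement (the store client objects are hypothetical stand-ins for a table store API):

    # Sketch of the read-after-write lag idea: write on one client,
    # poll from another, report time-to-visibility. The `writer` and
    # `reader` objects are hypothetical stand-ins.
    import time

    def time_to_visibility(writer, reader, key, value, timeout_s=10.0):
        writer.put(key, value)            # write through the first tester
        start = time.monotonic()
        while time.monotonic() - start < timeout_s:
            if reader.get(key) == value:  # poll through a second tester
                return time.monotonic() - start
            time.sleep(0.001)
        return None                       # never became visible in time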


Leveraging the Cloud for Your Storage Needs
Bret Piatt, Director of Corporate Development, Rackspace

Abstract

Three out of every four U.S. CIOs surveyed by SNIA say they are already using or plan to use public cloud storage offerings. What are the factors that are leading to such a high adoption rate by today’s leaders?

The fact that some Cloud providers are able to make solutions available that previously might have taken a company’s IT staff many months to plan, finance and install has made this a no-brainer for many business leaders.

Rackspace’s Bret Piatt will discuss some of the new services and tools that Rackspace and other leading Cloud providers now offer to help companies manage their storage needs. Bret will also provide an update on OpenStack, a scalable open source cloud computing operating system supported by over one hundred leading technology companies. He will also provide an update on cloud standards adoption, including the Cloud Data Management Interface (CDMI).


Evolving Enterprise Storage Models Resulting from the Flash Revolution
Andy Walls, Distinguished Engineer and Technical Lead, IBM

Abstract

We will examine how use cases for SSDs are affecting both the adoption of SSDs and server architecture. We will discuss whether SSDs will cause a significant shift back to direct-attach storage, at least for flash. If so, we must explore how that storage will be shared, replicated, and protected. How are SSDs evolving SAN-based storage? Will the form factors for SSDs converge, or is there a place for the plethora of form factors that are appearing? We will show new use cases that exploit SSDs, and potential future ones.


Apache Hadoop Today and Tomorrow
Eric Baldeschwieler, CEO, Hortonworks

Abstract

Apache Hadoop is having a profound impact on the data industry because of its ability to store, process and analyze very large data volumes in a very cost-effective manner. Eric Baldeschwieler, CEO of Hortonworks and former VP of Hadoop Software Engineering for Yahoo! will provide some insights into how Apache Hadoop is being used today, how it fits into current enterprise data architectures and what's planned for upcoming releases.

NFS

NFS High Availability in Windows

Roopesh Battepati, Principal Development Lead, Microsoft

Abstract

This session covers advancements in high availability for the NFS file services provided in Windows Server. The discussion is centered on using multiple NFS file servers in a failover cluster. The talk will briefly cover the Failover Cluster resource model, the NFS resource DLL, and the NFS server HA architecture.


NFSv3 and SMB/SMB2 Interoperability in Likewise Storage Services

Evgeny Popovich, Senior Software Engineer, Likewise Software

Abstract

IT professionals are continually striving to reduce the management costs of storage systems and to provide seamless cross-protocol access. NFS and SMB/SMB2 deployments struggle with three common interoperability problems: how to deal with multiple directory services (NIS/LDAP/AD), cross-protocol access control, and differences in file locking semantics. Likewise Identity Services provide administrators with a means to interact with directory services, and the addition of an NFSv3 server to the Likewise Storage Services architecture makes it simple to solve the other two. The result is a storage layer that does not require user mapping, provides equal access to the same user accessing files from different protocols, works around certain protocol limitations such as the 16-group AUTH_SYS limit, and provides support for cross-protocol locking.

Learning Objectives

  • Understand NFSv3 architecture as part of Likewise File Server
  • Learn how the Likewise storage architecture helps to solve NFS & SMB/SMB2 interoperability issues, like access control and locking
  • Understand issues arising from the user-space NFSv3 driver implementation, and their solutions

IETF NFSv4 Working Group -- What's Next?

Spencer Shepler, Performance Architect, Microsoft

Abstract

With the delivery of NFSv4.1 (RFC 5661) in January of 2010, the NFS version 4 community has been busy building and delivering NFSv4.1 products.  Fresh from that experience and emerging application requirements, the IETF NFSv4 Working Group has been busy identifying features and building out a proposal for the NFSv4.2 protocol.  The attendee will be provided an insider's view of the proposed feature set, the timeline involved, and up-to-the-minute status of where the working group is headed.  And most importantly, the attendee will learn what these new NFSv4.2 features will provide for the end application.


Scale-out NAS with NFS Referrals and pNFS

Brad Stone, VP Product Management, Nexenta Systems

Abstract

This talk describes the referral features added to the NFS standard and how to take advantage of the features for  scaling out NAS deployments. This feature will be compared with Microsoft's referral-based system, DFS.  The talk will also cover pNFS, providing parallel data access for NFS clusters.

Learning Objectives

  • What pNFS is and how to contribute to its ongoing development
  • Benefits of pNFS for scaling NAS performance
  • What NFS referrals are and how they compare to DFS
  • Benefits of NFS referrals

PERFORMANCE

 

SMB2 - Advancements in Server Application Performance

Dan Lovinger, Principal Software Architect, Microsoft

Abstract

This session discusses SMB2 file services performance, focused on scenarios where the SMB2 client is running an application server workload like SQL Server. This includes extensive comparative analysis of different configurations and specific optimizations for application server workloads.


Performance Analysis of iSCSI & iSER in MPIO Environments

Seikh Basiruddin, Member Technical Staff, NetApp

Abstract

iSCSI is an emerging storage network technology that allows block-level access to storage devices, such as disk drives, over a computer network. Since iSCSI runs over the ubiquitous TCP/IP protocol, it has many advantages over its more proprietary alternatives. Due to the recent movement toward 10-gigabit Ethernet, storage vendors are interested to see the benefits this large increase in network bandwidth could bring to iSCSI. In order to make full use of the bandwidth provided by a 10-gigabit Ethernet link, specialized Remote Direct Memory Access (RDMA) hardware is being developed to offload processing and reduce the data copy overhead found in a standard TCP/IP network stack. This analysis will cover the performance benefit of using RDMA in an iSCSI environment over the normal software iSCSI stack. The presentation will also cover the benefit in single-path as well as multipath environments.
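
For readers who want to experiment, switching open-iscsi from the TCP transport to iSER is largely an interface option on the standard tools. A hedged sketch of the sequence (the portal address and target IQN are placeholders):

    # Sketch: discovering and logging in to a target over the iSER
    # transport with open-iscsi's iscsiadm. Portal and IQN are
    # placeholders; error handling is omitted for brevity.
    import subprocess

    PORTAL = '192.0.2.10'                      # documentation address
    TARGET = 'iqn.2011-01.com.example:tgt1'    # placeholder target name

    def run(*args):
        subprocess.run(['iscsiadm', *args], check=True)

    # Discover targets via the iSER-capable interface, then log in.
    run('-m', 'discovery', '-t', 'sendtargets', '-p', PORTAL, '-I', 'iser')
    run('-m', 'node', '-T', TARGET, '-p', PORTAL, '-I', 'iser', '--login')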

Learning Objectives

  • iSCSI over RDMA gives around a 20% performance advantage over non-RDMA software iSCSI
  • We used the open-source UNH iSCSI initiator and target, modified to adapt to different environments
  • Because today's iSCSI solutions cannot utilize the full bandwidth of 10-gigabit Ethernet, iSER can be used to better utilize the existing bandwidth
  • iSER can be integrated into open-source as well as closed-source multipathing solutions to achieve the required high availability

PLUGFESTS

 

Introduction to the SNIA CIFS/SMB/SMB2 Plugfest

Abstract

Every year at the Storage Developers Conference, a group of elite engineers hides out in a darkened room with long rows of tables, lots of equipment, plenty of caffeinated beverages, and a guard at the door. What's with that?

The annual SNIA CIFS/SMB/SMB2 Plugfest is an opportunity for CIFS/SMB/SMB2 implementers to test their products for compatibility, exchange results, and to work together to develop interoperable solutions. If your datacenter has a mix of products that all work together, this Plugfest is one reason why.

This session will explain how the Plugfest works, who is there, what they are testing, and how your organization can participate next year. It will also prepare you to ask questions at the Plugfest Open House reception, which follows immediately after this brief talk.

SECURITY

Using Protocol Fuzzing to Harden Storage Systems and to Protect Them from 0-day attacks

Mikko Varpiola, Senior Security Expert, Codenomicon

Abstract

Protocol fuzzing is a proactive method for discovering previously unknown flaws in software. Defects discovered through fuzzing, unless fixed, have the potential to expose affected systems to Denial of Service (DoS) situations and zero-day attacks, which could increase liability, damage business reputation, and cripple sales. This presentation explains how fuzzing can be used to harden the interfaces of a modern storage system, with hands-on examples for protocols such as SMB2 and NFSv4.
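
To give a flavor of the mutation technique, the core of a byte-flipping fuzzer is small; the toy sketch below omits the protocol models and instrumentation a real tool adds (the host, port, and seed message are placeholders):

    # Toy mutation fuzzer: take a valid protocol message, flip a few
    # random bytes, send it, and watch for the connection dying.
    # Host/port and the seed message are placeholders.
    import random, socket

    def mutate(msg: bytes, flips: int = 3) -> bytes:
        buf = bytearray(msg)
        for _ in range(flips):
            buf[random.randrange(len(buf))] ^= random.randrange(1, 256)
        return bytes(buf)

    def fuzz_once(host: str, port: int, seed: bytes) -> bool:
        """Send one mutated message; False suggests the target fell over."""
        try:
            with socket.create_connection((host, port), timeout=2) as s:
                s.sendall(mutate(seed))
                s.recv(4096)
            return True
        except OSError:
            return False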

Learning Objectives

  • Fuzzing techniques: Random, Mutation, Generational
  • Applying protocol fuzzing to storage protocols
  • Integrating security testing and fuzzing into SDLC

Adding Role Based Access Control onto a Unix Storage Platform

Steven Danneman, Senior Software Developer, Isilon Systems

Abstract

The traditional Unix authorization model defines an all-powerful root user who can perform any system task, modify any file, and change any system configuration. This simple model produces several fundamental problems for a storage platform. The root user, whether maliciously or accidentally, can cause catastrophic data loss. The root user can also view and undetectably modify the contents of any file, and thus must be an extremely trusted individual. Solving these problems requires partitioning the traditional root administrative rights among many different users and limiting, within the file system, the ability of any one user to view and modify all files. This can be accomplished with Role Based Access Control.
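
At its core, RBAC replaces the root/other binary with an explicit mapping from roles to privileges that is consulted before each administrative action. A minimal sketch of the idea (the role and privilege names are invented for illustration, not OneFS identifiers):

    # Minimal RBAC sketch: users hold roles, roles grant privileges,
    # and every administrative action checks a specific privilege.
    # All names here are illustrative.
    ROLE_PRIVS = {
        'backup_admin':   {'SNAPSHOT_CREATE', 'BACKUP_RUN'},
        'quota_admin':    {'QUOTA_SET'},
        'security_admin': {'USER_CREATE', 'ROLE_ASSIGN'},
    }
    USER_ROLES = {'alice': {'backup_admin'}, 'bob': {'quota_admin'}}

    def has_priv(user: str, priv: str) -> bool:
        return any(priv in ROLE_PRIVS.get(r, set())
                   for r in USER_ROLES.get(user, set()))

    assert has_priv('alice', 'SNAPSHOT_CREATE')   # granted via role
    assert not has_priv('alice', 'QUOTA_SET')     # no all-powerful root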

Learning Objectives

  • The fundamental problems of the Unix root/other authorization model.
  • The generally accepted abstract model of Role Based Access Control.
  • Comparison between RBAC and traditional file system authorization like ACLs
  • How administrative actions are partitioned in Isilon OneFS using RBAC.
  • Difficulties overlaying a new RBAC authorization system onto the existing Unix process and file system authorization model.

Using Self-Encrypting Storage Devices Today and Tomorrow

Tim Markey, SandForce, Inc.

Abstract

This presentation will focus on the application of self-encrypting storage. After a brief overview of SED architecture, listeners will learn how self-encrypting storage based on TCG specifications can address their security needs.

The second half of the presentation will cover possible developments of self-encrypting storage in the near future.

  • SED architecture overview
  • Applications
      • Transparent encryption with crypto erase (Opal, Enterprise)
      • Simple protection mode (Opal, Enterprise)
      • Multi-user storage (Opal, Enterprise)
      • Secure OS on TCG Enterprise drive
  • Future beyond TCG standards
      • Cache erase
      • GPIO
      • TPM
      • Tampering sensors

Learning Objectives

  • What a self-encrypting drive is and what is under the hood
  • How SEDs differ from software-based solutions in performance and threat models
  • How to use the flexibility of SEDs to address specific practical needs of organizations: no-pain encryption for device sanitization, ATA-like use, dedicated storage areas, etc.
  • What to expect from SED technology development in the near future

SOLID STATE STORAGE

HDDs and Flash Memory:  A Marriage of Convenience

Thomas Coughlin, President, Coughlin Associates

Jim Handy, President, Objective Analysis

Abstract

This talk will be based upon research by Coughlin Associates and Objective Analysis on ways in which flash memory and HDDs can be combined to create computer architectures with performance approaching that of SSDs and storage costs approaching those of  HDDs.  The talk is based upon a report from Coughlin Associates and Objective Analysis released in 2011 called "HDDs and Flash Memory:  A Marriage of Convenience" and will explore the ways that flash memory and HDDs can be combined in computers,  including tablets.  The talk will give our projections for growth of these hybrid and paired storage products and future developments.

Learning Objectives

  • Learn the ways that HDDs and flash memory can be combined in a storage system
  • Explore the advantages and uses of hybrid HDDs
  • Examine the methods of creating paired storage
  • Learn why flash memory will change the way we use DRAM in storage systems
  • Hear projections for the growth of hybrid HDDs and paired storage computers and storage systems.

Enhance NAND to Expand Enterprise SSD Market

Esther Spanjer, SMART Modular

Abstract

While early enterprise adoption of SSDs has been focused primarily on extreme performance and endurance using SLC NAND flash, proliferation into the enterprise mainstream will be fueled by the value proposition offered by lower-cost MLC NAND-based SSD solutions. Off the shelf, MLC NAND flash is not capable of meeting the endurance and retention requirements of enterprise server and storage applications. In order to meet lifetime and reliability requirements, SSD designers must use advanced technologies, have access to internal flash features, and incorporate system-level design techniques to enhance the native capabilities of MLC NAND flash. This presentation discusses methods to achieve high endurance in MLC NAND flash, including a multi-faceted approach that incorporates advanced signal processing, optimization and adaptation algorithms, and SSD architectural solutions to create compelling storage solutions that are bound to change how designers look at storage architectures.

Learning Objectives

  • Technical barriers of using MLC NAND flash in Enterprise SSDs
  • Technical solutions to enhance MLC NAND flash capabilities
  • Value proposition of MLC-based SSD in enterprise applications

Emerging Performance Tests for Solid State Storage Devices

Eden Kim, CEO, Calypso Systems, Inc.

Abstract

This session discusses emerging performance tests for NAND-flash-based solid state storage devices used in client and enterprise applications. The discussion focuses on device-level synthetic tests that are tuned to more closely reflect workload characteristics observed in real-world use cases. Difficulties in successfully capturing and replaying user workloads (I/O trace capture and playback) lead to the need for more finely tuned synthetic device-level tests that can provide repeatable and comparable performance results.

Learning Objectives

  • How to measure dimensions of SSD performance with synthetic tools
  • SSD test best practices

PCIe Solid State Storage Devices

Robert Randall, Senior Software Architect, Micron Technologies, Inc.

Abstract

The rapid pace of the evolution of storage devices is creating many new challenges in systems design and architecture.  PCIe GEN2 devices with MSI-X interrupt processing can deliver staggering throughput which can stress the traditional storage stacks on today's operating systems. There are also new standards, NVM-Express, T10 SOP and PQI, which seek to standardize the interface between the operating system and device (much like AHCI did for ATA) for PCIe attached solid state  storage devices.  Early adopters face the traditional trade-offs between legacy compatibility and new architectures which deliver higher performance but diverge from conventional thinking.

Learning Objectives

  • Understand the evolution of storage interconnects (from early parallel devices to many high speed serial).
  • Understand the emerging hardware technologies driving architecture decisions and protocol standards
  • Learn about new protocol standards from nvmexpress.org and T10.org
  • Learn about architectural challenges that PCIe storage devices pose to systems architects and storage software developers.
  • Understand today's layered storage device stacks, their cost, and the possibilities of new slim device stacks and what they offer in return for sacrificing legacy compatibility.

Smart Hybrid Storage based on Intelligent Data Access Classification

M. K. Jibbe, Technical Director, NetApp

Abstract

This paper discusses how data access needs vary with application usage and how storage IOPS (input/output operations per second) can be increased by bringing hard drives and solid state drives together into a logical volume group. In a SAN (storage area network), not every drive group is accessed all the time; drive group access varies with application demand. This varying I/O pattern is observed, and the controller firmware learns to identify the drive groups that need more bandwidth. Based on the I/O load requirement, solid state drives are attached to the drive group. This is done dynamically to improve caching at the drive group level.
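
The learning step can be imagined as firmware keeping a smoothed I/O rate per drive group and attaching SSD cache to the groups that stay hot; a hypothetical sketch of such logic (the threshold, smoothing factor, and firmware hooks are invented for illustration):

    # Hypothetical sketch of the learn-and-attach idea: keep an
    # exponentially weighted IOPS average per drive group and attach
    # SSD cache to groups that stay above a threshold.
    HOT_IOPS, ALPHA = 5000.0, 0.2
    ewma = {}            # drive group id -> smoothed IOPS

    def observe(group: str, iops: float) -> None:
        ewma[group] = ALPHA * iops + (1 - ALPHA) * ewma.get(group, iops)

    def rebalance(attach_ssd, detach_ssd) -> None:
        """attach_ssd/detach_ssd stand in for controller firmware hooks."""
        for group, rate in ewma.items():
            (attach_ssd if rate > HOT_IOPS else detach_ssd)(group)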

Learning Objectives

  • This paper classifies data access needs into three major categories: 1) mission-critical, high-performance, or sensitive data; 2) reliable data; 3) reliable and sensitive data
  • The mode of operation, in terms of: a) user classification of data; b) storage pool creation; c) the different RAID levels used; d) a data path virtualization layer to receive SCSI IOPs from initiators; e) an intelligent data-pattern learning engine with smart data access classification inside the controller firmware
  • I/O transactions using SSDs to boost disk drive group performance for different RAID levels
  • Advantages of this method: a) better reliability; b) dynamic performance boost; c) cost vs. performance advantage; d) hybrid drives with NAND flash integrated for disk caching can boost performance further
  • How the user can select data access needs and how SSDs are allocated to existing disk groups based on learn cycles from the artificial-intelligence data access engine, a.k.a. the Artificial Intelligence Data Access Classification Module

Hybrid Redundancy System: A New Approach to SSD Redundancy

Avraham Meir, CTO, Anobit

Abstract

Solid State Drives (SSDs) utilize a large number of flash devices. Flash redundancy might be needed to ensure high data reliability and availability.  Meanwhile, legacy Redundant Array of Independent Disks (RAID) implementations are often employed to ensure greater overall reliability and performance at the array system level. By combining SSD-level redundancy with RAID-level redundancy, higher overall reliability and performance can be achieved than when utilizing these techniques separately.

Learning Objectives

  • Analyze the performance/endurance/cost characteristics of legacy RAID schemes
  • Analyze the performance/endurance/cost characteristics of internal SSD redundancy schemes
  • Analyze the performance/endurance/cost characteristics of combined RAID+SSD redundancy schemes

Speeding Up Cloud/Server Applications Using Flash Memory

Sudipta Sengupta, Senior Research Scientist, Microsoft Research

Abstract

Flash is a non-volatile memory technology that sits conveniently in the huge gap between RAM and hard disk in terms of both cost and performance. With its properties of low power consumption, physical ruggedness, and small size, flash has enabled new experiences with many consumer electronic devices. However, it is only recently that flash is seeing widespread adoption in desktop and server applications, in the form of Solid State Drives (SSDs). The new applications of flash involve different storage access patterns (vs. those in typical consumer devices) and pose new challenges to flash, due to its device properties, to deliver sustained high throughput and low latency.

We advocate that innovation at the system/application software layer when using flash memory can lead to several factors of improvement in performance over simply using it as a drop-in hardware replacement for existing storage technologies. The key to deploying flash in the data center lies in (i) designing the software in a flash-aware manner so as to exploit its unique properties and work around its constraints, and (ii) identifying applications that can utilize the sweet spot between cost and performance. As an example of (i), we will present FlashStore, a high-throughput, low-latency persistent key-value store, which illustrates some guiding principles for designing software for flash: exploiting fast random reads, minimizing random writes, using RAM-space-efficient techniques to index data on flash, and recognizing its non-volatile property. As concrete examples for (ii), we will present and evaluate two cloud/server applications that can benefit from a flash-based key-value store: (a) a game state backend for Xbox LIVE online multiplayer gaming, and (b) ChunkStash, a flash-assisted inline data deduplication system.
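
These design principles can be made concrete with a toy log-structured store: writes append sequentially (avoiding random writes), a small RAM-resident index maps keys to flash offsets, and reads exploit fast random access. A simplified sketch of the pattern, not the actual FlashStore code:

    # Toy log-structured key-value store illustrating the flash-friendly
    # principles above: sequential appends, RAM index, random reads.
    import os

    class LogKV:
        def __init__(self, path: str):
            self.f = open(path, 'a+b')
            self.index = {}                      # key -> (offset, length) in RAM

        def put(self, key: str, value: bytes) -> None:
            self.f.seek(0, os.SEEK_END)          # always append: sequential write
            offset = self.f.tell()
            self.f.write(value)
            self.index[key] = (offset, len(value))

        def get(self, key: str) -> bytes:
            offset, length = self.index[key]     # RAM lookup, then one
            self.f.seek(offset)                  # fast random flash read
            return self.f.read(length)

    kv = LogKV('/tmp/log.kv')
    kv.put('player:42', b'{"score": 9001}')
    print(kv.get('player:42'))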

Learning Objectives

  • Flash memory aware software design
  • Exploit benefits of flash memory
  • Design around peculiarities of flash memory
  • Identify applications for flash memory to exploit sweet spot between cost and performance
  • New cloud/server applications for flash memory 

SSDs in the Cloud

Dave Wright, CEO, SolidFire

Abstract

This session will cover three different methods of using solid state drives to provide persistent, high-performance primary storage within the cloud. It will explain the use of solid state as cache, as a storage tier, and as a full data storage solution, covering the advantages and disadvantages of each method. The speaker will also discuss how advances in SSD technology are enabling strides in storage efficiency, as well as performance.

Learning Objectives

  • How to best leverage SSDs within a cloud storage infrastructure
  • How customers are using SSDs in the cloud today and how they are benefiting

How Scale-Up and Scale-Out Flash-Based  Databases Can Provide Both Breakaway High Performance and Breakaway High Availability for  Enterprise and Cloud Datacenters

Dr. John Busch, Founder and CTO, Schooner Information Technology

Abstract

We present emerging storage and database software technologies that provide optimal scale-up, through ultra-high flash and multi-core parallelism, and optimal scale-out, through synchronous replication. These technologies exploit commodity hardware advances to yield 10x performance and a 90% reduction in downtime, providing key new building blocks for greatly improving data center QoS and TCO.

Learning Objectives

  • Information and data management technologies: understand how highly parallel and concurrent software, coupled with advanced hierarchical storage management, exploits flash memory and multi-core processors for optimal scale-up, and cluster-wide synchronous replication for optimal scale-out
  • Scalable and distributed storage systems: understand the scalability, consistency, and availability trade-offs in scale-up and scale-out architectures, and the key enabling technologies to concurrently optimize them
  • Large data storage and management: understand how cluster-wide synchronous replication simplifies large-data multi-node cluster management by providing fully consistent data, eliminating data loss, and enabling automatic and transparent fail-over and recovery

STORAGE MANAGEMENT

“Windows Server 8” and SMB 2.2 - Advancements in Management

Jose Barreto, Principal Program Manager, Microsoft

Abstract

This session covers advancements in SMB2 file services management. This includes details on specific implementations of industry standards like Web-Based Enterprise Management (WBEM), Common Information Model (CIM) and Web Services-Management (WS-Man). It also includes discussions on management of Windows and Non-Windows systems providing SMB2 file services.


Microsoft SMI-S Roadmap Update

Jeff Goldner, Principal Architect, Microsoft

Abstract

Microsoft has been working to add SMI-S support to our products. Recent progress is evident in the announcement that System Center Virtual Machine Manager (VMM) 2012 will use industry-standard SMI-S storage providers for active management of storage arrays for configuring virtualized environments. This session will detail the progress around SMI-S support by Microsoft and discuss further work to integrate SMI-S into Microsoft’s management infrastructure.


Implementing a SMI-S Provider from Checkbox to Industrial Strength

Steve Peters, Storage Management Software, PMC-Sierra

Abstract

Data storage continues to grow at a rapid pace, and managing the data becomes increasingly challenging. Complying with SNIA's SMI-S must be more than a check box. This presentation will chronicle the development and evolution of a full-featured SMI-S provider and a web-based GUI that manage a PCIe RAID card.
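
From the client side, an SMI-S provider is exercised through standard CIM operations; a hedged sketch using the open-source pywbem library (the provider URL, credentials, and namespace are placeholders, and the class and property names follow the CIM schema):

    # Sketch: enumerating storage volumes from an SMI-S provider with
    # pywbem. URL, credentials, and namespace are placeholders.
    import pywbem

    conn = pywbem.WBEMConnection('https://smi-provider.example:5989',
                                 ('admin', 'secret'),
                                 default_namespace='root/cimv2')

    # CIM_StorageVolume is the schema class a provider exposes for volumes.
    for vol in conn.EnumerateInstances('CIM_StorageVolume'):
        print(vol['ElementName'], vol['BlockSize'] * vol['NumberOfBlocks'])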

Learning Objectives

  • Implementing View Classes
  • Implementing caches to speed access
  • Why implement indications

Proxy Providers Versus Embedded Providers (SMI-S)

Srinivasa Reddy Gandlaparthi, Software Architect, NetApp, Bangalore India

Abstract

The implementation of SMI-S providers for managing storage arrays or controllers often involves selecting the type of provider (proxy or embedded). This presentation compares the advantages and disadvantages of proxy and embedded providers, design considerations for selecting one of them, and various methods of implementing embedded and proxy providers. Issues in managing large numbers of objects, association traversal issues, and the capabilities providers typically need in order to overcome them are also discussed, as are management client design considerations when managing proxy and embedded providers.

Learning Objectives

  • Design considerations for hardware vendors selecting proxy or embedded SMI-S providers
  • Resolving issues in managing large numbers of objects in proxy providers
  • Management client design considerations when managing proxy and embedded providers

TESTING

RESTful Fault Injector

Jim Rice, EMC

Abstract

With the advent of cloud storage, REST (Representational State Transfer) is becoming the common method for accessing and utilizing cloud storage systems. This presentation explains a product development accelerator that can help reduce the cycle time in developing REST-based cloud storage and ISV applications by using a fault injector. A RESTful fault injector helps ISVs develop reliable applications for cloud storage. The tool facilitates simulation of error scenarios by injecting REST and HTTP error codes to validate the application's handling of those errors. This presentation explains the high-level approach, the implementation details on Windows, tuning to enable usage of this accelerator across various cloud storage systems, and configurations to simulate different error scenarios. The details of the implementation are elaborated using EMC Atmos cloud storage as an example.
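
The heart of such a tool can be pictured as a small HTTP shim that answers a configurable fraction of REST calls with injected errors; the bare-bones sketch below is illustrative, not the tool described (the error mix and port are assumptions):

    # Bare-bones fault-injection shim: with probability P_FAULT, answer
    # a REST call with an injected HTTP error instead of real storage.
    import random
    from http.server import BaseHTTPRequestHandler, HTTPServer

    P_FAULT = 0.3
    FAULTS = [(500, 'InternalError'), (503, 'SlowDown'), (404, 'NoSuchKey')]

    class Injector(BaseHTTPRequestHandler):
        def do_GET(self):
            if random.random() < P_FAULT:
                code, reason = random.choice(FAULTS)
                self.send_error(code, reason)      # injected failure path
            else:
                self.send_response(200)            # stand-in success path;
                self.end_headers()                 # a real shim would proxy
                self.wfile.write(b'ok')            # to the backing store

    HTTPServer(('localhost', 8080), Injector).serve_forever()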

Learning Objectives

  • Gain knowledge about cloud storage testing and debugging
  • Usage scenarios of the error injector to reduce development time
  • HTTP responses and error codes returned by the REST API

Challenges of Testing Unified Storage

Peter Murray, Product Manager, SwiftTest

Abstract

Storage systems that support multi-protocol file, block, and object storage are challenging to test. These systems are more complex to test than single-access-method systems. Both functional and load testing require new strategies to ensure correct operation and performance.

Learning Objectives

  • Functional testing challenges
  • Load/performance testing challenges

An Extensible Open-Source ISCSI & SCSI Test Tool

Richard Sharpe, Architect, Scale Computing

Abstract

Testing iSCSI targets can be difficult because teams often have to rely on existing, OS-based initiators. Testers could be more productive if an OS-agnostic synthetic initiator were available. This talk presents an extensible, open-source framework for building synthetic SCSI initiators based on iSCSI. It provides ways to build iSCSI-based tests as well as SCSI-based tests. At the lowest layers it uses Ronnie Sahlberg's libiscsi toolkit, and it uses C++ to build an iSCSI transport layer along with SCSI request classes that can be easily extended. Examples of extending the basic SCSIRequest class are given along with test code that uses it. The source code is available under the GPL and could be extended to SAS and Fibre Channel.
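
The extension pattern described, specialized request classes over a common transport, looks roughly like the outline below, transcribed to Python for brevity (the framework itself is C++, and only the SCSIRequest name comes from the talk; the rest is illustrative):

    # Outline of the extension pattern: a base request handles CDB
    # framing; subclasses fill in opcode-specific fields. Python is
    # used here for brevity; the real framework is C++.
    class SCSIRequest:
        def __init__(self, opcode: int):
            self.cdb = bytearray(16)
            self.cdb[0] = opcode

        def to_bytes(self) -> bytes:
            return bytes(self.cdb)

    class Read16(SCSIRequest):
        def __init__(self, lba: int, blocks: int):
            super().__init__(0x88)                       # READ(16) opcode
            self.cdb[2:10]  = lba.to_bytes(8, 'big')     # logical block address
            self.cdb[10:14] = blocks.to_bytes(4, 'big')  # transfer length

    # A test then hands the CDB to the iSCSI transport layer for delivery.
    print(Read16(lba=0, blocks=8).to_bytes().hex())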

Learning Objectives

  • Testing iSCSI/SCSI targets with synthetic initiators
  • Using open-source tools in testing
  • Automated testing of scalable storage

VIRTUALIZATION

Advancements in Hyper-V Storage

Todd Harris, Sr. Software Development Engineer, Microsoft

Senthil Rajaram, Senior Program Manager, Microsoft

Abstract

Hyper-V is a virtualization solution included as part of Windows Server 2008 and Windows Server 2008 R2. It provides the ability to expose virtual storage to a virtual machine in a number of different ways, including the use of Virtual Hard Disk (VHD) files. The talk will include discussions of different storage configuration options, workloads, and performance for Hyper-V.


Supporting Virtualization and Large workloads on NAS Storage

Dennis Chapman, Senior Technical Director, NetApp

Abstract

This presentation examines the hosting of enterprise-level hypervisor and application workloads on storage provided by NAS servers. It will present a brief overview of the two main file protocols, NFS and CIFS/SMB, followed by a discussion of the use of NAS storage by a hypervisor and its guests, and then by a large database. It will also discuss configuring the namespace to more efficiently support hypervisor and application workloads, the use of array value-add features such as snapshots, dedup, or cloning with NAS, and, finally, future trends in this area.

Learning Objectives

  • Basic understanding of current NAS protocols
  • Unique value NAS protocols provide for virtualization workloads
  • Future trends in support of these workloads using NAS

Benefits of ARI support in Virtualization

Sivakumar Subramani, Senior Project Leader, Wipro Technologies

Abstract

As per the PCI specification, a single physical adapter can support only up to eight individual functions, because only three bits are allotted for identifying a function in the BDF (Bus/Device/Function) value used to refer to any PCI device. The PCI-SIG has come up with a new method called ARI (Alternative Routing-ID Interpretation) to interpret the Device Number and Function Number fields within Routing IDs, Requester IDs, and Completer IDs, thereby increasing the number of functions that can be supported by a single device. ARI enables next-generation I/O implementations to support an increased number of concurrent users of a multi-function device while providing the same level of isolation and control found in existing implementations. While ARI obviously benefits virtualized operating environments, where each function can be uniquely assigned to a guest OS, it also benefits non-virtualized environments where, e.g., due to process improvements, a large number of I/O functions can be integrated into a single device. ARI is used by both multi-function adapters and SR-IOV (Single Root I/O Virtualization) to support more functions on a single physical device. This paper will analyze the benefits that can be achieved by using ARI in multifunction and SR-IOV configurations (in virtualized environments such as KVM and VMware).
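
The change is easiest to see in the Routing ID bit layout: classically 8 bus, 5 device, and 3 function bits, while ARI reinterprets the device field so a single device can expose up to 256 functions. A small sketch of the two decodings:

    # Decoding a 16-bit PCIe Routing ID with and without ARI.
    # Classic: 8-bit bus | 5-bit device | 3-bit function  (max 8 functions)
    # ARI:     8-bit bus | 8-bit function                 (max 256 functions)
    def decode_classic(rid: int):
        return (rid >> 8) & 0xFF, (rid >> 3) & 0x1F, rid & 0x07

    def decode_ari(rid: int):
        return (rid >> 8) & 0xFF, rid & 0xFF

    rid = 0x0150                      # same Routing ID, two readings
    print(decode_classic(rid))        # (1, 10, 0) -> bus 1, dev 10, fn 0
    print(decode_ari(rid))            # (1, 80)    -> bus 1, fn 80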
