Internet DRAFT - draft-barkai-scmp
draft-barkai-scmp
S. Barkai
Internet Draft G. Stupp
draft-barkai-scmp-00 Sheer Networks
Expires: 1 July 2003 30 December 2002
Service Centric Management (SCM)
Status of this Memo
This document is an Internet-Draft and is in full conformance with
all provisions of Section 10 of RFC2026.
Internet-Drafts are working documents of the Internet Engineering
Task Force (IETF), its areas, and its working groups. Note that other
groups may also distribute working documents as Internet-Drafts.
Internet-Drafts are draft documents valid for a maximum of six months
and may be updated, replaced, or obsoleted by other documents at any
time. It is inappropriate to use Internet-Drafts as reference mate-
rial or to cite them other than as "work in progress."
The list of current Internet-Drafts can be accessed at
http://www.ietf.org/ietf/1id-abstracts.txt
The list of Internet-Draft Shadow Directories can be accessed at
http://www.ietf.org/shadow.html.
Copyright Notice
Copyright (C) The Internet Society (2002). All Rights Reserved.
Abstract
With the proliferation of IP based internetworking services to mass
business and consumer markets carriers are faced with operational
challenges in an extent never before experienced in previous LAN MAN
or WAN environments. To accommodate these new internetworking envi-
ronments, carrier workflow management applications for service Ful-
fillment, Assurance, and Billing (FAB) need to undergo major a tran-
sition. In essence applications need to transition from being per
Barkai [Page 1]
Internet Draft December 2002
technology centric to being service centric, as historic correspon-
dence between technology and service is no longer true. Applications
do not need to change in nature, since they still need to support
inventory, CRM, order and other workflows. However, applications
need to include a service centric aspect as basis for these work-
flows.
Service Centric solutions can be shared by a wide range of management
and workflow applications, rather than be re-invented by each one.
Service Centric Management solutions must be distributed in order to
cope with the massive scale and complexity challenges of the underly-
ing networks. Service Centric Management Protocol (SCMP) and distri-
bution should be standardized to facilitate multi-vendor interoper-
ability in this emerging space, allow for potential future embedding
within NE, and to allow for Service Centric Management primitives to
extend across Inter Carrier Interfaces (ICI) and inner carrier regu-
latory bounds.
1. Introduction
With the proliferation of IP based internetworking services to mass
business and consumer markets carriers are faced with operational chal-
lenges in an extent never before experienced in previous LAN MAN or WAN
environments.
In order to deliver end-to-end services carriers are extending millions
of access lines - DOCSIS,DSL, or GPRS, aggregating them using ATM and
Ethernet domains, and transporting them over WDM, SONET, and MPLS back-
bones.
A wide range of tunnelling protocols such as PPP, L2TP, and IP-Sec are
being used to map packets in and out of access, edge and core technology
domains.
For example, millions of mobile wireless data services are implemented
using BSC based GPRS, connected to DSL multiplexers, connected to ATM
switches, LAC-LNS BRAS aggregators, PE routers, P core routers and opti-
cal backbones. In addition SNCDP, GTP, and LT2P tunnels are extensively
used across these various domains and thus add to the diversity and com-
plexity of the network.
To accommodate these new internetworking environments, carrier workflow
management applications for service Fulfillment, Assurance, and Billing
(FAB) need to undergo major a transition. In essence they need to tran-
sition from being per technology centric to being service centric.
Typical carrier management and workflow applications include:
Barkai [Page 2]
Internet Draft December 2002
o Inventory Management
o Trouble Ticketing Management
o Order Management
o SLA Management
o etc.
Since most services are becoming cross-domain and multi-technology, car-
rier applications can no longer relay on the historical correspondence
between service and technology. Applications do not need to change in
nature,
since they still need to support inventory, CRM, order and other work-
flows. However, applications need to include a service centric aspect
as basis for these workflows.
Service Centric Management aspects and basis include:
o Comprehensive End-To-End Service Resources Reconciliation
o Consistent End-To-End Service Configuration and Activation
o Correlated End-To-End Service Monitoring and Detection
The transition of carrier management and workflow applications from
being Per Technology to being Service Centric poses great challenges. It
has been proved exceedingly difficult to implement management systems
that include service centric management aspects and withstands the huge
pressures of end-to-end network mass and complexity.
We claim that there is a set of challenges shared by most management
applications. The absence of a general solution causes fragmentation,
loose integration, performance compromises, technology stove piping,
fragile structures and unprofitable carrier operations.
OMS CRM TTM
|\ /|/ /|
| \/ / \/ |
| /\/| /\ |
|/ /\|/ \|
TDM ATM IP
Figure 1: Technology centric integration mesh is expensive
and does not reflect IP over ATM over TDM service.
Barkai [Page 3]
Internet Draft December 2002
2. Service Centric Management Fabric (SCMF)
In this paper we suggest that a comprehensive end-to-end service centric
management basis over large diverse networks, by a range of carrier man-
agement applications, is attainable. We suggest that current compro-
mises are not a necessary evil, and that standard Service Centric solu-
tions can be shared by a wide range of management and workflow applica-
tions, rather than re-invented by each one.
We also suggest that shared and standard Service Centric solutions need
to be distributed. Just as we don't expect a single server to provide
IP connectivity to millions of users (we distribute the task to switches
and routers), the task of comprehensive service centric end-to-end func-
tions can only be performed by a distributed Service Centric Management
Fabric (SCMF). The pressures of end-to-end complexity and scale are
simply to great for any centralized entity.
The SCMF is therefore realized by multiple Service Centric Management
Units (SCMUs), whose number is proportional to the size of the network.
By employing Service Centric Management Protocol (SCMP) the SCMUs form a
scalable-shared SCMFabric.
The SCMF is not intended to be a FAB management application. Rather, it
offers a set of primitives that can be used by a wide range of manage-
ment applications. By offering these primitives the SCMF decouples the
overwhelming burden of network size and complexity from management
applications.
In a reference SCMP implementation existing management systems such as
trouble ticketing, CRM, inventory, and order management systems where
successfully integrated. Moreover, 3rd parties developed new and unan-
ticipated applications as well, taking advantage of the same SCMF primi-
tives.
Any standard, application oriented, north bound API such as TMF 608/513
can be used to access the SCMF primitives. Further discussion on this
topic will be done in a separate paper. South bound manager-agent ori-
ented API are used between the SCMF and the network elements. These can
be standard SNMP and/or CLI commands used extensively today.
OMS CRM TroubleTM InventoryM SLAM ..
| | | | |
______________________________
| SCMF |
|____________________________|
| | | | |
BBand TDM ATM IP VoXX
Barkai [Page 4]
Internet Draft December 2002
Figure 2: Service Centric Management Fabric.
North clients server primitives API. South manager agents FCAPs API.
3. Service Centric Management Primitives
The SCMF offers a set of primitives that support a wide range of service
management and FAB applications, existing and new ones. It is expected
that additional primitives will be evolving as time progresses and addi-
tional application bottlenecks can be factored into the distributed
SCMF.
To date we have identified the following primitives:
Reconciliation - The discovery and up-to-date knowledge of all existing
network resources and services configured in the Network Elements (NEs).
This includes:
o All NE interfaces such as Ethernet, SONET, ATM, IP etc.
o Interface stacking such as IP over ATM over SONET over WDM over
fiber, or IP over HDLC over fiber.
o All forwarding entities such as multiplexing, switching, and rout-
ing, dot1D, BGP, NNI etc defined over the interface stacks. For
example, bridging between 5 stacks, routing between bridges and 2
other stacks.
o All cross NE relationships in the network, e.g., far-end interfaces
and next hop routing or bridging, in every interface and every
layer. We call this generalization of the physical topology the
interoperability topology of the network.
Reconciliation is a continues process as it needs to reflect in near-
real-time HW and SW configuration changes in the network.
Activation - The design, assign, and configuration of differentiated
services in the network. Service profiles may specify any aspect, stan-
dard or proprietary, of any interface and any forwarding entity, in any
of the NEs along the calculated path. The calculation of optimal path
and service profile constraints is performed in near-real-time. Activa-
tion can be invoked in parallel in multiple locations in the SCMFabric.
Integrity must be maintained especially in cases where multiple
Barkai [Page 5]
Internet Draft December 2002
activated paths are crossed asynchronously.
Detection - The sensing of network operation conditions. The root isola-
tion of a malfunction, disconnection or overload failures. Data path
tracing of the fault to its side-effect and propagation to impacted ser-
vices. The suppression of side-effect alarms and bundling of ripple
impacts with the root failure without manual rule programming or intro-
duction of network foreign computation at the SCMF level.
4. Service Centric Management Units (SCMUs)
SCMUs play mostly an administrative roll in the SCM solution. Their
main function is to hold a set of SCMAgents, which communicate via SCMP
within the SCMU or between the SCMUs. The SCMU main task is to support
the SCM hardware workstation platforms, allow for administrative IP or
IP-Sec connectivity and allow for SCMF general administration.
In reference implementations the SCMUnits are extremely important for
maintaining solution's software versions, security, density, and redun-
dancy. These mechanisms will be discussed in a separate context. In the
reference SCMP implementation we bundle about 1000 SCMAgents per SCMUnit
covering about 100K ports per unit.
In reference implementations we also find it best to limit direct IP or
IP-Sec SCMU administrative connectivity to about 10 SCMUnits (about 10K
NEs, or 1M ports). Beyond this number it is best to have "Core" SCMUs
relay SCMP messages to "Edge" SCMUs. For example for a 10M port solution
it is best to mesh 10 core SCMUs each connected to an edge group of
additional 10 SCMUs, and so on.
North client-server interface
|
_|____ _______ ____________________
| | | | | |
| <SCMP> <SCMP> SCMA <SCMP> SCMA |
| SCMU | | SCMU | | SCMU |
|______| |______| |___________________|
|
|
South manager-agent interface
Figure 3: SCMFabric
Barkai [Page 6]
Internet Draft December 2002
5. Service Centric Management Agents (SCMAs)
The SCMAgents are the logical makings of the SCMFabric. Each SCMAgent
may cover a single network element, where legacy domain managers may
function as an SCMA for the entire domain. The SCMA coverage is for the
purpose of end-to-end service management functionality only and does not
need to cover the complete NE device management functionality.
SCMAs are also referred to as autonomous agents since like routing
agents their primary communication is with other similar SCMAgents and
not with a manager.
The SCMAgents have the following states while operating as part of the
SCMFabric:
o Initialization State - When the (right) SCMAgent initializes, it
uses standard CLI and/or SNMP to learn the interfaces, stacking,
and forwarding configuration of the NE. (right SCMAgent determined
by SCMU).
This step is repeated whenever a configuration change is detected.
Each interface is assigned a SCMXID, which must be unique within
the entire SCMFabric. In the reference SCMP implementation we use
an algorithmic method to produce SCMXIDs and we use a graph for
representing interface, stacking, and forwarding NE relationships
in the SCMAgent. This implementation is optional.
_ _
------|R|-|_|
| |
_ _ _ _
|_|-|B|-|_| |_|
| | |
_ _ _
|_| |_| |_|
Figure 4: Example SCMA representation of Bridging 2 stacks,
and routing bridge to 3rd stack
o Acquaintance State - after it initializes, the SCMA is part of the
administrative network of the SCMFabric. However, in order to
Barkai [Page 7]
Internet Draft December 2002
function and help perform most of the SCMF primitive functions, it
also needs to become part of the interoperability network of the
SCMFabric. The interoperability network of the SCMFabric follows
the interoperability topology of the various managed network lay-
ers.
To become part of the interoperability network, the SCMAgent com-
putes a unique signature for every interface that may have a peer
in another network element, and as consequence, in a peer SCMA.
There may be multiple signature types (SCMSigT) for any given type
of interface (IANA). The computation of signatures may involve pro-
prietary intellectual property that needs to be shared by interop-
erating manufacturers of SCMus.
Trivial signatures such as local and remote interface addresses
carried by ILMI, BGP, or CDP will be specified in a different con-
text.
After SCMXID, SCMSigT, and SCMSigValue are published (on the admin-
istrative SCMF Network) and discovered by a remote SCMA, the SCMA-
gent becomes topologically connected to the acknowledging remote
SCMA, and joins the interoperability SCMF Network. This connection
holds until network conditions change or interfaces in the NEs are
disabled, removed or disconnected.
Each SCMA may be connected to single/multiple SCMAs via single/mul-
tiple interfaces and SCMXIDs.
_ _ _ _ _
------|R|-|_| ................... |_|-|R|-|_|
| | | |
_ _ _ _ _ _ _ _ _
|_|-|B|-|_| |_| |_|-|S|-|_| |_| |_|
| | | | | | |
_ _ _ _ _ _ _
|_| |_| |_| ... |_| |_| ... |_| |_|
Figure 5: Example 3 SCMAs Acquaintance(doted) Brouter, Switch, Router
o Flow State - In this state the SCMA reacts to messages received via
the interoperability and administrative links from the agent to the
Barkai [Page 8]
Internet Draft December 2002
rest of the fabric. The SCMA is expected to receive flow messages,
infer their local significance and local service management
required actions, and terminate or pass it to single/multiple peer
SCMAs.
flow>>
<<flow
<------->SCMA <---------------------->SCMA
| Interoperability Network|
| |
Administrative Network |
| |
| |
.........SCMF ........................SCMF
Figure 6: SCMAs in the administrative and interoperability networks.
Activation, detection, and reconciliation queries generate multiple
types of flows across the SCMFabric. For example an activation flow
of find-best-path may be received by multiple SCMA interfaces and
routed to other SCMAs with a local calculation.
Similarly a detection flow may be received indicating a primary or
secondary fault in a peer SCMA, impact further calculated by local
SCMA, and terminated or routed to other SCMAs.
Specific flows, as well as states that avoid loops or perform split
horizon or other routing methodologies, are discussed separately.
Activation flows are typically initiated by SCMAgents covering the
end-points of a service. These flows run through the SCMF core
along the topological links between SMCAgents. They will typically
include a find stage and than reserve and commit stages.
Detection flows are typically initiated from the SMCAs sensing a
root fault and are propagated up the stacks, across forwarding
entities to Peer-Agents and finally to the end-points of the
impacted service. They typically include a verification, expand
and regress stages.
Detailed discussion of flow states and message formats will be done
separately. Experience in reference SCMP implementation shows that
activation flows, due to their parallel and non blocking nature,
have managed to reach the full capacity of the network to absorb
configuration. Likewise, detection flows yielded 1000s to 1 ratio
of suppression between root fault and impacted services and side-
effects. Again, due to their parallel nature, they have been able
to absorb all the events the network can generate.
Barkai [Page 9]
Internet Draft December 2002
_ _ _ _ _
Service <<<<<|_|<|_| |_|>|_|>|_|>>>Service
| ^ ^ |
_ _ _ _ _ _ _ _ _
|_|-|_|-|_| |_| |_|-|_|-|_| |_| |_|
| | ^ | | ^ |
_ _ _ _ _ _ _
|_| |_| |_|<<<<<|_| |_|<<x>|_| |_|
Figure 7: Example SCMA Operation in SCMF x Detection Flow
_ _ _ _ _
Service >>>>>>|_||_| |_|>|_|>|_|>>>Service
^ V ^ |
_ _ _ _ _ _ _ _ _
|_|>|_|-|_| |_| |_|-|_|-|_| |_| |_|
^ | V | | ^ |
_ _ _ _ _ _ _
x>>|_| |_| |_|>>>>>|_| |_|>>>>|_| |_|
Figure 8: Example SCMA Operation in SCMF Activation x Flow
6. Service Centric Management Protocol Messages
SCMP messages are received by the SCMAs from other SCMAs on the SCMF
interoperability network formed by acquaintances. SCMP messages are
also received by SCMAs on the SCMF administrative network. The latter
are typically queries and flows initiated by the SCMF application server
adaptation.
In order not to limit the evolution of existing queries and flows and
creation of new types of queries and flows, as well as not to impound
the autonomous nature of each SCMA behavior, at this stage a relatively
broad SCMP message frame is laid out.
SCMP Message{
SCMXID source
SCMXID dest
SCMXFT flow type
SCMXFT flow subtype
SCMXVB variable bind list }
Barkai [Page 10]
Internet Draft December 2002
When flowing over the SCMF topological network the source and dest field
are required to correspond to the SMCXID of the acquainted interfaces of
the SCMAs. These are recognized by the SCMF transport and delivered to
right SCMAs.
In order not impose entire reference implementation of SCMP prior to
further analysis flow types will be loosely described.
o Flow type: Acquaintance.
Subtypes: Publish, Acknowledge, Hand-shake.
SCMA required to:
- Publish local SMXID Interfaces Signatures.
- Acknowledge matching Signature with local interface SCMXID.
- Hand-shake and acknowledgement to establish acquaintance.
o Flow type: Reconciliation Query
Subtypes: Relational expression.
SCMA required to:
- Perform local part of query
- Adjust and augment variable binding
- Pass to other acquainted SCMAs
o Flow type: Activation
Subtypes: Find, Reserve, Commit
SCMA required to:
- Address service profile and constraints in variable bind list
- Forward augmented path metrics with added local costs
- Forward augmented path metrics with local reservation
- Forward augmented path commits after local NE configuration
o Flow type: Detection
Subtypes: Verify, Expand, Retract
SCMA required to:
- Verify sensed flow not side-effect with acquainted SCMAs
- Expand impact propagation to acquainted SCMAs
- Retract back to root retracting flow form end-points
Further discussion on SCMP message syntax and semantics will be done
after further standard analysis.
7. Service Centric Management Fabric Benefits
By supporting the above primitives in a distributed manner the SCMF sig-
nificantly improves the performance, and simplifies the integration of
Barkai [Page 11]
Internet Draft December 2002
service management applications.
Operations done centrally today at sequential S-Time, are performed by
the SCMF in parallel P-Time. The theoretical asymptotes for network of
size N with proportional activation and fault rates are:
o Reconciliation: From S-Time N Log N , to P-Time Log N.
o Activation: From S-Time N Log N, to P-Time Log N.
o Detection: From S-Time N Log N to P-Time Log N.
In practice, because of high log basis (log N reflects the number of
hops in the network, which is usually 4-8) and structural constraints of
existing centralized systems it follows that the performance improvement
is much higher. In practice we get:
o Reconciliation N Resources: S-Time N Square, Fixed P-Time.
o Activation N Services: S-Time N Square, Fixed P-Time.
o Detection N Impacts: S-Time N Square, Fixed P-Time.
As an example, verification of a VPN with 2000 sites for connectivity
and leak security (each peer test 1 sec.) is completed by the reference
SCMF within minutes while a naive centralized test may take days.
The additional benefit is the partition of mass complexity into a set of
well defined and contained smaller problems with simpler solutions that
emerege into a service wide solution.
8. Service Centric Summary
In today's networks, where services are not single technology based and
their number rises to millions per network, new standard based protocol
can alleviate significant operational pain-points and allow for a shared
distributed solution to till now unsolved problems.
The standardization process is aimed to build detailed supported indus-
try basis for a distributed service centric management methodology. The
Service Centric Management Protocol (SCMP) should be standardized to
facilitate multi-vendor interoperability in this emerging space, to
allow for potential future embedding within NE, and to allow for Service
Barkai [Page 12]
Internet Draft December 2002
Centric Management primitives to extend across Inter Carrier Interfaces
(ICI) and inner carrier regulatory bounds.
The SCMP framework adopts scalable concepts from network control planes
such as, "Hello" routing packet, "RSVP" activation packets, and OAM
cell flows. The SCMP applies these methodologies
to generate uniform fabric for end-to-end service management that acts,
"thinks", and scales like a network. The outcome is therefore capable of
dealing with the huge operational challenges that today's networks cre-
ate.
9. AUTHOR ADDRESS
Sharon Barkai
Sheer Networks
Azrieli Center I Round Tower
132 Petach Tikva rd.
Tel Aviv, 67021 ISRAEL
Tel. +972-3-6074111
Fax. +972-3-6074112
EMail: Sharon_Barkai@sheernetworks.com
Barkai [Page 13]