Introduction

UML Class diagrams are one of many UML diagrams for describing various aspects of a software, data, business, or enterprise architecture. They are used for a number of purposes including object-oriented software design, business data models, and conceptual data models. In this course, we are using a subset of the UML Class diagram symbol set and modeling elements to visualize domain ontologies and conceptual data models.

Overview

Watch the narrated tutorial below before proceeding the the remaining lesson in order to can get context.

Relationships

Class diagrams show classes (entities of a domain), their attributes, and their relationships. UML distinguishes among several kinds of relationships, including four which are of particular interest for conceptual modeling: association, aggregation, and composition, and generalization. Additionally, there are implementation and dependency relationships, but those are of primary interest to software architects building object-oriented designs – we will omit those two relationships.

Association

An association is a domain-level connection between two classes where neither is dominant over the other. Associations can have an optional text label that explains the nature of the association. Additionally, all associations must be multiplicity constraints on both ends of the relationship.

A multiplicity constraint is in the form of n..m where n and m are the lower and upper bounds, respectively, and n ≤ m, and n ≥ 0 and m ≥ 1. A bound can be a specific scalar value, e.g., 1, 5, 52.

If the upper bound is not known or there is no specific upper bound then it can be left unspecified using *. If the lower and upper bounds are the same, a single bound suffices. So, 1..5, 0..*, and 52 are valid multiplicity constraints.

Note that * by itself is not considered an appropriate multiplicity constraint as there is no explicit lower bound. However, according to the UML Specification 2.5.1, * by itself implies a lower bound of 0, so * is the same as 0..*.

The diagram fragment below shows an example that includes associations only, along with labels, reading directionality indicator (►), and multiplicity. The reading direction is generally left-to-right and top-to-bottom but the directionality indicators allow for free-form placement of modeling elements.

The line itself should not have arrows as associations are bidirectional and equally valid in both directions. The arrow on an association line is useful for database design or software design to indicate which object has a reference (or pointer or foreign key) to the other object, but that is not appropriate for conceptual models as such models are intended to be implementation agnostic.

The UML Class Diagram fragment below illustrates these modeling elements.

The diagram has three classes: Passenger, Ride, and Driver. They are linked – or associated – and modeled with an association relationship. A passenger takes zero or more rides while a ride is for one or at most two passengers. The ride is driven by exactly one driver and a driver can drive any number of rides, including none.

Association Tutorial

In this short tutorial, Khoury Boston’s Prof. Schedlbauer goes through an example on how to define an association, label, directionality, and multiplicity.

He also explains the difference between association and aggregation.

Reading Multiplicities

Multiplicities are constraints or rules defined by the business domain and they can be different for the same pair of classes in different domains. In the above UML fragment, the multiplicity constraint between “Passenger” and “Ride” should be read as follow:

Any passenger may take several rides (the * in 0..*). Some Passengers do not take any rides (the 0 in 0..*). A ride is taken by up two two passengers – or – A ride is taken by no more than two passengers. Every ride must be taken by at least one passenger.

Multiplicities as Object Set Mappings

To establish a multiplicity, you take “one” instance from one class and ask to how many of the other class it is connected to, and vice versa. We are essentially defining a mapping between two sets of objects. A class can be thought of as a set of objects and we are building a mapping between the two.

The short tutorial below provides additional insight into how these mappings are established.

Aggregation

An aggregation relationships is a special form of association where one class in the relationship is the aggregate (collection) of the other class. This is an anti-symmetric whole-part relationship. Since an aggregation is an association, all of the properties of an association also apply to an aggregation.

Composition

A composition is a stronger and more restrictive form of aggregation. All of the properties of an aggregation also apply to composition. And, since aggregation is a special form of association, then by the rules of transitivity, all properties of associations also apply to composition. While a composition has a label, it is often omitted as the implied label is “contains” or “has a” and therefore does not add any additional insights.

Composition is also often used to imply that a “part” cannot exist independently of its “whole” or its container. For example, a room cannot exist independent of a house, so there’s a composition relationship between house (container) and room (part).

Of course, depending on the domain and its rules, the diagram might be different. For example, in the above diagram, it implies that a house can only have one garage, although the garage can have several spots in which to park cars. Naturally, there might be houses that have multiple, independent garages, but we are not modeling that.

Generalization

A generalization defines a class hierarchy indicating that a subclass is a special type of the superclass. All attributes applying to the superclass also apply, by definition, to the subclass. Generalization is a generalization-specialization taxonomy. This relationship is often implemented in object-oriented programming languages with inheritance.

In UML, a generalization is visualized with a hollow area pointing from the specialized class (subclass) toward its generalized class (superclass). We often use the phrase “is a type of” or “is a kind of”.

Generalization relationships are relationships between classes and not mapping between sets of objects, so there is no multiplicity or label. The diagram below illustrates a generalization hierarchy.

It shows that there are two “special” types (kinds) of members: honorary members and officers. Both are members and have all of the attributes and properties of a member but they each have unique properties just applicable to them. We often say that the subclass “inherits” the properties of its superclass, so in the above example, “Officer” inherits the attributes of “Member”, which means that an “Officer” has a name and a date joined and additionally has a role attribute – an attribute that only officers have and not all members in general.

Honorary members and officers can also be classified as members, but honorary members are a subset of all members; hence the term subclass for “Honorary Member” and for “Officer”. And all members is a superset of all special types of members; hence the term superclass for “Members”.

Object-Level Relationships

Associations, and implicitly, aggregations and compositions, are relationships among objects. Generalization is a taxonomy of classes and because of that multiplicity constraints do not apply to Generalization. One can look at classes as set of objects, e.g., class A is the set of all objects that have the structure, attributes, and properties of A. Objects can be viewed as instances of classes having specific values for the attributes and properties but conforming to all of the constraints defined by the class to which the objects belong.

UML Meta Model

The relationships and their “relationships” can be summarized in the UML meta model below.

Goal of Modeling

A common refrain among those new to conceptual and business data modeling is that since the implementation of association, aggregation, and composition doesn’t differ in a relational database or most object-oriented programming languages, then why does one care. The answer is simple: because the relationships exist in the business domain and therefore must be expressed. They express constraints and properties of the objects in the business domain. Just because some databases or programming languages are inadequate does not mean that we should ignore the constraints that are present.

Aggregation and Composition Patterns

There are several common “patterns” when modeling that are best expressed as either aggregation or as composition. For all aggregations and compositions, the aggregate/composite must in some way depend functionally or structurally on its components.

The relationships of aggregation and composition are always anti-symmetric and transitive, i.e., if A contains B, then B cannot contain A, and, if A contains B and B contains C, then A contains C.

Assembly-Parts Aggregation

An assembly-parts aggregation where a whole (assembly) is comprised of parts that retain their identity when they are contained in the whole. The whole does not cease to exist when parts are removed. Multiplicity constraints are used to specify whether the assembly must contain specific numbers of a part or whether the part of option. Many manufactured items are assembly-parts aggregations.

Of course, in the above example, engine would also be an assembly-parts aggregation of parts such as fuel pump, engine block, etc. The cost of the automobile is a derived value, calculated as the sum of the costs of the parts that comprise the car. A better model might be to have “Part” as the superclass (generalization) of “Engine” and “Wheel” and to contain cost and serialNo as attributes of “Part.”

The precise definitions of the classes, their attributes, and the multiplicity constraints depend on the business domain. For example, the above would be valid for new automobiles but not for “junk cars” that are salvaged for parts. In that scenario, an automobile has 0..1 engines and 0..4 wheels as some might be removed for salvage and resale.

Container-Content Aggregation

Defines a collection of parts where the parts bear no structural or functional relationship to each other, but there are rules that determine membership in the collection. It is very similar to the Organization-Member aggregation pattern below. An example of a container-content aggregation is a frame on a web page that displays ads. Whether an ad is displayed is subject to certain rules about size, content, payment, and interest to the web page viewer. The aggregation between web page and ad can also be considered a container-content aggregation. Of course, a web page has many more areas so the diagram below is only a fragment.

Organization-Member Aggregation

In this form of aggregation, members are part of an organization and the organization is defined as the collection of its members. For example, a cricket team (organization) consists of players (members) or a yacht club consists of members. The nature of the organization is its collection of members but members can come and go without the organization ceasing to exist. There may be a constraint on the minimum required number of members. For example, in Germany, a club (“Verein”) must have at least seven members in order for it can be recognized. Additionally, there may be rules on when a member may part of the organization, although such constraints are managed through application logic, database triggers, or in other ways.

Material-Object Composition

In this relationship pattern, the parts (materials) loose their identity when they are put into the composite (the object) and they cannot be removed once they are part of the composite. For example, paper is made from wood fibers, cotton, and glue. Once those parts are added to the object, they can no longer be removed. Phrases to look for during analysis include: is made from; consists of; is partly. Note that the diagram below does not include a quantity of each material – it would need to be included as part of another class if this were used in a manufacturing domain.

Portion-Ingredient Composition

This pattern is similar to Material-Object except that we view a whole (the composite) as a collection of portions of the whole. The parts are of the same class as the whole (a homomeric relationship).

Examples

  • a slice of pie is a portion of a whole pie
  • an inch is part of a foot.

Phrases to look for during analysis: portion of; references to measurements

Place-Area Composition

This pattern describes a link between similar classes where one is included in the other. Often applies to geographical domains.

For example, Palm Beach is located in Palm Beach County which in turn is located in Florida which is part of the United States which is part of North America, an so on.

This can also apply to real estate and building domains, e.g., an office is on a floor which is in a building. It is different from a container-content aggregation in that the parts cannot be removed. For example, one cannot remove Florida from the United States – of course, one could argue that cessation is an option and thus Florida could be removed from the United States or that Florida could merge with Georgia in a new state. Naturally, the exact rules depend on the domain and could change over time.

Collection-Members Composition

A collection-member composition where parts are permanently fused to the whole and there’s an ordering to the parts. For example, a weekly time sheet consists of daily time sheets or a flight reservation consists of flight segment reservations.

Conclusion

On many occasions some relationship can be viewed as falling into more than one pattern – the precise pattern classification does not matter. The purpose of the patterns is to present various situations where an analyst might discover aggregations or compositions rather than simple associations. When is doubt, make the relationship an association – it can always be refined into an aggregation or a composition when additional constraints are discovered.


Files & Resources

All Files for Lesson 30.155

Errata

Let us know.

---
title: "Relationship Types in UML Class Diagrams"
params:
  category: 30
  stacks: "0,0"
  number: 155
  time: 45
  level: beginner
  tags: UML,generalization,association,aggregation
  description: "This lesson explains the difference between association,
                generalization, aggregation, and composition in UML Class Diagrams.
                It presents a set of common relationship patterns and provides
                many examples."
date: "<small>`r Sys.Date()`</small>"
author: "<small>Martin Schedlbauer</small>"
email: "m.schedlbauer@neu.edu"
affilitation: "Northeastern University"
output: 
  bookdown::html_document2:
    toc: true
    toc_float: true
    collapsed: false
    number_sections: false
    code_download: true
    theme: spacelab
    highlight: tango
---

---
title: "<small>`r params$category`.`r params$number`</small><br/><span style='color: #2E4053; font-size: 0.9em'>`r rmarkdown::metadata$title`</span>"
---

```{r code=xfun::read_utf8(paste0(here::here(),'/R/_insert2DB.R')), include = FALSE}
```

## Introduction

UML Class diagrams are one of many UML diagrams for describing various aspects of a software, data, business, or enterprise architecture. They are used for a number of purposes including object-oriented software design, business data models, and conceptual data models. In this course, we are using a subset of the UML Class diagram symbol set and modeling elements to visualize domain ontologies and conceptual data models.

## Overview

Watch the narrated tutorial below before proceeding the the remaining lesson in order to can get context.

<iframe src="https://player.vimeo.com/video/792554265?badge=0&amp;autopause=0&amp;player_id=0&amp;app_id=58479" width="480" height="270" frameborder="0" allow="autoplay; fullscreen; picture-in-picture" title="Relationship Types in UML Class Diagrams" data-external="1">

</iframe>

## Relationships

Class diagrams show classes (entities of a domain), their attributes, and their relationships. UML distinguishes among several kinds of relationships, including four which are of particular interest for conceptual modeling: association, aggregation, and composition, and generalization. Additionally, there are implementation and dependency relationships, but those are of primary interest to software architects building object-oriented designs -- we will omit those two relationships.

### Association

An association is a domain-level connection between two classes where neither is dominant over the other. Associations can have an optional text label that explains the nature of the association. Additionally, all associations must be multiplicity constraints on both ends of the relationship.

A multiplicity constraint is in the form of *n..m* where *n* and *m* are the lower and upper bounds, respectively, and *n ≤ m*, and *n ≥ 0* and *m ≥ 1*. A bound can be a specific scalar value, *e.g.*, *1*, *5*, *52*.

If the upper bound is not known or there is no specific upper bound then it can be left unspecified using \*. If the lower and upper bounds are the same, a single bound suffices. So, *1..5*, *0..\**, and *52* are valid multiplicity constraints.

Note that \* by itself is not considered an appropriate multiplicity constraint as there is no explicit lower bound. However, according to the UML Specification 2.5.1, \* by itself implies a lower bound of *0*, so \* is the same as *0..\**.

The diagram fragment below shows an example that includes associations only, along with labels, reading directionality indicator (►), and multiplicity. The reading direction is generally left-to-right and top-to-bottom but the directionality indicators allow for free-form placement of modeling elements.

The line itself should not have arrows as associations are bidirectional and equally valid in both directions. The arrow on an association line is useful for database design or software design to indicate which object has a reference (or pointer or foreign key) to the other object, but that is not appropriate for conceptual models as such models are intended to be implementation agnostic.

The UML Class Diagram fragment below illustrates these modeling elements.

![](images/l-30-155-uml-1.jpg){width="50%"}

The diagram has three classes: Passenger, Ride, and Driver. They are linked -- or associated -- and modeled with an association relationship. A passenger takes zero or more rides while a ride is for one or at most two passengers. The ride is driven by exactly one driver and a driver can drive any number of rides, including none.

#### Association Tutorial

In this short tutorial, Khoury Boston's Prof. Schedlbauer goes through an example on how to define an association, label, directionality, and multiplicity.

He also explains the difference between association and aggregation.

<iframe style="border: 1px solid #464646;" src="https://player.vimeo.com/video/672873846?h=c1ba4d8b53" width="480" height="280" allowfullscreen="allowfullscreen" allow="autoplay; fullscreen; picture-in-picture" data-external="1">

</iframe>

### Reading Multiplicities

Multiplicities are constraints or rules defined by the business domain and they can be different for the same pair of classes in different domains. In the above UML fragment, the multiplicity constraint between "Passenger" and "Ride" should be read as follow:

Any passenger may take several rides (the \* in *0..\**). Some Passengers do not take any rides (the 0 in *0..\**). A ride is taken by up two two passengers -- or -- A ride is taken by no more than two passengers. Every ride must be taken by at least one passenger.

### Multiplicities as Object Set Mappings

To establish a multiplicity, you take "one" instance from one class and ask to how many of the other class it is connected to, and vice versa. We are essentially defining a mapping between two sets of objects. A class can be thought of as a set of objects and we are building a mapping between the two.

The short tutorial below provides additional insight into how these mappings are established.

<iframe style="border: 1px solid #464646;" src="https://northeastern.hosted.panopto.com/Panopto/Pages/Embed.aspx?id=7335cf1e-230d-488f-9e1e-ad2e000585b1&amp;autoplay=false&amp;offerviewer=false&amp;showtitle=false&amp;showbrand=false&amp;start=0&amp;interactivity=all" width="480" height="270" allowfullscreen="allowfullscreen" allow="autoplay" data-external="1">

</iframe>

### Aggregation

An aggregation relationships is a special form of association where one class in the relationship is the aggregate (collection) of the other class. This is an anti-symmetric whole-part relationship. Since an aggregation is an association, all of the properties of an association also apply to an aggregation.

### Composition

A composition is a stronger and more restrictive form of aggregation. All of the properties of an aggregation also apply to composition. And, since aggregation is a special form of association, then by the rules of transitivity, all properties of associations also apply to composition. While a composition has a label, it is often omitted as the implied label is "contains" or "has a" and therefore does not add any additional insights.

Composition is also often used to imply that a "part" cannot exist independently of its "whole" or its container. For example, a room cannot exist independent of a house, so there's a composition relationship between house (container) and room (part).

![](images/l-30-155-uml-2.jpg){width="60%"}

Of course, depending on the domain and its rules, the diagram might be different. For example, in the above diagram, it implies that a house can only have one garage, although the garage can have several spots in which to park cars. Naturally, there might be houses that have multiple, independent garages, but we are not modeling that.

### Generalization

A generalization defines a class hierarchy indicating that a subclass is a special type of the superclass. All attributes applying to the superclass also apply, by definition, to the subclass. Generalization is a generalization-specialization taxonomy. This relationship is often implemented in object-oriented programming languages with *inheritance*.

In UML, a generalization is visualized with a hollow area pointing from the specialized class (subclass) toward its generalized class (superclass). We often use the phrase "is a type of" or "is a kind of".

Generalization relationships are relationships between classes and not mapping between sets of objects, so there is no multiplicity or label. The diagram below illustrates a generalization hierarchy.

![](images/l-30-155-uml-7.jpg){width="60%"}

It shows that there are two "special" types (kinds) of members: honorary members and officers. Both are members and have all of the attributes and properties of a member but they each have unique properties just applicable to them. We often say that the subclass "inherits" the properties of its superclass, so in the above example, "Officer" inherits the attributes of "Member", which means that an "Officer" has a name and a date joined and *additionally* has a role attribute -- an attribute that only officers have and not all members in general.

Honorary members and officers can also be classified as members, but honorary members are a subset of all members; hence the term *subclass* for "Honorary Member" and for "Officer". And all members is a superset of all special types of members; hence the term *superclass* for "Members".

### Object-Level Relationships

Associations, and implicitly, aggregations and compositions, are relationships among objects. Generalization is a taxonomy of classes and because of that multiplicity constraints do not apply to Generalization. One can look at classes as set of objects, e.g., class A is the set of all objects that have the structure, attributes, and properties of A. Objects can be viewed as instances of classes having specific values for the attributes and properties but conforming to all of the constraints defined by the class to which the objects belong.

## UML Meta Model

The relationships and their "relationships" can be summarized in the UML meta model below.

![](images/l-30-155-uml-meta-model.jpg){width="60%"}

## Goal of Modeling

A common refrain among those new to conceptual and business data modeling is that since the implementation of association, aggregation, and composition doesn't differ in a relational database or most object-oriented programming languages, then why does one care. The answer is simple: because the relationships exist in the business domain and therefore must be expressed. They express constraints and properties of the objects in the business domain. Just because some databases or programming languages are inadequate does not mean that we should ignore the constraints that are present.

## Aggregation and Composition Patterns

There are several common "patterns" when modeling that are best expressed as either aggregation or as composition. For all aggregations and compositions, the aggregate/composite must in some way depend functionally or structurally on its components.

The relationships of aggregation and composition are always anti-symmetric and transitive, *i.e.*, if **A** contains **B**, then **B** cannot contain **A**, and, if **A** contains **B** and **B** contains **C**, then **A** contains **C**.

### Assembly-Parts Aggregation

An assembly-parts aggregation where a whole (assembly) is comprised of parts that retain their identity when they are contained in the whole. The whole does not cease to exist when parts are removed. Multiplicity constraints are used to specify whether the assembly must contain specific numbers of a part or whether the part of option. Many manufactured items are assembly-parts aggregations.

![](images/l-30-155-uml-3.jpg){width="60%"}

Of course, in the above example, engine would also be an assembly-parts aggregation of parts such as fuel pump, engine block, etc. The cost of the automobile is a derived value, calculated as the sum of the costs of the parts that comprise the car. A better model might be to have "Part" as the superclass (generalization) of "Engine" and "Wheel" and to contain *cost* and *serialNo* as attributes of "Part."

The precise definitions of the classes, their attributes, and the multiplicity constraints depend on the business domain. For example, the above would be valid for new automobiles but not for "junk cars" that are salvaged for parts. In that scenario, an automobile has *0..1* engines and *0..4* wheels as some might be removed for salvage and resale.

### Container-Content Aggregation

Defines a collection of parts where the parts bear no structural or functional relationship to each other, but there are rules that determine membership in the collection. It is very similar to the Organization-Member aggregation pattern below. An example of a container-content aggregation is a frame on a web page that displays ads. Whether an ad is displayed is subject to certain rules about size, content, payment, and interest to the web page viewer. The aggregation between web page and ad can also be considered a container-content aggregation. Of course, a web page has many more areas so the diagram below is only a fragment.

![](images/l-30-155-uml-4.jpg){width="60%"}

### Organization-Member Aggregation

In this form of aggregation, members are part of an organization and the organization is defined as the collection of its members. For example, a cricket team (organization) consists of players (members) or a yacht club consists of members. The nature of the organization is its collection of members but members can come and go without the organization ceasing to exist. There may be a constraint on the minimum required number of members. For example, in Germany, a club ("Verein") must have at least seven members in order for it can be recognized. Additionally, there may be rules on when a member may part of the organization, although such constraints are managed through application logic, database triggers, or in other ways.

![](images/l-30-155-uml-5.jpg){width="60%"}

### Material-Object Composition

In this relationship pattern, the parts (materials) loose their identity when they are put into the composite (the object) and they cannot be removed once they are part of the composite. For example, paper is made from wood fibers, cotton, and glue. Once those parts are added to the object, they can no longer be removed. Phrases to look for during analysis include: is made from; consists of; is partly. Note that the diagram below does not include a quantity of each material -- it would need to be included as part of another class if this were used in a manufacturing domain.

![](images/l-30-155-uml-6.jpg){width="60%"}

### Portion-Ingredient Composition

This pattern is similar to *Material-Object* except that we view a whole (the composite) as a collection of portions of the whole. The parts are of the same class as the whole (a [homomeric](https://en.wikipedia.org/wiki/Homomeric) relationship).

**Examples**

-   a slice of pie is a portion of a whole pie
-   an inch is part of a foot.

Phrases to look for during analysis: portion of; references to measurements

### Place-Area Composition

This pattern describes a link between similar classes where one is included in the other. Often applies to geographical domains.

For example, Palm Beach is located in Palm Beach County which in turn is located in Florida which is part of the United States which is part of North America, an so on.

This can also apply to real estate and building domains, *e.g.*, an office is on a floor which is in a building. It is different from a container-content aggregation in that the parts cannot be removed. For example, one cannot remove Florida from the United States -- of course, one could argue that cessation is an option and thus Florida could be removed from the United States or that Florida could merge with Georgia in a new state. Naturally, the exact rules depend on the domain and could change over time.

### Collection-Members Composition

A collection-member composition where parts are permanently fused to the whole and there's an ordering to the parts. For example, a weekly time sheet consists of daily time sheets or a flight reservation consists of flight segment reservations.

## Conclusion {#concl}

On many occasions some relationship can be viewed as falling into more than one pattern -- the precise pattern classification does not matter. The purpose of the patterns is to present various situations where an analyst might discover aggregations or compositions rather than simple associations. When is doubt, make the relationship an association -- it can always be refined into an aggregation or a composition when additional constraints are discovered.

------------------------------------------------------------------------

## Files & Resources

```{r zipFiles, echo=FALSE}
zipName = sprintf("LessonFiles-%s-%s.zip", 
                 params$category,
                 params$number)

textALink = paste0("All Files for Lesson ", 
               params$category,".",params$number)

# downloadFilesLink() is included from _insert2DB.R
knitr::raw_html(downloadFilesLink(".", zipName, textALink))
```

------------------------------------------------------------------------

## References

-   [Visual Paradigm (nd). UML Association vs Aggregation vs Composition](https://www.visual-paradigm.com/guide/uml-unified-modeling-language/uml-aggregation-vs-composition/)

## Diagram Sources

-   [LucidChart Diagram: Tutorial](https://lucid.app/lucidchart/a7004232-32f7-46dc-aa49-907714ebba4a/edit?viewport_loc=157%2C69%2C1616%2C1143%2C0_0&invitationId=inv_ccbc8099-3c4d-4cb1-8d2a-eb8ed31ab961)

-   [LucidChart Diagram: UML Patterns](https://lucid.app/lucidchart/1357edf9-534f-4abf-9550-bf8719efbb57/edit?viewport_loc=112%2C-38%2C1303%2C1001%2C0_0&invitationId=inv_cab5a5b7-6aca-4b17-8d49-d343579685f1)

## Errata

[Let us know](https://form.jotform.com/212187072784157){target="_blank"}.
