Python Sets¶
A set is a collection of unique elements. Sets are mutable, meaning they can be modified after they are created, but they can only contain immutable (unchangeable) elements like numbers, strings, and tuples.
Creating a Set¶
You can create a set by using the set()
function or by placing all the elements within curly braces {}
. Here’s an example:
fruits = {"apple", "banana", "cherry"}
print(fruits)
{'banana', 'cherry', 'apple'}
Adding Elements to a Set¶
To add a single element to a set, use the add()
method. To add multiple elements, you can use the update()
method:
fruits.add("orange")
print(fruits)
fruits.update(["mango", "grapes"])
print(fruits)
{'banana', 'orange', 'cherry', 'apple'} {'orange', 'grapes', 'banana', 'apple', 'mango', 'cherry'}
Removing Elements¶
There are several methods to remove elements from a set: remove()
, discard()
, and pop()
:
fruits.remove("banana")
print(fruits)
fruits.discard("apple")
print(fruits)
item = fruits.pop() # Removes a random item
print(item, fruits)
{'orange', 'grapes', 'apple', 'mango', 'cherry'} {'orange', 'grapes', 'mango', 'cherry'} orange {'grapes', 'mango', 'cherry'}
Checking for Membership¶
To check if an item is in a set, use the in
keyword:
if "mango" in fruits:
print("Mango is in the set")
Mango is in the set
Set Operations¶
Sets support mathematical operations like union, intersection, difference, and symmetric difference:
a = {1, 2, 3, 4}
b = {3, 4, 5, 6}
# Union
print(a | b) # or a.union(b)
# Intersection
print(a & b) # or a.intersection(b)
# Difference
print(a - b) # or a.difference(b)
# Symmetric Difference
print(a ^ b) # or a.symmetric_difference(b)
{1, 2, 3, 4, 5, 6} {3, 4} {1, 2} {1, 2, 5, 6}
Set Comprehensions¶
Set comprehensions provide a concise way to create sets. The syntax is similar to that of list comprehensions:
squared = {x**2 for x in range(10)}
print(squared)
{0, 1, 64, 4, 36, 9, 16, 49, 81, 25}
Sets are invaluable when dealing with large datasets requiring the elimination of duplicates, for set operations like those in mathematics, and when the order of items is unimportant. Their performance for membership tests is better than lists or tuples.