Comment on page
Extract content world cloud
Tags: #linkedin #worldcloud #content #analytics #dependency
Last update: 2023-05-29 (Created: 2022-06-30)
Description: This notebook provides a way to extract content from LinkedIn and visualize it in a word cloud. To run this notebook, you must have already run LinkedIn_Get_profile_posts_stats.ipynb or LinkedIn_Get_company_posts_stats.ipynb to get your post stats in CSV.
Disclaimer:
This code is in no way affiliated with, authorized, maintained, sponsored or endorsed by Linkedin or any of its affiliates or subsidiaries. It uses an independent and unofficial API. Use at your own risk.
This project violates Linkedin's User Agreement Section 8.2, and because of this, Linkedin may (and will) temporarily or permanently ban your account. We are not responsible for your account being banned.
try:
from wordcloud import WordCloud
except:
!pip install wordcloud --user
from wordcloud import WordCloud
import matplotlib.pyplot as plt
import pandas as pd
# Input
csv_input = f"LINKEDIN_PROFILE_POSTS.csv" # CSV path with your posts stats generated with 'LinkedIn_Get_profile_posts_stats.ipynb' or 'LinkedIn_Get_company_posts_stats.ipynb'
# Outputs
name_output = "LINKEDIN_CONTENT_WORLD_CLOUD"
image_output = f"{name_output}.png"
naas.dependency.add()
# -> Uncomment the line below to remove your dependency
# naas.dependency.delete()
Get posts feed from CSV stored in your local (Returns empty if CSV does not exist)
def read_csv(file_path):
try:
df = pd.read_csv(file_path)
except FileNotFoundError as e:
# Empty dataframe returned
return pd.DataFrame()
return df
df_posts = read_csv(csv_input)
print("✅ Posts fetched:", len(df_posts))
df_posts.head(1)
# Creating the text variable
text = " ".join(text for text in df_posts.astype(str).TEXT)
# Creating word_cloud with text as argument in .generate() method
word_cloud = WordCloud(
collocations=False, background_color="white", width=1200, height=600
).generate(text)
%matplotlib inline
# Display the generated Word Cloud
plt.imshow(word_cloud, interpolation='bilinear')
plt.axis("off")
plt.show()
# Save your image in PNG
word_cloud.to_file(image_output)
# Share output with naas
naas.asset.add(image_output)
# -> Uncomment the line below to remove your asset
# naas.asset.delete(image_output)
Last modified 3mo ago