hachyderm.io is one of the many independent Mastodon servers you can use to participate in the fediverse.
Hachyderm is a safe space, LGBTQIA+ and BLM, primarily comprised of tech industry professionals world wide. Note that many non-user account types have restrictions - please see our About page.

Administered by:

Server stats:

9.4K
active users

#DataQuality

3 posts3 participants0 posts today

Data quality is crucial for companies, and Great Expectations (GX) is a powerful framework for data validation.

Integrating GX with Microsoft Fabric allows for efficient, programmatic data checks.

By defining expectations and performing validations, businesses can ensure data accuracy and reliability.

Let's dive into how to set up GX within Microsoft Fabric for effective data management.

Overcoming Data Analytics challenges. Data analytics teams in Austria, Denmark, Germany and Netherlands often struggle with defining a clear data strategy and integrating advanced analytics tools. These challenges limit the potential for data-driven decision-making.

Pain Points:
Defining a comprehensive data strategy.
Integrating advanced analytics tools.
Ensuring data quality and consistency.

As noted in Austrian data analytics challenges, privacy, security, and data ownership are significant concerns. Additionally, barriers to data adoption include data consolidation and governance.

Pain Points and barriers to the adoption of Data services
plainconcepts.com/pain-points-

Austrian data analytics challenges
iktderzukunft.at/resources/pdf

Continued thread

I’m also reminded of the time I spoke at a conference in London that got feedback on presenters from delegates. One delegate complained that they had been expecting the taller, balder Wicklow Daire and asked for a refund of their conference ticket as an impostor had been speaking in that slot.

Reader: it was a #DataQuality conference. Ironically my keynote was about how spelling and misgendering issues re my name led me into my career. And it was HILARIOUS.

An analysis of 100 Fortune 500 job postings reveals the tools and technologies shaping the data engineering field in 2025. Top skills in demand:
⁕ Programming Languages (196) - SQL (85), Python (76), Scala (14), Java (14)
⁕ ETL and Data Pipeline (136) - ETL (65), Data Integration (46)
⁕ Cloud Platforms (85) - AWS (45), GCP (26), Azure (14)
⁕ Data Modeling and Warehousing (83) - Data Modeling (40), Data Warehousing (22), Data Architecture (21)
⁕ Big Data Tools (67) - Spark (40), Big Data Tools (19), Hadoop (8)
⁕ DevOps, Version Control, and CI/CD (52) - Git (14), CI/CD (13), DevOps (7), Version Control (6), Terraform (6)
...

#DataEngineering #BigData #SQL #Python #ETL #AWS #CloudComputing #Spark #DataModeling #DataWarehouse #DevOps #DataGovernance #DataVisualization #MachineLearning #API #Scala #Java #GCP #Azure #Hadoop #Git #CICD #Terraform #DataQuality #Tableau #PowerBI #Collaboration #Microservices #MLOps #TechSkills

reddit.com/r/dataengineering/c

#DataFest2025 #KODAQS #DataQuality
Data Fest 2025, which takes place from 28 to 30 March at the Ludwig-Maximilians-Universität in Munich, is getting closer. KODAQS is officially taking part for the first time this year. The competition offers students the opportunity to work on extensive data sets in teams of 3-5 people within 48 hours. Kodaqs will contribute with a team of experts to measure and analyse the data quality.
datafest.de/home

Replied in thread

@ChrisMayLA6 A key ingredient in AI is data. I have spent much of my career helping organisations manage #dataquality and it is fair to say, the quality of data related to most organisations is pretty poor at best. Feeding poor data into any LLM or AI tool will not deliver the results anticipated, but as often observed, may come up with a plausibly wrong answer. Treat all outputs of AI with extreme caution!

Dealing with a supplier for a repair.
Them: “And that will be done in the Gorey Depot which is the nearest one to you.”
Me: “I’m nowhere near Gorey”
Them: “But your eircode says your nearest depot is in Gorey”
Me: “Nope. It’s 3 minutes from me in Wexford”
Them: “OK. I’ll change that”
#DataQuality