#data-engineering

#cloud-computing
from Hackernoon
3 months ago
Data science

LLMs in Data Engineering: Not Just Hype, Here's What's Real | HackerNoon

Large Language Models are transforming data engineering by enhancing performance and operational efficiencies.
from Hackernoon
1 year ago
Information security

A Platform-Agnostic Approach in Cloud Security for Data Engineers | HackerNoon

Data is becoming crucial for businesses, and effective data engineering is essential for leveraging cloud technology while addressing associated security risks.
from Medium
4 days ago
DevOps

Evolvability - It's Mostly About Data Contracts

Data Contracts can mitigate complexity in analytic systems by fostering loose coupling and enhancing adaptability.
from Hackernoon
2 weeks ago
Data science

Tired of Copy-Pasting Hive Output? This PySpark Hack Fixes It | HackerNoon

Automating CSV export from Hive or Impala output is essential for efficient data engineering tasks.
Women in technology
from Business Insider
4 days ago

I became a director at Ford after pivoting careers in the last recession. Here are 3 ways to recession-proof your job.

Continuous learning through online courses is key to job security in recessionary times.
#acquisition
from ChannelPro
5 days ago
Data science

Datatonic expands global services with Syntio acquisition

Datatonic expands its services by acquiring data engineering firm Syntio, enhancing global reach and expertise in AI solutions.
from Techzine Global
5 days ago
Data science

Datatonic acquires Syntio and strengthens expertise in data engineering

Datatonic's acquisition of Syntio enhances its data consultancy with increased capabilities in data engineering and expanded service offerings.
#spark
from awstip.com
2 weeks ago
Data science

Spark Scala Exercise 23: Working with Delta Lake in Spark Scala - ACID, Time Travel, and Upserts

Delta Lake enhances data reliability and governance for data lakes by integrating warehouse features.
from medium.com
3 weeks ago
Data science

Spark Scala Exercise 10: Handling Nulls and Data Cleaning - From Raw Data to Analytics-Ready

Effective data cleaning is essential in data engineering to prevent downstream issues caused by nulls.
from Medium
3 weeks ago
Scala

Spark Scala Exercise 1: Hello Spark World with Scala

Understanding Spark initialization is crucial for data engineering tasks.
This exercise introduces key Spark concepts such as SparkSession and lazy evaluation.
Successfully checking the setup ensures readiness for distributed data processing.
#scala
Scala
from Medium
2 months ago

Scala Vs. Python - What Data Engineers Need To Know

Scala improves upon Java while remaining JVM-compatible, making it attractive for organizations.
from Medium
5 months ago
Scala

Scala Applications in Data Engineering: A Comprehensive Overview

Scala is an ideal choice for data engineering, particularly with big data frameworks like Apache Spark.
from Medium
3 weeks ago
Scala

Spark Scala Exercise 2: Load a CSV and Count Rows

Learning to load structured CSV data into Spark DataFrames using Scala prepares aspiring data engineers for essential ETL processes.
Artificial intelligence
from Medium
1 month ago

These AI & Data Engineering Sessions Are a Must-Attend at ODSC East 2025

Organizations are focusing on efficiently and securely integrating advanced AI models at scale.
Practical strategies and real-world insights are essential for navigating AI and data engineering challenges.
#data-architecture
Data science
from Medium
2 months ago

Can Your Data Architecture Handle Tomorrow? Building for Flexibility and Lasting Impact

Good data architecture is essential for effective data engineering and organizational competitiveness.
from Hackernoon
6 years ago
Data science

Web3 Data Engineering Crash Course | HackerNoon

Web3 data architecture is transforming how enterprise and scientific data are approached, emphasizing cross-organizational data exchange over internal data.
#data-management
from Hackernoon
3 years ago
Data science

The Two Types of Data Engineers You Meet at Work | HackerNoon

Data engineers are categorized into two archetypes: business-oriented and tech-oriented, each with distinct roles and responsibilities.
from Medium
2 months ago
Data science

Understanding Data Generation in Source Systems: How It Works and Real-Time Applications

Understanding data generation is essential for effective data engineering and creating scalable data pipelines.
#ai
from Medium
6 months ago
Artificial intelligence

Networking, Hackathons, Meetups, and Other Extra Events Coming to ODSC West 2024

The conference provides hands-on AI learning and immersive networking opportunities.
Participants can engage in various thematic events including hackathons and summits.
ODSC West fosters connections among AI professionals and enthusiasts.
from ComputerWeekly.com
2 months ago
Data science

A path to better data engineering | Computer Weekly

Organizations face challenges processing diverse data formats and overcoming data silos.
Traditional data engineering methods struggle with the variability of real-world data.
Understanding the skills that data science requires is critical for meeting modern data challenges.
#analytics
from faun.pub
2 months ago
Business intelligence

Serving Data in the Data Engineering Lifecycle: A Comprehensive Guide

Data serving is the culmination of data engineering, delivering value to users through analytics and applications.
from Medium
6 months ago
Data science

Why I Chose Google Cloud Platform (GCP) for Data Engineering: Real-World Benefits

GCP is preferred for data engineering due to its scalability, integrated analytics, and cost-effectiveness.
from Techzine Global
2 months ago
Data science

Oh, you wanted data? Confessions of a Test Management Engineer

A focus on data engineering from the outset can enhance test case management tools and improve software development outcomes.
Data science
from InfoQ
3 months ago

Shaping an Impactful Data Product Strategy

Data teams need a collaborative strategy to align and deliver long-term value rather than reacting to immediate demands.
from Forbes
4 months ago
Software development

Council Post: 15 Reasons To Choose Node.js For Product Development In 2025

Node.js is a crucial runtime for modern web development due to its speed, scalability, and efficiency in handling complex applications.
from Medium
4 months ago
JavaScript

Deep dive on Spark Aggregation APIs

Complex aggregation problems require advanced solutions beyond straightforward SQL functions.
User Defined Aggregate Functions (UDAFs) are essential for calculating median values in Spark.
Performance and implementation ease are critical factors in selecting aggregation techniques.
from ComputerWeekly.com
4 months ago
Artificial intelligence

Computer Weekly Buyer's Guide features list 2025 | Computer Weekly

Computer Weekly's Buyer's Guides educate and guide readers through the IT buying cycle to ensure informed purchasing decisions.
from InfoQ
5 months ago
Agile

Optimizing Uber's Search Infrastructure: Upgrading to Apache Lucene 9.5

Uber upgraded its search infrastructure from Apache Lucene 8.0 to 9.5, improving search capabilities and overall performance.
Data science
from Medium
5 months ago

Choosing Your First Language in Data Engineering: A Beginner's Guide

Choosing the right programming language is crucial for your data engineering career.
Python is favored for its simplicity, rich libraries, and big data integration.
#generative-ai
from thenewstack.io
6 months ago
Data science

Data Observability: Multicloud, GenAI Make Challenges Harder

Acceldata's focus on data observability capitalizes on the exponential growth of data and the increasing complexity of managing it across multicloud systems.
from InfoQ
10 months ago
Data science

Edo Liberty on Vector Databases for Successful Adoption of Generative AI and LLM based Applications

Vector databases play a critical role in the generative AI (GenAI) space.
Data science
from Techzine Global
6 months ago

With Databricks Apps, business users get more out of data

Databricks Apps enhance data accessibility for business users, enabling quicker insights without extensive engineering work.
from Berlin Startup Jobs
7 months ago
DevOps

Job Vacancy: Lead Frontend Engineer // GlassFlow | IT / Software Development Jobs | Berlin Startup Jobs

GlassFlow is developing a user-friendly data streaming platform that simplifies real-time data access for engineers.
The role offers a unique chance to shape the future of an innovative data infrastructure startup.
The company focuses on creating an inclusive workplace with competitive benefits for its employees.
from Medium
7 months ago
Data science

The Importance of Data Structures and Algorithms in the Life of a Data Engineer

Mastering Data Structures and Algorithms is crucial for optimizing data engineering tasks.
from Hackernoon
9 months ago
DevOps

Breaking Down the Worker Task Execution in Apache DolphinScheduler | HackerNoon

Apache DolphinScheduler is an enterprise-level visual workflow scheduling system that offers flexibility, scalability, and robust fault tolerance.
from Berlin Startup Jobs
9 months ago
DevOps

Job Vacancy: Senior Data Engineer // Latana | IT / Software Development Jobs | Berlin Startup Jobs

Latana provides brand insights for better marketing decisions and works with top B2C brands like Headspace and Unilever to optimize brand performance.
#apache-airflow
from New Relic
9 months ago
DevOps

Using OpenTelemetry to monitor Apache Airflow

Monitoring Airflow is vital for optimizing performance and reliability of data pipelines.
from www.montecarlodata.com
3 years ago
Business intelligence

The Future of the Data Engineer

Maxime Beauchemin paved the way for data engineering with projects like Apache Airflow and Apache Superset, highlighting the importance of specialized engineers in scaling data science.
from TechCrunch
10 months ago
Data science

Databricks launches LakeFlow to help its customers build their data pipelines | TechCrunch

Databricks introduced LakeFlow as its own data engineering solution for data ingestion, transformation, and orchestration, reducing reliance on third-party tools.