My Six Years of Working in Data

Denis: Hi, my name is Denis, and welcome
to my new video, which will be a bit less

technical than my usual Databricks videos.

In this video, I really want to talk
a bit more about my career and more

specifically how I got into data
from a very unlikely, um, beginning

point, how I evolved my career over
the last six years, including having.

Both employed positions,
but also spending thirds.

So four years out of six working
as a freelancer and a contractor.

In fact, I am at the time, at
the time of making this video.

I've been self-employed for about two
years, and currently I see no reason

to go back to an employed position.

So let me tell you my
story of how I got there.

I graduated in 2000.

18, which was a while ago with a degree
in material science and materials

engineering with a minor in chemical
engineering from KU Leuven in Belgium.

I remember in my last year I did a, an
exchange here, which means I spent the

entire 12 months abroad in France at the
University of Chernobyl for five months.

Which landed me an internship position of
six months at the factory in the French

Alps that produced the liquid aluminum.

And this internship was hugely important
in what would eventually become my career

in data, because that's the first point.

I was really, really interacting
with data, and it all came

to pass because of a program.

Language you may be
familiar with called R.

We briefly had a couple
of courses at university.

Um, it was actually in economics where we
used R to analyze some economical data.

I didn't really think much of
it because I was never really

interested in programming.

I was always very interested in.

Physics, chemistry, and other sciences.

For me, programming in
Python was actually annoying.

I never really enjoyed that particular
course at uni, so I wasn't particularly

good at any, uh, at programming.

But at that internship, when I
started working, the main objective

I really did was analyze lots of data
coming from a SQL Server database.

And this really got me in touch
with the programming language,

well, I should say query language,
uh, SQL or SQL as we say in Europe.

And I spent, in essence the next six
months writing SQL queries, bumping that

data into r and r studio and making a
lots of, a lot of visual, nice graphs.

And my colleagues who hired
me at the internship saw that.

I was able to produce beautiful graphs in
a fraction of the time that they could.

Um, eventually I started to get
more and more work in data analysis.

I discovered that there was an entire
field that was coming up at that time.

It was data science, which I think was
in full swing around 2000, um, 18 and 19.

So I kind of wrote this height wave.

Until I completed some courses on Coursera
on machine learning, and that's basically

my first step into the data world.

So after that, I was
hired by the same company.

I did an internship at, the
company was called TRIMET.

They had their headquarters in
Essen, Germany, and they offered me

a job there as a process engineer.

I happily took this job because in
essence, um, the city of Essen was

only two hours away from my home
in Belgium, which mean that I could

prolong my, let's say European
travels by another year or two, right?

I would not have to go back
to Belgium to where I knew

everything and everyone already.

So that opportunity really excited me.

Um, my job itself as a process engineer
was more focused on process control.

I was developing a new control
system using, uh, MATLAB and

Simulink, which eventually got me
a bit away from the field of data.

I still did a lot of data analysis.

I discovered that Biden was actually
beating our, in terms of, um, importance.

Especially in the field of machine
learning, which is why I picked

up Python mostly on the sites.

Um, so on, on the job, I
was doing a lot of art.

I remember in the evenings I spent
a lot of time doing courses, Python,

um, seeing how all of that works.

Nevertheless, after about one year
and a half at this company in Sen.

I got a bit frustrated that most of
my day job was actually not revolving

around data or machine learning.

I really wanted to get deeper into
that field, which is why in 2020, right

before the start of Covid in March,
I remember that month quite well.

I found a new job.

It was at the data science consultancy
company called Stat Works, and they hired

me because they saw I had this intense
passion for data and machine learning.

They gave me a small case, which I managed
to solve quite well, and they were really

impressed by my, um, motivation that
showed itself in, for instance, competing

in Kaggle competitions and so on.

Kaggle was a website for.

Machine learning games
and competitions online.

I'm not sure if it's still very
popular, but at those days it was

like a way to prove your finesse and
your dexterity in machine learning.

So there I was hired as a, what they
call junior data science consultant.

And in essence, I would be
thought on various machine

learning projects and clients.

Unfortunately, because it was
covid, we all soon went into what

the Germans call could abide.

In essence, it means that
you're only employed at 50%,

so you only work half a week.

Nevertheless, that works was so nice to
pay us the full salary, which essentially

meant that I had half of the week
completed to myself, and I remember

really investing that time, getting
really deep into machine learning.

More specifically deep learning, which
was becoming very popular at that time,

especially for applications in computer
vision and natural language processing.

So as part of that journey, I in fact,
quickly discovered that do lots of

machine learning fast, but also cheap.

You need a workstation, and
that's how I spent the next

two months assembling hardware.

I fell into the rabbit hole of computer
building, which means choosing a good CPU,

choosing the correct graphical cart, which
was really the main component at that.

Uh, at that time, I purchased the 2080
TI by Nvidia, which was a very powerful

cart, also very expensive at that time,
and which allowed me to train some pretty

big intricate machine learning models
that I used for my gaggle competitions.

I remember particularly competing in the
Gaggle steel defect detection competition

and got there in the top 20%, which by
all means is not very impressive, but

it's something I was very proud of.

In essence, competing on Gaggle is
what eventually led me to switch from

R completely to Python and learning
the language completely by doing.

I never followed any courses on
Python, didn't really study any books.

But just by writing all of
this code and understanding

the machine learning frameworks
allowed me to master the language.

At the time.

I was also reached out by a former
colleague who worked with me at Premeds.

My first job was an engineer
in SEN, and he was still very

impressed by my ability in art.

And he knew that I was able to produce
nice graphs and applications quickly.

And given that he was a freelancer,
he worked at a, he had a

particular project for that same
company in a different division.

And he desperately needed a tool that
would allow him to visualize a lot

of signal coming from an industrial
controller, um, in one dashboard.

And.

He didn't really want to spend a
lot of time buying an expensive

tool or doing things himself.

So he reached out to me with an idea
that, Hey, de um, would you like to try

to build an app for me as a freelancer?

And I've always wanted to be
more independent of my employer,

be more entrepreneurial.

This first project really gave me the
opportunity to dip my toes into freelance.

I had additional time, I had already a
steady sort of income from my job, and

now I was able to earn, I think, 1.5

K Euros to build a nice
application in R, which was still

one of my favorite language.

So I took this opportunity, um, with
both hands built the application,

spent way too much time on it.

But it felt so satisfying delivering a
project from end start to end without

having someone looking over your shoulder.

No status meetings, no
agile meetings, dailies.

You just promised the work,
agree on the scope, and then

deliver it at the fixed price.

And this way of working appealed
to me so much that already just six

months into my new data consulting
job in Frankfurt, I quit the job

and decided to freelance full-time.

At that time, that was quite a bold
decision because as you remember,

COVID was still in full swing.

Companies were, uh, losing money.

Everyone was afraid.

Nevertheless, I built up, um, a nice nest
egg 'cause I always saved a lot of money.

Um, instead of spending it, and now
essentially I was completely unemployed, I

opened a freelancing business in Germany.

And was now looking for new clients to do.

Unfortunately, my colleague did not
need any additional work, so I was

in fact spending, I think it took
me at least two or three months

before I found my next contract.

My next contract was actually not
in R and not in machine learning.

It was in a Python framework
called Apache Airflow.

Now, Apache Airflow was a framework
that was putting a bit on the side.

During my first, during my second job
at Stats and also during the first month

of my freelancing when I still had my
clients, I was really fascinated by

the aspect of having a clear organized
way of building data pipelines,

essentially batch processing, scheduled
by a workflow orchestrator like Apache

Workflow, and by mid chance on some.

I think it's called a contractor
or contracting company.

I saw a project of a startup in
Cologne that needed someone who could

help them implement Apache airflow.

So I applied for that job, had I
think one or two interviews with

their team, and they liked me, and
so they hired me as freelancer, or

I should say more of a contractor.

And this was quite exciting
because this is what my first.

Real contracting gig with a company that
I couldn't get through via my own network.

This was really a cold, uh, contact.

I worked with them for
about six months, I think.

Delivered a project in a
startup, Apache Airflow.

I also worked quite a bit with
Apache Kafka at that time because

they were an IOT starter, but
really wanted to experiment with,

um, real time communication,
uh, via messaging frameworks.

So I did that for a while.

By now, we are reaching the end of 2020
and at the beginning of 2021, I actually

returned to Belgium and opened an LLC.

So I limited liability company because
my freelancing business was growing.

I realized that I want to do this further.

I want to contract more, and I
needed the company to do this to,

um, be able to secure bigger clients.

So I moved back to Belgium, continued
working for that client, um, for

that particular startup, but was
also looking for new additional work.

And the next project I actually
landed was for a very big company.

It was, uh, Johnson & Johnson in
Belgium, so a big pharma industry.

This was also, um, quite a big step for me
because I left the world of medium-sized

companies and small-sized companies.

I started working for enterprise clients
and I very soon became familiar with

the particular challenges of enterprise.

Being that you have, uh, very large
meetings, often not that much time coding,

you have a lot of different people, and
the scope of the work is so big that

you can essentially don't grasp it all.

So it's as if you're all working on.

Different parts of a big elephant.

Someone's working on the trunk,
someone is feeling the leg.

The only one is working on the
tail, but no one has the view on

the entire elephant very often.

This was a real contracting
position, which means that I got

there via a contracting company.

It was a data engineering company.

That found me.

I was looking for an additional
body or hands, hands on a

keyboard to put on this project.

So they were acting like an intermediary,
receiving a part of my daily rate.

Now, although the pay was good, the
framework that we were, that we were

working on was also very interesting.

It was called Kro.

Um, essentially a pi, uh, PySpark
framework for, uh, machine

learning on very big data.

I did that for a while, but very quickly
got frustrated by a, the many meetings

and the fact that I've lost the initial
freedom I gained by freelancing.

I suddenly had working hours, meetings,
scrum updates, so all the benefits of

being freelancer were no longer there.

I was essentially a full-time contractor.

The second thing I also didn't
really like was that I left.

The manufacturing world.

I was no longer in industry.

And coming from the aluminum sector, being
a materials engineer or metals engineer,

to be more specific, I really missed
the chemistry, the physics, the dust,

the heat from my previous factory jobs.

I wanted to work more
on industrial use cases.

So I did that contract for I
think about a year and a half.

Until I got a very interesting email.

We are now at the beginning of 2022.

In the middle, I was contracted
by, or I was contacted by a German

headhunting bureau that were
looking for a data engineer who

had some experience in aluminum.

At that time, a company called Novelis.

It was the.

I guess still is the world
leader in rolling and recycling.

They were looking for a data
engineer to work on their machine

learning projects in Germany.

And given that I then spoke German, I
had experience in manufacturing, even

in aluminum, and I had data engineering
experience from my last contracting

gig in Apache Airflow, which meant
I knew Spark, I knew Databricks.

I was hired.

As a data engineer, unfortunately
this meant that I had to

close my freelancing business.

At that point.

I was also ready to throw in the towel.

I think I got a bit impatient
and frustrated with the fact that

I wasn't really a freelancer.

I wasn't really free, and I had
this imposter syndrome where I felt

that you know what then is maybe you
need more experience in data before

you can actually be a freelancer.

So I closed my business in Belgium,
relocated to Germany, got a very

good, um, high, um, high paying job
as an employee for this company.

It was fully remote.

I was living in Berlin at that time,
while I did actually spend six months

at the factory in Copeland, which was
a deal with the company that I would

spend six months on site to learn a
business and then I could work remote

from anywhere based in Germany.

And I chose Berlin.

At that company I did
quite interesting projects.

I was part of a very smart team
of data scientists and I was there

talk to prepare the data pretty
much completely in Databricks.

So I wrote a lot of spark code,
did notebook orchestration created

of table medallion architecture.

Yeah, the bronze, silver,
and gold clean layers.

And generally passed on my knowledge
to a junior data engineer who

was also employed next to me.

And that was my first experience
really coaching junior people.

And I really enjoyed it.

Um, it was fun to not only consume
knowledge, but also be able

to pass it on to someone else.

I.

Even though I liked the job, I again,
very quickly noticed that I was

just not built to be an employee.

I really missed having the independence,
uh, that freelancing gave me.

I was also a bit frustrated by the
approach of the company when it came

to data ingestion of sensor data, which
was, um, this topic is, um, perhaps a bit

more too deep to explore in this video.

Let's say that particular challenge in
manufacturing, which actually forced me

to take my next step, which I will get
to a big challenge in manufacturing,

is that there's a very high volume
of time series data You have to know.

A factory is equipped
with many, many sensors.

All of these sensors produce data in
sometimes 20 millisecond frequency.

But let's assume even if it's one second
frequency, you're dealing with a lot of

time series generated every single moment.

And processing this data is not
as straightforward as processing

sale data or customer data that,
for example, Nike would be toast.

And at that time, I discovered.

Um, let's say the field of industrial iot.

So it's a bit similar to what I
did from my first startup gig when

I did a bunch of airflow work.

But there it was more focused on a
concept called the Unified Namespace.

And the unified namespace was a
concept developed by, uh, Walker

Walker Reynolds in the United States.

It was essentially an
event driven architecture.

Based on the MQTT protocol, it's a very
lightweight protocol, um, very commonly

used in IoT applications because allows
to transport data over insecure, um, or

unstable network in very large volumes.

It has a very small memory footprint,

but I got really upset by this technology
'cause I thought that, look, this

architecture is a solution to industrial.

Data infrastructure, in fact.

So what a lot of companies, including
Novelli was really struggling with, I felt

that the unified namespace was a solution.

So as an employee, I really tried
to push this concept in the company.

And yeah, of course I got
some interest, some traction.

I could do it as, um, let's
say as a side project.

But the whole strategy of
the company was set in stone.

Essentially what usually happens in a big
enterprise, some big consultants come in,

they develop a fancy 5-year-old, um, big
plan, and then the company executes that.

So I quickly realized that there was no
way the company would take a big risk

like that and explore this new technology.

So I felt that if I really wanted
to work on this thing called the

unified namespace, I would have to
go and work as a freelancer again.

Which meant that for, I think the
fourth time now, I quit my job.

I resigned, which troubled
the company very much.

They really tried to keep me there
'cause my work was generally good.

It was also very hard to hire
competent data engineers at that time.

Nevertheless, in July, 2023, I
resigned from this job and started

to work as a freelancer again.

Again, at this time I was in Berlin, so
in Germany, so I reopened a freelancing

business in Germany, and I even got
six months of funding because I managed

to, let's say, create a very coherent
business plan of serving the aluminum

industry in digital transformation.

That what would be my business venture.

I would, I would specialize.

On the aluminum industry and focus on data
integration using the unified namespace.

And I really hoped that my previous
background in aluminum or aluminum,

depending where you're from, and data will
allow me to build systems completely on my

own for this industry and now real life.

That's quite a naive venture,
even though it's quite niche,

it's a very conservative industry.

I tried to build a business out of
this for two years, so I focused

really hard on conferences.

I flew to two conferences in
aluminum industry, so once a

year, once it was in, um, Dubai.

I.

The second time was in Leo in France.

Shook a lot of hands,
talked to a lot of people.

I even flew to a conference in Istanbul
where I presented the unified native

space for the aluminum industry.

I wrote quite a lot of blog
posts and quite a lot of, um,

articles in industrial magazines.

I had a blog.

I had a daily, daily newsletter.

I also got in touch with a company called
the United Manufacturing Hub in Germany.

That was a company that open sourced
a unified namespace for the industry.

I worked with them briefly as a content
creator, but after two years, I just

noticed that you have to remember, my
background was always in data, machine

learning, data analysis, analytics,
data engineering, and I felt I was

drifting very far away into IIOT.

I was now dealing with integrating
data from controllers from industrial

SCADA systems, which stands for
Supervisory Control and Data Acquisition.

And I felt I was going too broad and
quite recently, actually, I would say

about a month ago, I secured my next gig,
which was for a big renewables company.

Again, a German company.

Um.

Oh, I have to mention in the fact
that I frankly also returned to

Belgium again, I, from Berlin.

That was in September, 2024 or last
year, I reopened the business in Belgium.

'cause I figured that I wanted to focus
my life in Belgium in the long run

that I would eventually return there.

I was also quite done
with, uh, with Germany and.

I found a very good contract through
this Belgian l lt, um, through a

large recruiting company called Hayes,
who offered me, who thought that I

had experience in data engineering,
especially in SCADA systems and

manufacturing, and they, someone that
could build a data quality framework

built on Databricks, and that's how
I actually got back into Databricks.

I saw that.

The old world, which I've left, namely
data and ventured into manufacturing,

suddenly showed itself back to me.

I was working once again with Spar in
Databricks and really enjoyed it, which

is why I now decided to focus my career
more on a specific platform instead

of a particular niche like aluminum or
manufacturing, or even unified namespace.

I felt this industry will need
a couple more years to really

implement this technology.

But for me, I wanted to work on
something that was already booming.

And the conference, the Databricks
Conference, which I'll be going to

in um, June in San Francisco has
60,000 attendees, which make me

realize that I think Databricks is
a technology that is going to stay.

It's a good product, it's a good product.

They make the whole data.

Let's say product lifecycle, very easy.

'cause they do the data management,
the processing, the model serving.

Databricks can do it all eventually,
which is why I decided to focus

my career, let's say, on my
freelancing business on Databricks.

I'm continually, I'm
continuously working on various.

Subfield such as data integration
into Databricks from SCADA systems,

data quality, bis, spark, and
processing data visualization.

So all the various experience that
I've built over the last five years,

I feel that they come nicely together
in this new focus, um, Databricks,

which I'm currently exploring.

So that's where I'm currently at.

Um, let's look at what the future
holds and what I learned from the past.

First of all, I thought that having
a degree in data was, well, it

was, first of all, impossible.

At that time, 2018, there weren't really
any data science university roles.

Sure there were some bootcamps.

I never did those.

Um, though I felt that you can get
quite far by still learning by yourself.

But whether that's still
through today, I don't know.

But it worked for me at that time.

I also realized that it's fine to look
for, um, for yourself and what you

want to do in the first, let's say,
five or even 10 years of your career.

I mean, I went from growth engineering,
from MATLAB and Simulink to SQL queries,

to Art Python, to playing with machine
learning on Gaggle to going more

into data engineering and bis, spark,
Databricks, going to manufacturing

into MQTT and Unified namespace.

And now back into Databricks.

And I felt that all the things
I've learned were all very

relevant, um, to my work now.

So I think at some point it's just about
sticking with something long enough,

even though I've jumped ships very long,
I've never had a job for two years.

I've always quit before Denmark
was never promoted, but still I did

really well because in the long term,
if you look at my career, data has

been the central point of it all.

I've approached it from a different
direction, sometimes more from analytics,

sometimes more from engineering.

But I've always worked a lot with data and
I've always focused on the fundamentals.

Things like SQL writing with
Python code, um, but also exploring

pattern performing frameworks.

So what do I hope you get from this video?

I think for this video it was
more, um, maybe even for me

just to say that it's fine.

To look for your way.

I think it's also fine to just follow your
intuition and as long as you're putting

the work and learn, you will be fine.

Even now, the thousand 25, when people say
that the market is crashing and tech jobs

are decreasing, I haven't really feel,
uh, felt, uh, lack of potential work.

Sure.

I've had like months before where
I was looking in between projects.

For work, but I've always felt that
if you know what you're doing and you

can create value for a company, you'll
never be out of, uh, out of business.

Maybe a final tip I would give you as a
viewer is that even now, even though I'm

back focused on Databricks, I'm always
very focused on one particular industry.

Like I try to avoid working
for, let's say Nike or Facebook

or anything, anything else.

Given that I've worked in manufacturing
for five years and industry in general,

I'm still very focused on serving
either manufacturing or utilities

or chemical companies, essentially
anything within the industry.

This really allows me to set me apart
from other freelancers and eventually

build experience in a particular niche.

So I think that's it for this video.

Uh, thank you for hearing
me ramble about my career.

I hope it was useful for you and for me.

It was nice to revisit the last six
years of my career and I'm quite

excited about what's to come on
the future and anything interesting

I will share with you on my chat.

Thank you and bye-bye.

My Six Years of Working in Data
Broadcast by