9:59:59.000,9:59:59.000
Our weekly video hangout series!
9:59:59.000,9:59:59.000
I think - let's see, we started a little bit off time
9:59:59.000,9:59:59.000
so I'll say it again:
9:59:59.000,9:59:59.000
Welcome to GV Face, our weekly video hangout series!
9:59:59.000,9:59:59.000
Today, we are celebrating the 25th birthday of the world wide web.
9:59:59.000,9:59:59.000
Pretty exciting. That was on Wednesday.
9:59:59.000,9:59:59.000
Um, we've got a really all-star lineup of guests
9:59:59.000,9:59:59.000
on today's program.
9:59:59.000,9:59:59.000
Um, moving from left to right, we have:
9:59:59.000,9:59:59.000
Alan Emtage, a very special guest who is
9:59:59.000,9:59:59.000
gonna talk to us about his very special creation
9:59:59.000,9:59:59.000
of, uh, the first web browser...
9:59:59.000,9:59:59.000
Um! We have Jeremy Clark, in Montreal -
9:59:59.000,9:59:59.000
Jeremy is a technical director at Global Voices.
9:59:59.000,9:59:59.000
Josh Levy, from Free Press,
9:59:59.000,9:59:59.000
in Massachusetts, in the U.S.
9:59:59.000,9:59:59.000
and Renata Avila, campaign manager [br]for the Web We Want
9:59:59.000,9:59:59.000
Creative Commons extraordinaire, and
9:59:59.000,9:59:59.000
GV star.
9:59:59.000,9:59:59.000
who is joining us from Berlin!
9:59:59.000,9:59:59.000
Welcome, everybody!
9:59:59.000,9:59:59.000
Um. So we wanted to start today's show
9:59:59.000,9:59:59.000
by talking a little bit about the world wide web
9:59:59.000,9:59:59.000
and the internet.
9:59:59.000,9:59:59.000
'Cuz a lot of people think that they're the same thing
9:59:59.000,9:59:59.000
when actually, that's not quite true.
9:59:59.000,9:59:59.000
I want to first turn to Jeremy
9:59:59.000,9:59:59.000
and just ask, Jer, could you
9:59:59.000,9:59:59.000
break it down for us, like,
9:59:59.000,9:59:59.000
I thought that the internet was invented in the 70's
9:59:59.000,9:59:59.000
but, if it's the 25th birthday of the web,
9:59:59.000,9:59:59.000
what does that mean?
9:59:59.000,9:59:59.000
Jeremy Clark: Okay, well, the
9:59:59.000,9:59:59.000
best place to start, I think, is
9:59:59.000,9:59:59.000
the internet - it has existed in various formats
9:59:59.000,9:59:59.000
since the 1970's, as you said,
9:59:59.000,9:59:59.000
but it was the web that really made it
9:59:59.000,9:59:59.000
enter our homes.
9:59:59.000,9:59:59.000
and, so, understanding the relationship is important.
9:59:59.000,9:59:59.000
So, the internet was invented by
9:59:59.000,9:59:59.000
the U.S. Government in a lot of senses...
9:59:59.000,9:59:59.000
...a mix of military and science funding
9:59:59.000,9:59:59.000
that developed the network of [br]actual computers
9:59:59.000,9:59:59.000
that can communicate with each other over
9:59:59.000,9:59:59.000
wires.
9:59:59.000,9:59:59.000
Now, another related technology that is also compri--
9:59:59.000,9:59:59.000
[amends] uh, built in to the web
9:59:59.000,9:59:59.000
is called hypertext. And that is the notion
9:59:59.000,9:59:59.000
of documents that can link between each other
9:59:59.000,9:59:59.000
immediately, without having to go and fetch
9:59:59.000,9:59:59.000
a separate document. Um.
9:59:59.000,9:59:59.000
So there were lots of systems since the 1960s
9:59:59.000,9:59:59.000
that were trying to implement hypertext, like,
9:59:59.000,9:59:59.000
Xanadu is an example,
9:59:59.000,9:59:59.000
uh, but all of them were commercial, [br]expensive, closed,
9:59:59.000,9:59:59.000
and none of them were very popular.
9:59:59.000,9:59:59.000
So, Tim Berners-Lee, who is the[br]"inventor of the internet,"
9:59:59.000,9:59:59.000
[corrects himself] of the web, [br]obviously, the World Wide Web -
9:59:59.000,9:59:59.000
Um. [Tim Berners-Lee] put those two things together
9:59:59.000,9:59:59.000
by building a service that runs
9:59:59.000,9:59:59.000
on top of the internet, and he
9:59:59.000,9:59:59.000
called it the World Wide Web.
9:59:59.000,9:59:59.000
So what the World Wide Web is, is the
9:59:59.000,9:59:59.000
decentralized hypertext engine
9:59:59.000,9:59:59.000
that we use to communicate between
9:59:59.000,9:59:59.000
computers' web pages.
9:59:59.000,9:59:59.000
So what makes up WEB is three things:
9:59:59.000,9:59:59.000
URLs (or URIs) - Universal Resource Locator
9:59:59.000,9:59:59.000
which are the addresses we use [br]to find things on the web,
9:59:59.000,9:59:59.000
[#2] HTML, which is the
9:59:59.000,9:59:59.000
HyperText Markup Lanuage
9:59:59.000,9:59:59.000
which is the way that the information
9:59:59.000,9:59:59.000
is stored and sent
9:59:59.000,9:59:59.000
so that we can then use browsers
9:59:59.000,9:59:59.000
to view HTML, and then
9:59:59.000,9:59:59.000
all the documents can be understood
9:59:59.000,9:59:59.000
and then also they display the links
9:59:59.000,9:59:59.000
so that the hypertext part of it works
9:59:59.000,9:59:59.000
and we can jump around from page to page.
9:59:59.000,9:59:59.000
Um, the final part is HTTP, which is
9:59:59.000,9:59:59.000
the HyperText Transfer Protocol
9:59:59.000,9:59:59.000
which is the communication method
9:59:59.000,9:59:59.000
by which the different computers can
9:59:59.000,9:59:59.000
talk to each other and send the
9:59:59.000,9:59:59.000
HTML documents back and forth
9:59:59.000,9:59:59.000
depending on the URLs.
9:59:59.000,9:59:59.000
Um. So, when he built it, there were some
9:59:59.000,9:59:59.000
very important things that he [br]built into this system
9:59:59.000,9:59:59.000
that didn't exist before.
9:59:59.000,9:59:59.000
And the main one is
9:59:59.000,9:59:59.000
universal authorship.
9:59:59.000,9:59:59.000
So he always intended that anyone[br]would be able
9:59:59.000,9:59:59.000
to access these webpages,
9:59:59.000,9:59:59.000
and anyone would be able to
9:59:59.000,9:59:59.000
add their own webpages, without
9:59:59.000,9:59:59.000
asking for permission.
9:59:59.000,9:59:59.000
With the very explicit special condition
9:59:59.000,9:59:59.000
that anyone can link to any other webpage
9:59:59.000,9:59:59.000
without permission.
9:59:59.000,9:59:59.000
Previous hypertext systems required that
9:59:59.000,9:59:59.000
basically, for you to link to me,
9:59:59.000,9:59:59.000
I have to accept that link, and
9:59:59.000,9:59:59.000
probably create a link back to you, and
9:59:59.000,9:59:59.000
that wasn't required on the Web, which
9:59:59.000,9:59:59.000
gives us a lot of freedom to link to people
9:59:59.000,9:59:59.000
who wouldn't want us to be able [br]to link to them, for example,
9:59:59.000,9:59:59.000
so no one can say "I'm putting up free content..."
9:59:59.000,9:59:59.000
"...but you can't send your readers here,[br]because I hate you," et cetera.
9:59:59.000,9:59:59.000
The other one is that he made it [br]completely, completely free.
9:59:59.000,9:59:59.000
So in the world of[br]inter--[fumbles for words]--programming
9:59:59.000,9:59:59.000
the most free thing is generally considered
9:59:59.000,9:59:59.000
to be the GPL [General Public License]:[br]open-source, free software licenses.
9:59:59.000,9:59:59.000
uh, and Tim Berners-Lee actually almost used
9:59:59.000,9:59:59.000
the GPL, because he wanted the web software
9:59:59.000,9:59:59.000
he was building to be free.
9:59:59.000,9:59:59.000
But at the last minute he actually changed his mind
9:59:59.000,9:59:59.000
and made it full public domain, [br]because in certain ways
9:59:59.000,9:59:59.000
the GPL is actually more restrictive, because it
9:59:59.000,9:59:59.000
forces other people - like, certain commercial actors
9:59:59.000,9:59:59.000
wouldn't have wanted to use web technology
9:59:59.000,9:59:59.000
if it were GPL, so he made it full public domain,
9:59:59.000,9:59:59.000
and then from there went on to make all of the standards
9:59:59.000,9:59:59.000
as open and, uh, general and free as possible.
9:59:59.000,9:59:59.000
Uh. So that's my extremely brief[br]history of the internet.
9:59:59.000,9:59:59.000
If anyone is curious, he wrote a wonderful book
9:59:59.000,9:59:59.000
called "Weaving the Web" about his experiences
9:59:59.000,9:59:59.000
[enticing tone] As you can see, it's short!
9:59:59.000,9:59:59.000
And he has lots of interesting technical information
9:59:59.000,9:59:59.000
in it, without being overwhelming.
9:59:59.000,9:59:59.000
It's very approachable
9:59:59.000,9:59:59.000
and he's a really interesting person
9:59:59.000,9:59:59.000
and it - the book is much better than his tweets,
9:59:59.000,9:59:59.000
which are usually incoherent.
9:59:59.000,9:59:59.000
[one of the participants huffs out a "whew"]
9:59:59.000,9:59:59.000
Ellery: Ouch![br]Jeremy [?]: A few minutes?
9:59:59.000,9:59:59.000
Ellery: Thanks, that was - that was great, Jer!
9:59:59.000,9:59:59.000
Ellery: I mean, I think that that helps
9:59:59.000,9:59:59.000
um, in conversations about internet policy,
9:59:59.000,9:59:59.000
and internet governance, there's a lot of emphasis
9:59:59.000,9:59:59.000
on the ability to kind of create and innovate
9:59:59.000,9:59:59.000
without permission? Like, for every
9:59:59.000,9:59:59.000
to be able to build parts of the web, and
9:59:59.000,9:59:59.000
what you just laid out for us makes it clear
9:59:59.000,9:59:59.000
how important the Web piece of the infrastructure is
9:59:59.000,9:59:59.000
for that, for that capacity to become
9:59:59.000,9:59:59.000
a real tangible thing, and somebody that -[br][amends] something that now
9:59:59.000,9:59:59.000
we can do - we don't have to have
9:59:59.000,9:59:59.000
technical expertise to kind of build our own[br]spaces there.
9:59:59.000,9:59:59.000
Ellery: Um. So, I wanted to -[br]Jeremy: So um.
9:59:59.000,9:59:59.000
Jeremy: If I could add just one more thing, sorry -
9:59:59.000,9:59:59.000
Jeremy: I just wanted to give a couple examples
9:59:59.000,9:59:59.000
of things that happen over the internet
9:59:59.000,9:59:59.000
that aren't the web,
9:59:59.000,9:59:59.000
because that was the actual initial question.
9:59:59.000,9:59:59.000
So, one example would be torrents,
9:59:59.000,9:59:59.000
where you're the - two computers [br]connect to each other,
9:59:59.000,9:59:59.000
and stream information directly, without any URLs
9:59:59.000,9:59:59.000
being mixed into the process.
9:59:59.000,9:59:59.000
Um, another one is - email, at its core,
9:59:59.000,9:59:59.000
is its own communication protocol
9:59:59.000,9:59:59.000
that doesn't have to use the web,
9:59:59.000,9:59:59.000
although we often use web sites [br]to access and manage our email.
9:59:59.000,9:59:59.000
Umm. And then another one was the one[br]right before the Web came out,
9:59:59.000,9:59:59.000
a very popular protocol was called Gopher,
9:59:59.000,9:59:59.000
which people liked, and sort of worked like the Web
9:59:59.000,9:59:59.000
- you surf around and find things -
9:59:59.000,9:59:59.000
but it actually became commercial [br]right around the time that the web came out,
9:59:59.000,9:59:59.000
so people would've had to start paying,
9:59:59.000,9:59:59.000
and instead of starting to pay,
9:59:59.000,9:59:59.000
they switched to HTTP, HTML, and [br]the World Wide Web.
9:59:59.000,9:59:59.000
Ellery: Thank you.
9:59:59.000,9:59:59.000
Ellery: So I want to move to Alan, now... Um,
9:59:59.000,9:59:59.000
Alan built the first search engine.
9:59:59.000,9:59:59.000
And I'm kind of... like, overwhelmed, and feel sort of
9:59:59.000,9:59:59.000
like, giddy and nervous having him here.
9:59:59.000,9:59:59.000
Ellery: This is just - [br][Alan laughs]
9:59:59.000,9:59:59.000
Ellery: This is, like, a really big deal!
9:59:59.000,9:59:59.000
Ellery: So, Alan, just - if you could tell us -
9:59:59.000,9:59:59.000
'cuz I think a lot of people don't know about Archie -
9:59:59.000,9:59:59.000
um, it would be really cool just to hear
9:59:59.000,9:59:59.000
about how you sort of - what you were doing
9:59:59.000,9:59:59.000
that made you decide to, to do this
9:59:59.000,9:59:59.000
and kinda what it was like, and then, I mean,[br]everything you've seen since...
9:59:59.000,9:59:59.000
Unfortunately we're time limited, but...
9:59:59.000,9:59:59.000
Alan: Right.[br]Ellery: You know.
9:59:59.000,9:59:59.000
Alan [coughs]: Well, um, uh, well, that was back in
9:59:59.000,9:59:59.000
1989, and, I was working as a system administrator
9:59:59.000,9:59:59.000
for uh, McGill University - I was a grad student
9:59:59.000,9:59:59.000
for McGill University - and um, I was responsible
9:59:59.000,9:59:59.000
for getting software for - one of my responsibilities
9:59:59.000,9:59:59.000
was getting software for the faculty and the students.
9:59:59.000,9:59:59.000
And at the time, the three major [br]protocols on the internet
9:59:59.000,9:59:59.000
- this was pre Web, ummm -
9:59:59.000,9:59:59.000
was, uh, Telnet, which would allow you to log in
9:59:59.000,9:59:59.000
to a remote machine.
9:59:59.000,9:59:59.000
Email, ah, which would allow you to communicate
9:59:59.000,9:59:59.000
ah, with another - as we do now, with a, with a
9:59:59.000,9:59:59.000
remote machines, plural,
9:59:59.000,9:59:59.000
and, and FTP, which was the File Transfer Protocol,
9:59:59.000,9:59:59.000
which allowed you to move, ah, data files, or files
9:59:59.000,9:59:59.000
from one machine to another.
9:59:59.000,9:59:59.000
And at the time what we had was - people had made
9:59:59.000,9:59:59.000
- remember it was a non-commercial internet[br]at the time -
9:59:59.000,9:59:59.000
- actually, commercial traffic was forbidden[br]on the internet at the time,
9:59:59.000,9:59:59.000
because it was run by the [br]National Science Foundation
9:59:59.000,9:59:59.000
and it was using educational money
9:59:59.000,9:59:59.000
and therefore other than companies with
9:59:59.000,9:59:59.000
research arms, like IBM and HP [br]and those kinds of things,
9:59:59.000,9:59:59.000
we didn't have any commercial traffic on the internet,
9:59:59.000,9:59:59.000
which nowadays seems kind of amazing [br]to even think about -
9:59:59.000,9:59:59.000
and, ah, so what people did, were to provide
9:59:59.000,9:59:59.000
to provide free space on their machines
9:59:59.000,9:59:59.000
- and remember, you know, at the time,
9:59:59.000,9:59:59.000
a big disc would be a megabyte, you know -
9:59:59.000,9:59:59.000
and so people would provide common repositories
9:59:59.000,9:59:59.000
that you could deposit programs that you had written
9:59:59.000,9:59:59.000
datafiles, and documents, and that kinda stuff.
9:59:59.000,9:59:59.000
into these central repositories that were
9:59:59.000,9:59:59.000
spread around the internet.
9:59:59.000,9:59:59.000
Then other people could then retrieve them.
9:59:59.000,9:59:59.000
And so I spent a lot of my time trying to locate
9:59:59.000,9:59:59.000
software, or the information that my, the[br]students and the faculty were trying to find,
9:59:59.000,9:59:59.000
and I got tired of it.
9:59:59.000,9:59:59.000
and since I'm lazy and a geek, I...
9:59:59.000,9:59:59.000
I automated the process.
9:59:59.000,9:59:59.000
I got - instead of doing it manually, I had a bunch
9:59:59.000,9:59:59.000
of scripts wake up in the middle of the night[br]every night,
9:59:59.000,9:59:59.000
and go out and index files.
9:59:59.000,9:59:59.000
Now remember all of this was just file listings.
9:59:59.000,9:59:59.000
It's not like Google, it's not like [br]a search engine would be today,
9:59:59.000,9:59:59.000
it is just... filenames. All it was, was filenames.
9:59:59.000,9:59:59.000
And so what it would do
9:59:59.000,9:59:59.000
was it would go out every night,
9:59:59.000,9:59:59.000
list all the filenames in all the repositories,
9:59:59.000,9:59:59.000
and allow you to search lists of filenames.
9:59:59.000,9:59:59.000
And I only used it for myself!
9:59:59.000,9:59:59.000
I only used it, um, uh, for my own personal use.
9:59:59.000,9:59:59.000
Um, and at one point my boss,
9:59:59.000,9:59:59.000
who was also a student, a grad student
9:59:59.000,9:59:59.000
at the University, let Peter Deutsch let it be known
9:59:59.000,9:59:59.000
that, um, somebody was asking for, you know,
9:59:59.000,9:59:59.000
could they, could somebody tell them where, um,
9:59:59.000,9:59:59.000
y'know, a particular piece of software was.
9:59:59.000,9:59:59.000
And, uh, uh, we, um, uh... we, you know,
9:59:59.000,9:59:59.000
he came and asked me, [br]he knew we had this database
9:59:59.000,9:59:59.000
and he came and asked me if I could help out.
9:59:59.000,9:59:59.000
And I gave it to him, and if, y'know,
9:59:59.000,9:59:59.000
half a sec- half a minute later I had the information,
9:59:59.000,9:59:59.000
and so he put this posting online, and, umm.
9:59:59.000,9:59:59.000
People then started asking,
9:59:59.000,9:59:59.000
"Well, can you find this for me?"
9:59:59.000,9:59:59.000
And, you know, all these manual requests!
9:59:59.000,9:59:59.000
Basically - either through email, or UseNet postings -
9:59:59.000,9:59:59.000
- which is what we were using at the time -
9:59:59.000,9:59:59.000
we thought, this is silly,
9:59:59.000,9:59:59.000
there's no point doing these things manually
9:59:59.000,9:59:59.000
when we can just allow people access[br]to the database itself.
9:59:59.000,9:59:59.000
And in a moment of insanity,
9:59:59.000,9:59:59.000
we had to come up with a name for it,
9:59:59.000,9:59:59.000
and I said, "Okay, well, let's just call it ARCHI,"
9:59:59.000,9:59:59.000
which is "ARCHIVE" without the V
9:59:59.000,9:59:59.000
And, ah, and within about three or four months
9:59:59.000,9:59:59.000
we were consuming about half
9:59:59.000,9:59:59.000
of all of the traffic to eastern Canada [br][where McGill University is]
9:59:59.000,9:59:59.000
as this search engine became - as people, y'know -
9:59:59.000,9:59:59.000
- word of mouth -
9:59:59.000,9:59:59.000
you know, people who know about Archie
9:59:59.000,9:59:59.000
are generally people of a certain age...
9:59:59.000,9:59:59.000
...I won't mention what that age is, but
9:59:59.000,9:59:59.000
it's generally people who were in university
9:59:59.000,9:59:59.000
or working on the internet, so it would have been
9:59:59.000,9:59:59.000
so it would have been research people,
9:59:59.000,9:59:59.000
people in academia in the early nineties.
9:59:59.000,9:59:59.000
So Archie lasted for about, uh, [hems and haws]
9:59:59.000,9:59:59.000
Five years. Four or five years.
9:59:59.000,9:59:59.000
And, um, it only indexed FTP archives.
9:59:59.000,9:59:59.000
It never indexed the web.
9:59:59.000,9:59:59.000
Now, I went on, as Archie became popular,
9:59:59.000,9:59:59.000
and I got more involved in the standards process
9:59:59.000,9:59:59.000
and that kind of stuff,
9:59:59.000,9:59:59.000
I worked, uh, fairly closely with Tim Berners-Lee
9:59:59.000,9:59:59.000
to, uh, to standardize - for example,
9:59:59.000,9:59:59.000
I did the - I ran the committee
9:59:59.000,9:59:59.000
at the standard-setting body for the internet,
9:59:59.000,9:59:59.000
which is the IETF [br][Internet Engineering Task Force]
9:59:59.000,9:59:59.000
to standardize URLs.
9:59:59.000,9:59:59.000
Because Tim had come up with
9:59:59.000,9:59:59.000
a set of rules for URLS,
9:59:59.000,9:59:59.000
and as we looked at expanding that
9:59:59.000,9:59:59.000
to a larger range of resources,
9:59:59.000,9:59:59.000
we realized that those rules did not cover
9:59:59.000,9:59:59.000
all of the cases.