admin 管理员组

文章数量: 887044


2024年1月27日发(作者:mongodb数据库的优势)

附录一外文原文

(摘自Marty Hall and Larry Brown,2000-07,Core Servlets and

JavaServer Pages 第一章)

An Overview of Servlet and JSP Technology

1.1 A Servlet's Job

Servlets are Java programs that run on Web or application servers, acting as

a middle layer between requests coming from Web browsers or other HTTP

clients and databases or applications on the HTTP server. Their job is to perform

the following tasks, as illustrated in Figure 1-1.

Figure 1-1

1.Read the explicit data sent by the client.

The end user normally enters this data in an HTML form on a Web page.

However, the data could also come from an applet or a custom HTTP client

program.

2.Read the implicit HTTP request data sent by the browser.

Figure 1-1 shows a single arrow going from the client to the Web server

(the layer where servlets and JSP execute), but there are really two varieties of

data: the explicit data that the end user enters in a form and the

behind-the-scenes HTTP information. Both varieties are critical. The HTTP

information includes cookies, information about media types and compression

schemes the browser understands, and so on.

3.Generate the results.

This process may require talking to a database, executing an RMI or EJB

call, invoking a Web service, or computing the response directly. Your real data

may be in a relational database. Fine. But your database probably doesn't speak

HTTP or return results in HTML, so the Web browser can't talk directly to the

database. Even if it could, for security reasons, you probably would not want it

to. The same argument applies to most other need the Web

middle layer to extract theresultsinside a document.

4.Send the explicit data (i.e., the document) to the client.

This document can be sent in a variety of formats, including text (HTML or

XML), binary (GIF images), or even a compressed format like gzip that is

layered on top of some other underlying format. But, HTML is by far the most

common format, so an important servlet/JSP task is to wrap the results inside of

HTML.

5.Send the implicit HTTP response data.

Figure 1-1 shows a single arrow going from the Web middle layer (the

servlet or JSP page) to the client. But, there are really two varieties of data sent:

the document itself and the behind-the-scenes HTTP information. Again, both

varieties are critical to effective development. Sending HTTP response data

involves telling the browser or other client what type of document is being

returned (e.g., HTML), setting cookies and caching parameters, and other such

tasks.

1.2 Why Build Web Pages Dynamically?

Many client requests can be satisfied by prebuilt documents, and the server

would handle these requests without invoking servlets. In many cases, however,

a static result is not sufficient, and a page needs to be generated for each request.

There are a number of reasons why Web pages need to be built on-the-fly:

1.The Web page is based on data sent by the client.

For instance, the results page from search engines and order-confirmation

pages at online stores are specific to particular user requests. You don't know

what to display until you read the data that the user submits. Just remember that

the user submits two kinds of data: explicit (i.e., HTML form data) and implicit

(i.e., HTTP request headers). Either kind of input can be used to build the output

page. In particular, it is quite common to build a user-specific page based on a

cookie value.

2.The Web page is derived from data that changes frequently.

If the page changes for every request, then you certainly need to build the

response at request time. If it changes only periodically, however, you could do

it two ways: you could periodically build a new Web page on the server

(independently of client requests), or you could wait and only build the page

when the user requests it. The right approach depends on the situation, but

sometimes it is more convenient to do the latter: wait for the user request. For

example, a weather report or news headlines site might build the pages

dynamically, perhaps returning a previously built page if that page is still up to

date.

3.The Web page uses information from corporate databases or other

server-side sources.

If the information is in a database, you need server-side processing even if

the client is using dynamic Web content such as an applet. Imagine using an

applet by itself for a search engine site:

"Downloading 50 terabyte applet, please wait!" Obviously, that is silly; you

need to talk to the database. Going from the client to the Web tier to the database

(a three-tier approach) instead of from an applet directly to a database (a two-tier

approach) provides increased flexibility and security with little or no

performance penalty. After all, the database call is usually the rate-limiting step,

so going through the Web server does not slow things down. In fact, a three-tier

approach is often faster because the middle tier can perform caching and

connection pooling.

In principle, servlets are not restricted to Web or application servers that

handle HTTP requests but can be used for other types of servers as well. For

example, servlets could be embedded in FTP or mail servers to extend their

functionality. And, a servlet API for SIP (Session Initiation Protocol) servers

was recently standardized (see /en/jsr/detail?id=116). In practice,

however, this use of servlets has not caught on, and we'll only be discussing

HTTP servlets.

1.3 The Advantages of Servlets Over "Traditional" CGI

Java servlets are more efficient, easier to use, more powerful, more portable,

safer, and cheaper than traditional CGI and many alternative CGI-like

technologies.

1.Efficient

With traditional CGI, a new process is started for each HTTP request. If the

CGI program itself is relatively short, the overhead of starting the process can

dominate the execution time. With servlets, the Java virtual machine stays

running and handles each request with a lightweight Java thread, not a

heavyweight operating system process. Similarly, in traditional CGI, if there are

N requests to the same CGI program, the code for the CGI program is loaded

into memory N times. With servlets, however, there would be N threads, but

only a single copy of the servlet class would be loaded. This approach reduces

server memory requirements and saves time by instantiating fewer objects.

Finally, when a CGI program finishes handling a request, the program

terminates. This approach makes it difficult to cache computations, keep

database connections open, and perform other optimizations that rely on

persistent data. Servlets, however, remain in memory even after they complete a

response, so it is straightforward to store arbitrarily complex data between client

requests.

2.Convenient

Servlets have an extensive infrastructure for automatically parsing and

decoding HTML form data, reading and setting HTTP headers, handling cookies,

tracking sessions, and many other such high-level utilities. In CGI, you have to

do much of this yourself. Besides, if you already know the Java programming

language, why learn Perl too? You're already convinced that Java technology

makes for more reliable and reusable code than does Visual Basic, VBScript, or

C++. Why go back to those languages for server-side programming?

3.Powerful

Servlets support several capabilities that are difficult or impossible to

accomplish with regular CGI. Servlets can talk directly to the Web server,

whereas regular CGI programs cannot, at least not without using a

server-specific API. Communicating with the Web server makes it easier to

translate relative URLs into concrete path names, for instance. Multiple servlets

can also share data, making it easy to implement database connection pooling

and similar resource-sharing optimizations. Servlets can also maintain

information from request to request, simplifying techniques like session tracking

and caching of previous computations.

4.Portable

Servlets are written in the Java programming language and follow a

standard API. Servlets are supported directly or by a plugin on virtually every

major Web server. Consequently, servlets written for, say, Macromedia JRun can

run virtually unchanged on Apache Tomcat, Microsoft Internet Information

Server (with a separate plugin), IBM WebSphere, iPlanet Enterprise Server,

Oracle9i AS, or StarNine WebStar. They are part of the Java 2 Platform,

Enterprise Edition (J2EE; see /j2ee/), so industry support for

servlets is becoming even more pervasive.

5.Inexpensive

A number of free or very inexpensive Web servers are good for

development use or deployment of low- or medium-volume Web sites. Thus,

with servlets and JSP you can start with a free or inexpensive server and migrate

to more expensive servers with high-performance capabilities or advanced

administration utilities only after yourproject meets initial success. This is in

contrast to many of the other CGI alternatives, which require a significant initial

investment for the purchase of a proprietarypackage.

Price and portability are somewhat connected. For example, Marty tries to

keep track of the countries of readers that send him questions by email. India

was near the top of the list, probably #2 behind the U.S. Marty also taught one

of his JSP and servlet training courses (see /) in

Manila, and there was great interest in servlet and JSP technology there.

Now, why are India and the Philippines both so interested? We surmise that

the answer is twofold. First, both countries have large pools of well-educated

software developers. Second, both countries have (or had, at that time) highly

unfavorable currency exchange rates against the U.S. dollar. So, buying a

special-purpose Web server from a U.S. company consumed a large part of early

project funds.

But, with servlets and JSP, they could start with a free server: Apache

Tomcat (either standalone, embedded in the regular Apache Web server, or

embedded in Microsoft IIS). Once the project starts to become successful, they

could move to a server like Caucho Resin that had higher performance and

easier administration but that is not free. But none of their servlets or JSP pages

have to be rewritten. If their project becomes even larger, they might want to

move to a distributed (clustered) environment. No problem: they could move to

Macromedia JRun Professional, which supports distributed applications (Web

farms). Again, none of their servlets or JSP pages have to be rewritten. If the

project becomes quite large and complex, they might want to use Enterprise

JavaBeans (EJB) to encapsulate their business logic. So, they might switch to

BEA WebLogic or Oracle9i AS. Again, none of their servlets or JSP pages have

to be rewritten. Finally, if their project becomes even bigger, they might move it

off of their Linux box and onto an IBM mainframe running IBM WebSphere.

But once again, none of their servlets or JSP pages have to be rewritten.

6.Secure

One of the main sources of vulnerabilities in traditional CGI stems from the

fact that the programs are often executed by general-purpose operating system

shells. So, the CGI programmer must be careful to filter out characters such as

backquotes and semicolons that are treated specially by the shell. Implementing

this precaution is harder than one might think, and weaknesses stemming from

this problem are constantly being uncovered in widely used CGI libraries.

A second source of problems is the fact that some CGI programs are

processed by languages that do not automatically check array or string bounds.

For example, in C and C++ it is perfectly legal to allocate a 100-element array

and then write into the 999th "element," which is really some random part of

program memory. So, programmers who forget to perform this check open up

their system to deliberate or accidental buffer overflow attacks.

Servlets suffer from neither of these problems. Even if a servlet executes a

system call (e.g., with or JNI) to invoke a program on the local

operating system, it does not use a shell to do so. And, of course, array bounds

checking and other memory protection features are a central part of the Java

programming language.

7.Mainstream

There are a lot of good technologies out there. But if vendors don't support

them and developers don't know how to use them, what good are they? Servlet

and JSP technology is supported by servers from Apache, Oracle, IBM, Sybase,

BEA, Macromedia, Caucho, Sun/iPlanet, New Atlanta, ATG, Fujitsu, Lutris,

Silverstream, the World Wide Web Consortium (W3C), and many others.

Several low-cost plugins add support to Microsoft IIS and Zeus as well. They

run on Windows, Unix/Linux, MacOS, VMS, and IBM mainframe operating

systems. They are the single most popular application of the Java programming

language. They are arguably the most popular choice for developing medium to

large Web applications. They are used by the airline industry (most United

Airlines and Delta Airlines Web sites), e-commerce (), online banking

(First USA Bank, Banco Popular de Puerto Rico), Web search engines/portals

(), large financial sites (American Century Investments), and

hundreds of other sites that you visit every day.

Of course, popularity alone is no proof of good technology. Numerous

counter-examples abound. But our point is that you are not experimenting with a

new and unproven technology when you work with server-side Java.

附录二中文翻译

Servlet和JSP技术简介

1.1 Servlet的功能

Servlets是运行在Web或应用服务器上的Java程序,它是一个中间层,负责连接来自Web浏览器或其他HTTP客户端和HTTP服务器上的数据库或应用程序。Servlet的工作是执行下面的任务,如图1-1所示。

图1-1

1.读取客户端发送的显式数据。

客户端用户一般在页面的HTML表单中输入这些数据。然而,数据还有可能来自applet或定制的HTTP客户程序。

2.读取由浏览器发送的隐式请求数据。

图1-1中显示了一条从客户端到Web服务器的单箭头,但实际上从客户端传送到Web服务器的数据有两种,它们分别为用户在表单中输入的显式数据,以及后台的HTTP信息。两种数据都很重要。HTTP信息包括cookie、浏览器所能识别的媒体类型和压缩模式等。

3.生成结果。

这个过程可能需要访问数据库、执行RMI或EJB调用、调用Web服务,或者直接计算得出对应的响应。实际的数据可能存储在关系型数据库中。但是该数据库可能不理解HTTP,或者不能返回HTML形式的结果,所以Web浏览器不能直接与数据库进行会话。即使它能够做到这一点,为了安全上的考虑,我们也不希望让它这么做。对应大多数其他应用程序,也存在类似的问题。因此,我们需要Web中间层从HTTP流中提取输入数据,与应用程序会话,并将结果嵌入到文档中。

4.向客户发送显式数据(即文档)。

这个文档可以用各种格式发送,包括文本(HTML或XML),二进制(GIF

图像),甚至可以是建立在其他底层格式之上的压缩格式,如gzip。但是,到目前为止,HTML式最常用的格式,故而Servlet/JSP的重要任务之一就是将结果包装到HTML中。

5.发送隐式的HTTP响应数据。

图1-1中显示了一条从Web中间层到客户端的单箭头。但是,实际发送的数据有两种:文档本身,以及后台的HTTP信息。同样,两种数据对开发来说都是至关重要的。HTTP响应数据的发送过程涉及告知浏览器或其他客户程序所返回文档的类型(如HTML),设置cookie和缓存参数,以及其他类似的任务。

1.2 动态构建网页的原因

预先建立的文档可以满足客户的许多请求,服务器无需调用servlet就可以处理这些请求。然而,许多情况下静态的结果不能满足要求,我们需要针对每个请求生成一个页面。实时构建页面的理由有很多种:

1、网页基于客户发送的数据。

例如,搜索引擎生成的页面,以及在线商店的订单确认页面,都要针对特定的用户请求而产生。在没有读取到用户提交的数据之前,我们不知道应该显示什么。要记住,用户提交两种类型的数据:显示(即HTML表单的数据)和隐式(即HTTP请求的报头)。两种输入都可用来构建输出页面。基于cookie值针对具体用户构建页面的情况尤其普遍。

2、页面由频繁改变的数据导出。

如果页面需要根据每个具体的请求做出相应的改变,当然需要在请求发生时构建响应。但是,如果页面周期性地改变,我们可以用两种方式来处理它:周期性地在服务器上构建新的页面(和客户请求无关),或者仅仅在用户请求该页面时再构建。具体应该采用哪种方式要根据具体情况而定,但后一种方式常常更为方便,因为它只需简单地等待用户的请求。例如,天气预报或新闻网站可能会动态地构建页面,也有可能会返回之前构建的页面(如果它还是最新的话)。

3、页面中使用了来自公司数据库或其他数据库断数据源的信息。

如果数据存储在数据库中,那么,即使客户端使用动态Web内容,比如applet,我们依旧需要执行服务器端处理。想象以下,如果一个搜索引擎网站完全使用

applet,那么用户将会看到:“正在下载50TB的applet,请等待!”。显然,这样很愚蠢;这种情况下,我们需要与数据库进行会话。从客户端到Web层再到数据库(三层结构),要比从applet直接到数据库(二层结构)更灵活,也更安全,而性能上的损失很少甚至没有。毕竟数据库调用通常是对速度影响最大的步骤,因而,经过中间层可以执行高速缓存和连接共享。

理论上讲,servlet并非只用于处理HTTP请求的Web服务器或应用服务器,它同样可以用于其他类型的服务器。例如,servlet能够嵌入到FTP或邮件服务器中,扩展他们的功能。而且,用于会话启动协议服务器的servlet API最近已经被标准化(参见/en/jsr/detail?id=116)。但在实践中,servlet的这种用法尚不流行,在此,我们只论述HTTP Servlet。

1.3 Servlet相对于“传统”CGI的优点

和传统CGI及许多类CGI技术相比,Java servlet更高效、更易用、更强大、更容易移植、更安全、也更廉价。

1、高效

应用传统的CGI,针对每个HTTP请求都用启动一个新的进程。如果CGI程序自身相对比较简短,那么启动进程的开销会占用大部分执行时间。而使用servlet,Java虚拟机会一直运行,并用轻量级的Java线程处理每个请求,而非重量级的操作系统进程。类似地,应用传统的CGI技术,如果存在对同一CGI程序的N个请求,那么CGI程序的代码会载入内存N次。同样的情况,如果使用servlet,则启动N个线程,但是仅仅载入servlet类的单一副本。这种方式减少了服务器的内存需求,通过实例化更少的对象从而节省了时间。最后,当CGI程序结束对请求的处理之后,程序结束。这种方式难以缓存计算结果,保持数据库连接打开,或是执行依靠持续性数据的其他优化。然而,servlet会一直停留在内存中(即使请求处理完毕),因而可以直接存储客户请求之间的任意复杂数据。

2、便利

Servlet提供大量的基础构造,可以自动分析和解码HTML的表单数据,读取和设置HTTP报头,处理cookie,跟踪会话,以及其他此类高级功能。而在CGI中,大部分工作都需要我们资金完成。另外,如果您已经了解了Java编程语言,为什么还有学校Perl呢?您已经承认应用Java技术编写的代码要比Visual

Basic,VBScript或C++编写的代码更可靠,且更易重用,为什么还有倒退回去选择那些语言来开发服务器端的程序呢?

3、强大

Servlet支持常规CGI难以实现或根本不能实现的几项功能。Servlet能够直接于Web服务器对话,而常规的CGI程序做不到这一点,至少在不使用服务器专有API的情况下是这样。例如,与Web服务器的通信使得讲相对URL转换成具体的路径名变得更为容易。多个servlet还可以共享数据,从而易于实现数据库连接共享和类似的资源共享优化。Servlet还能维护请求之间的信息,使得诸如会话跟踪和计算结果缓存等技术变得更为简单。

4、可移植性

Servlet使用Java编程语言,并且遵循标准的API。所有主要的Web服务器。实际上都直接或通过插件支持servlet。因此。为Macromedia JRun编写的servlet,可以不经过任何修改地在Apache Tomcat,Microsoft Internet Information Server,IBM WebSphere,iPlanet Enterprise Serve,Oracle9i AS 或者StrNine WebStar上运行。他们是java2平台企业版的一部分,所以对servlet的支持越来越普遍。

5、廉价

对于开发用的网站、低容量或中等容量网站的部署,有大量免费或极为廉价的Web服务器可供选择。因此,通过使用servlet和jsp,我们可以从免费或廉价的服务器开始,在项目获得初步成功后,在移植到更高性能或高级管理工具的昂贵的服务器上。这与其他CGI方案形成鲜明的对比,这些CGI方案在初期都需要为购买专利软件包投入大量的资金。

价格和可移植性在某种程度上是相互关联的。例如,Marty记录了所有通过电子邮件向他发送问题的读者的所在国。印度接近列表的顶端,可能仅次于美国。Marty曾在马尼拉讲授过jsp和servlet培训课程,那儿对servlet和jsp技术抱很大的兴趣。

那么,为什么印度和菲律宾都对这项技术这么感兴趣呢?我们推测答案可能分两部分。首先,这两个国家都拥有大量训练有素的软件开发人员。其次,这两个国家的货币对美元的汇率都极为不利。因此,从美国公司那里购买专用Web服务器会消耗掉项目的大部分前期资金。

但是,使用servlet 和JSP,他们能够从免费的服务器开始:Apache Tomcat。项目取得成功之后,他们可以转移到性能更高、管理更容易,但需要付费的服务器。他们的servlet和jsp不需要小红心编写。如果他们的项目变得更庞大,他们或许希望转移到分布式环境。没有问题:他们可以转而使用Macromedia JRun

Professional,该服务器支持分布式应用。同样,他们的servlet和jsp没有任何部分需要重写。如果项目变得极为庞大,错综复杂,他们或许希望使用Enterprise

JavaBeans来封装他们的商业逻辑。因此,他们可以切换到BEA WebLogic或Oracle9i AS。同样,不需要对servlet和jsp做出更改。最后,如果他们的项目变得更庞大,他们或许将他从Linux转移到运行IBM WebSphere的IBM大型机上。他们依旧还是不需要做出任何更改。

6、安全

传统CGI程序中主要的漏洞来源之一就是,CGI程序常常由通过的操作系统外壳来执行。因此,CGI程序必须仔细地过滤掉那些可能被外壳特殊处理的字符,如反引导和分号。实现这项预防措施的难度可能超出我们的想象,在广泛应用的CGI库中,不断发现由这类问题引发的弱点。

问题的第二个来源是,一些CGI程序用不自动检查数组和字符串边界的语言编写而成。例如,在C和C++中,可以分配一个100个元素的数组,然后向第999个“元素”写入数据——实际上是程序内存的随机部分,这完全合法。因而,如果程序员忘记执行这项检查,就会将系统暴露在蓄意或偶然的缓冲区溢出攻击之下。

Servlet不存在这些问题。即使servlet执行系统调用激活本地操作系统上的程序,它也不会用到外壳来完成这项任务。当然,数组边界的检查以及其他内存包含特性是java编程语言的核心部分。

7、主流

虽然存在许多很好的技术,但是,如果提供商不支持他们,或开发人员不知道如何使用这些技术,那么它们的优点又如何体现呢?servlet和jsp技术得到服务器提供商的广泛支持,包括Apache,Oracle,IBM,Sybase,BEA,Macromedia,Caucho,Sun/iPlanet,New Atlanta,ATG,Fujitsu,Lutris,Silverstream,万维网联盟(W3C),以及其他服务器。存在几种低廉的插件,通过应用这些插件,

Microsoft IIS和Zeus也同样支持servlet和jsp技术,它们运行在Windows,Unix/Linus,MacOS,VMS和IBM大型机操作系统之上。它们用在航空业、电子商务、在线银行、web搜索引擎、门户、大型金融网站、以及成百上千您日常光顾的其他网站。

当然,仅仅是流行并不能证明技术的优越性。很多泛美的例子。但我们的立场是:服务器端Java本非一项新的、未经证实的技术。


本文标签: 数据 服务器 请求 数据库 需要