This paper presents a new approach to determining whether an interested personal name across doeuments refers to the same entity. Firstly,three vectors for each text are formed: the personal name Boolean vectors deno...This paper presents a new approach to determining whether an interested personal name across doeuments refers to the same entity. Firstly,three vectors for each text are formed: the personal name Boolean vectors denoting whether a personal name occurs the text the biographical word Boolean vector representing title, occupation and so forth, and the feature vector with real values. Then, by combining a heuristic strategy based on Boolean vectors with an agglomeratie clustering algorithm based on feature vectors, it seeks to resolve multi-document personal name coreference. Experimental results show that this approach achieves a good performance by testing on "Wang Gang" corpus.展开更多
Lithuanian Central State Archive is the biggest one within the state archival service and the only state archive where audiovisual documents are stored. There are more than 800,000 units of audiovisual documents in th...Lithuanian Central State Archive is the biggest one within the state archival service and the only state archive where audiovisual documents are stored. There are more than 800,000 units of audiovisual documents in the archive. The main laws regulating the activity of Lithuanian Central State Archive and related to audiovisual archiving are the Law on Documents and Archives of Lithuanian Republic, the Law of Cinema of Lithuanian Republic, and the Law on Copyright and Related Rights of Lithuanian Republic. There are four big collections of audiovisual documents in the Lithuanian Central State Archive--films, photo documents, sound recordings, and video recordings. The Archive's specialists have a large experience in the field of physical treatment and preservation of analogue audiovisual documents. Lithuanian Central State Archive digitizes audiovisual documents seeking the balance between long time preservation and nowadays access. Since May, 2010 till April 2013, Lithuanian Central State Archive implemented the project--Lithuanian documentaries on the Internet. During the project the Archives digitized and transferred to the Internet 1,000 titles of Lithuanian documentaries, created in the period 1919-1961. Lithuanian Central State Archive wants to popularize its collections, so various international projects are participated in.展开更多
This paper reports part of a study to develop a method for automatic multi-document summarization. The current focus is on dissertation abstracts in the field of sociology. The summarization method uses macro-level an...This paper reports part of a study to develop a method for automatic multi-document summarization. The current focus is on dissertation abstracts in the field of sociology. The summarization method uses macro-level and micro-level discourse structure to identify important information that can be extracted from dissertation abstracts, and then uses a variable-based framework to integrate and organize extracted information across dissertation abstracts. This framework focuses more on research concepts and their research relationships found in sociology dissertation abstracts and has a hierarchical structure. A taxonomy is constructed to support the summarization process in two ways: (1) helping to identify important concepts and relations expressed in the text, and (2) providing a structure for linking similar concepts in different abstracts. This paper describes the variable-based framework and the summarization process, and then reports the construction of the taxonomy for supporting the summarization process. An example is provided to show how to use the constructed taxonomy to identify important concepts and integrate the concepts extracted from different abstracts.展开更多
The paper sets out to consider only Dan Brown's frequent English word games and etymologies in his last novel, asking in how far these can be translated into related and unrelated languages. Thus translation is the i...The paper sets out to consider only Dan Brown's frequent English word games and etymologies in his last novel, asking in how far these can be translated into related and unrelated languages. Thus translation is the issue here in so far as Brown makes certain ideas, conversations, even events, in the novel relying on English word play, which may not be translatable展开更多
Integer overflow vulnerability will cause buffer overflow. The research on the relationship between them will help us to detect integer overflow vulnerability. We present a dynamic analysis methods RICB (Run-time Int...Integer overflow vulnerability will cause buffer overflow. The research on the relationship between them will help us to detect integer overflow vulnerability. We present a dynamic analysis methods RICB (Run-time Integer Checking via Buffer overflow). Our approach includes decompile execute file to assembly language; debug the execute file step into and step out; locate the overflow points and checking buffer overflow caused by integer overflow. We have implemented our approach in three buffer overflow types: format string overflow, stack overflow and heap overflow. Experiments results show that our approach is effective and efficient. We have detected more than 5 known integer overflow vulnerabilities via buffer overflow.展开更多
文摘This paper presents a new approach to determining whether an interested personal name across doeuments refers to the same entity. Firstly,three vectors for each text are formed: the personal name Boolean vectors denoting whether a personal name occurs the text the biographical word Boolean vector representing title, occupation and so forth, and the feature vector with real values. Then, by combining a heuristic strategy based on Boolean vectors with an agglomeratie clustering algorithm based on feature vectors, it seeks to resolve multi-document personal name coreference. Experimental results show that this approach achieves a good performance by testing on "Wang Gang" corpus.
文摘Lithuanian Central State Archive is the biggest one within the state archival service and the only state archive where audiovisual documents are stored. There are more than 800,000 units of audiovisual documents in the archive. The main laws regulating the activity of Lithuanian Central State Archive and related to audiovisual archiving are the Law on Documents and Archives of Lithuanian Republic, the Law of Cinema of Lithuanian Republic, and the Law on Copyright and Related Rights of Lithuanian Republic. There are four big collections of audiovisual documents in the Lithuanian Central State Archive--films, photo documents, sound recordings, and video recordings. The Archive's specialists have a large experience in the field of physical treatment and preservation of analogue audiovisual documents. Lithuanian Central State Archive digitizes audiovisual documents seeking the balance between long time preservation and nowadays access. Since May, 2010 till April 2013, Lithuanian Central State Archive implemented the project--Lithuanian documentaries on the Internet. During the project the Archives digitized and transferred to the Internet 1,000 titles of Lithuanian documentaries, created in the period 1919-1961. Lithuanian Central State Archive wants to popularize its collections, so various international projects are participated in.
文摘This paper reports part of a study to develop a method for automatic multi-document summarization. The current focus is on dissertation abstracts in the field of sociology. The summarization method uses macro-level and micro-level discourse structure to identify important information that can be extracted from dissertation abstracts, and then uses a variable-based framework to integrate and organize extracted information across dissertation abstracts. This framework focuses more on research concepts and their research relationships found in sociology dissertation abstracts and has a hierarchical structure. A taxonomy is constructed to support the summarization process in two ways: (1) helping to identify important concepts and relations expressed in the text, and (2) providing a structure for linking similar concepts in different abstracts. This paper describes the variable-based framework and the summarization process, and then reports the construction of the taxonomy for supporting the summarization process. An example is provided to show how to use the constructed taxonomy to identify important concepts and integrate the concepts extracted from different abstracts.
文摘The paper sets out to consider only Dan Brown's frequent English word games and etymologies in his last novel, asking in how far these can be translated into related and unrelated languages. Thus translation is the issue here in so far as Brown makes certain ideas, conversations, even events, in the novel relying on English word play, which may not be translatable
基金Supported by the National Natural Science Foundation of China (60903188), Shanghai Education Commission Innovation Foundation (11YZ192) and World Expo Science and Technology Special Fund of Shanghai Science and Technology Commission (08dz0580202).
文摘Integer overflow vulnerability will cause buffer overflow. The research on the relationship between them will help us to detect integer overflow vulnerability. We present a dynamic analysis methods RICB (Run-time Integer Checking via Buffer overflow). Our approach includes decompile execute file to assembly language; debug the execute file step into and step out; locate the overflow points and checking buffer overflow caused by integer overflow. We have implemented our approach in three buffer overflow types: format string overflow, stack overflow and heap overflow. Experiments results show that our approach is effective and efficient. We have detected more than 5 known integer overflow vulnerabilities via buffer overflow.