CN115914178B - VOIP real-time audio and video call method, system and device - Google Patents
VOIP real-time audio and video call method, system and device Download PDFInfo
- Publication number
- CN115914178B CN115914178B CN202310032604.3A CN202310032604A CN115914178B CN 115914178 B CN115914178 B CN 115914178B CN 202310032604 A CN202310032604 A CN 202310032604A CN 115914178 B CN115914178 B CN 115914178B
- Authority
- CN
- China
- Prior art keywords
- terminal
- data stream
- media data
- gateway server
- identity authentication
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 238000000034 method Methods 0.000 title claims abstract description 45
- 230000000977 initiatory effect Effects 0.000 claims abstract description 54
- 230000004044 response Effects 0.000 claims abstract description 42
- 230000003993 interaction Effects 0.000 claims description 20
- 238000004590 computer program Methods 0.000 claims description 10
- 230000007958 sleep Effects 0.000 claims description 8
- 210000001503 joint Anatomy 0.000 abstract description 2
- 238000010586 diagram Methods 0.000 description 8
- 230000008569 process Effects 0.000 description 6
- 238000005516 engineering process Methods 0.000 description 5
- 230000006870 function Effects 0.000 description 5
- 239000003999 initiator Substances 0.000 description 4
- 238000004891 communication Methods 0.000 description 3
- 230000008901 benefit Effects 0.000 description 2
- 230000009286 beneficial effect Effects 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 230000005059 dormancy Effects 0.000 description 1
- 230000002452 interceptive effect Effects 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 230000011664 signaling Effects 0.000 description 1
Images
Landscapes
- Telephonic Communication Services (AREA)
Abstract
The embodiment of the invention discloses a VOIP real-time audio and video call method, a system and a device, which comprise the following steps: receiving a request instruction from an initiating terminal, carrying out identity authentication on the initiating terminal based on the request instruction, and returning an initiating terminal passing instruction after the identity authentication is passed; sending a video call request to a VOIP gateway server, receiving response information from the VOIP gateway server when a called end goes off-hook, and establishing a call; detecting whether a first media data stream from an initiating terminal or a second media data stream from a VOIP gateway server is received, transcoding the first media data stream and forwarding the first media data stream to the VOIP gateway server when the first media data stream or the second media data stream is received so as to enable the first media data stream to be sent to a called terminal; the second media data stream is forwarded to the originating terminal. By arranging the RTMP gateway server and the VOIP gateway server to respectively butt joint the initiating terminal and the called terminal, the intelligent terminal and the SIP terminal implement an audio-video intercommunication function, and the compatibility is stronger.
Description
Technical Field
The invention relates to the technical field of computer audio and video communication, in particular to a VOIP real-time audio and video call method, a system and a device.
Background
Currently, RTMP is a real-time messaging protocol (Real Time Messaging Protocol). The protocol is based on TCP and is a protocol family, and comprises RTMP basic protocol, RTMPT/RTMPS/RTMPE and other varieties. RTMP is a network protocol designed for real-time data communication, and is mainly used for audio-video and data communication between Flash/AIR platform and streaming media/interactive server supporting RTMP protocol.
VOIP is a voice over IP technology (Voice over Internet Protocol) that refers to a technology that compresses and packages analog voice signals, and then transmits the voice signals in the form of data packets in the environment of an IP network.
In the field of real-time audio-video telephony, there are several implementations: traditional soft phone implementation mode based on SIP+RTP technology; web page implementation mode based on WSS+WebRTC technology; the real-time audio and video call can be realized through the browser, each browser manufacturer provides support for the WebRTC technology in sequence, and the browser manufacturer realizes the problem of a plurality of compatibility caused by the difference; the user experience is also poor.
Disclosure of Invention
Aiming at the defects, the embodiment of the invention discloses a VOIP real-time audio and video call method, a system and a device, which can solve the problem of compatibility of traditional audio and video calls.
The first aspect of the embodiment of the invention discloses a VOIP real-time audio and video call method, which comprises the following steps:
receiving a request instruction from an initiating terminal, carrying out identity authentication on the initiating terminal based on the request instruction, and returning an initiating terminal passing instruction after the identity authentication is passed;
sending a video call request to a VOIP gateway server, receiving ringing response information from the VOIP gateway server when a called end goes off-hook, playing ringing audio and video to an initiating end, and establishing a call;
detecting whether a first media data stream from an initiating terminal or a second media data stream which is from a VOIP gateway server and transcoded by a media coding and decoding module is received, and when the first media data stream or the second media data stream is received, transcoding the first media data stream and forwarding the first media data stream to the VOIP gateway server so that the VOIP gateway server sends the first media data stream to a called terminal; and forwarding the second media data stream to an initiating terminal.
In a first aspect of the embodiment of the present invention, the initiating terminal includes a plug-flow terminal and a pull-flow terminal.
In a first aspect of the embodiment of the present invention, the receiving a request instruction from an initiator, and authenticating the identity of the initiator based on the request instruction includes:
receiving a call request instruction from a push end, carrying out push end identity authentication on the push end, and returning a push end connection success response instruction after the push end identity authentication is passed;
and receiving a data request instruction from the pulling end, carrying out pulling end identity authentication on the pulling end, and returning a pulling end data connection response instruction after the pulling end identity authentication passes.
In a first aspect of the embodiment of the present invention, the receiving a call request instruction from a push end, performing push end identity authentication on the push end, and returning a push end connection success response instruction after the push end identity authentication passes, includes:
receiving handshake and connection requests from a plug-flow end, carrying out first plug-flow end identity authentication on the plug-flow end, and returning a plug-flow end connection success response instruction after the first plug-flow end identity authentication is passed;
receiving a data interaction request from a pushing end, carrying out second pushing end identity authentication on the pushing end, and returning a pushing end interaction response instruction after the second pushing end identity authentication is passed;
and receiving a release request from the pushing end, carrying out third-time pushing end identity authentication on the pushing end, and returning a pushing end release response instruction after the third-time pushing end identity authentication passes.
In a first aspect of the embodiment of the present invention, the receiving a data request instruction from a pull-stream end, performing pull-stream end identity authentication on the pull-stream end, and returning a pull-stream end data connection response instruction after the pull-stream end identity authentication passes, includes:
receiving handshake and connection requests from a pulling end, carrying out first pulling end identity authentication on the pulling end, and returning a pulling end connection success response instruction after the first pulling end identity authentication is passed;
receiving a data interaction request from a pulling end, carrying out second pulling end identity authentication on the pulling end, and returning a pulling end interaction response instruction after the second pulling end identity authentication is passed;
and receiving a play request from the streaming end, carrying out third streaming end identity authentication on the streaming end, and returning a streaming end play response instruction after the third streaming end identity authentication is passed.
As an optional implementation manner, in the first aspect of the embodiment of the present invention, when the first media data stream is not received, the method further includes controlling the session to sleep for a preset duration and counting a preset number of times in the sleep preset duration, continuously detecting whether the first media data stream is received within the preset duration and the preset number of times, and ending the session when the first media data stream is not detected to be received beyond the preset duration or beyond the preset number of times.
The second aspect of the embodiment of the invention discloses a VOIP real-time audio and video call system, which comprises a push terminal, a pull terminal, an RTMP gateway server, a media coding and decoding module and a VOIP gateway server, wherein the push terminal, the pull terminal, the media coding and decoding module and the VOIP gateway server are all connected with the RTMP gateway server, the media coding and decoding module is connected with the VOIP gateway server, the push terminal is used for initiating a call request instruction, the pull terminal is used for initiating a data request instruction, the RTMP gateway server is used for carrying out identity authentication on the push terminal based on the call request instruction, carrying out identity authentication on the pull terminal based on the data request instruction, sending a video call request to the VOIP gateway server, sending a first media data stream from the push terminal to the media coding and decoding module, then forwarding the first media data stream to the VOIP gateway server, and forwarding a second media data stream which is coded and decoded by the media coding and decoding module to the pull terminal.
A third aspect of the embodiment of the present invention discloses a VOIP real-time audio/video call device, including:
the instruction receiving module: the system comprises a receiving module, a receiving module and a sending module, wherein the receiving module is used for receiving a request instruction from an initiating terminal, carrying out identity authentication on the initiating terminal based on the request instruction, and returning an initiating terminal passing instruction after the identity authentication passes;
and a call establishment module: the method comprises the steps of sending a video call request to a VOIP gateway server, receiving ringing response information from the VOIP gateway server when a called terminal goes off-hook, playing ringing audio and video to an initiating terminal, and establishing a call;
and a data interaction module: the method comprises the steps of detecting whether a first media data stream from an initiating terminal or a second media data stream which is from a VOIP gateway server and transcoded by a media coding and decoding module is received, and when the first media data stream or the second media data stream is received, transcoding the first media data stream and forwarding the first media data stream to the VOIP gateway server so that the VOIP gateway server can send the first media data stream to a called terminal; and forwarding the second media data stream to an initiating terminal.
A fourth aspect of an embodiment of the present invention discloses an electronic device, including: a memory storing executable program code; a processor coupled to the memory; the processor invokes the executable program codes stored in the memory to execute the VOIP real-time audio-video call method disclosed in the first aspect of the embodiment of the present invention.
A fifth aspect of the embodiments of the present invention discloses a computer-readable storage medium storing a computer program, where the computer program causes a computer to execute the VOIP real-time audio-video call method disclosed in the first aspect of the embodiments of the present invention.
Compared with the prior art, the embodiment of the invention has the following beneficial effects:
according to the embodiment of the invention, the RTMP gateway server and the VOIP gateway server are respectively abutted to the initiating terminal and the called terminal, so that the implementation of the audio-video intercommunication function between the intelligent terminal and the SIP terminal is realized, the compatibility is stronger, the media server is additionally arranged to encode and decode data without caching the data, the advantage of low delay is realized, whether the media data stream is received during the conversation is monitored, and the effective real-time intercommunication of the media data stream can be realized.
Drawings
In order to more clearly illustrate the technical solutions of the embodiments of the present invention, the drawings that are needed in the embodiments will be briefly described below, and it is obvious that the drawings in the following description are only some embodiments of the present invention, and other drawings may be obtained according to these drawings without inventive effort for a person skilled in the art.
Fig. 1 is a flow chart of a VOIP real-time audio-video call method according to an embodiment of the present invention;
fig. 2 is a schematic block diagram of a VOIP real-time audio-video call system according to an embodiment of the present invention;
fig. 3 is a schematic flow diagram of a VOIP real-time audio-video call system according to an embodiment of the present invention;
fig. 4 is a schematic structural diagram of a VOIP real-time audio-video call device according to an embodiment of the present invention;
fig. 5 is a schematic structural diagram of an electronic device according to an embodiment of the present invention.
Detailed Description
The following description of the embodiments of the present invention will be made clearly and completely with reference to the accompanying drawings, in which it is apparent that the embodiments described are only some embodiments of the present invention, but not all embodiments. All other embodiments, which can be made by those skilled in the art based on the embodiments of the invention without making any inventive effort, are intended to be within the scope of the invention.
It should be noted that the terms "first," "second," "third," "fourth," and the like in the description and in the claims of the present invention are used for distinguishing between different objects and not necessarily for describing a particular sequential or chronological order. The terms "comprises," "comprising," and "having," and any variations thereof, are intended to cover a non-exclusive inclusion, such that a process, method, system, article, or apparatus that comprises a list of steps or elements is not necessarily limited to those steps or elements expressly listed or inherent to such process, method, article, or apparatus.
The embodiment of the invention discloses a VOIP real-time audio and video call method, a system, a device, electronic equipment and a storage medium, wherein an initiating terminal and a called terminal are respectively in butt joint by setting an RTMP gateway server and a VOIP gateway server, so that an audio and video intercommunication function between an intelligent terminal and an SIP terminal is realized, the compatibility is stronger, a media server is additionally arranged to encode and decode data without caching the data, the advantage of low delay is realized, whether media data streams are received during the call is monitored, and the effective real-time intercommunication of the media data streams can be realized.
Example 1
Referring to fig. 1, fig. 1 is a flowchart of a VOIP real-time audio-video call method according to an embodiment of the present invention. The execution main body of the method described in the embodiment of the invention is an execution main body composed of software or/and hardware, and the execution main body can receive related information in a wired or/and wireless mode and can send a certain instruction. Of course, it may also have certain processing and storage functions. The execution body may control a plurality of devices, such as a remote physical server or cloud server and related software, or may be a local host or server and related software that performs related operations on a device that is located somewhere, etc. In some scenarios, multiple storage devices may also be controlled, which may be located in the same location or in different locations than the devices. As shown in fig. 1, the VOIP real-time audio-video call method includes the following steps:
101. and receiving a request instruction from the initiating terminal, carrying out identity authentication on the initiating terminal based on the request instruction, and returning an initiating terminal passing instruction after the identity authentication is passed.
The initiating terminal in the embodiment comprises a push terminal and a pull terminal, namely the push terminal and the pull terminal, wherein the push terminal refers to a channel for acquiring audio and video data and comprises a mobile phone/tablet APP, a Web terminal and a PC terminal, and the pull terminal refers to a channel for receiving audio and video data and comprises a mobile phone/tablet APP, a Web terminal and a PC terminal.
Specifically, receiving a request instruction from an initiator, and authenticating the identity of the initiator based on the request instruction, including: receiving a call request instruction from a push end, carrying out push end identity authentication on the push end, and returning a push end connection success response instruction after the push end identity authentication is passed; and receiving a data request instruction from the pulling end, carrying out pulling end identity authentication on the pulling end, and returning a pulling end data connection response instruction after the pulling end identity authentication passes.
For the plug-flow end, specifically, a handshake and a connection request from the plug-flow end are received, the plug-flow end is subjected to first plug-flow end identity authentication, and after the first plug-flow end identity authentication is passed, a plug-flow end connection success response instruction is returned; receiving a data interaction request from a pushing end, carrying out second pushing end identity authentication on the pushing end, and returning a pushing end interaction response instruction after the second pushing end identity authentication is passed; and receiving a release request from the pushing end, carrying out third-time pushing end identity authentication on the pushing end, and returning a pushing end release response instruction after the third-time pushing end identity authentication passes.
For the pull-stream end, the method specifically comprises the steps of receiving handshake and connection requests from the pull-stream end, carrying out first-time pull-stream end identity authentication on the pull-stream end, and returning a pull-stream end connection success response instruction after the first-time pull-stream end identity authentication is passed; receiving a data interaction request from a pulling end, carrying out second pulling end identity authentication on the pulling end, and returning a pulling end interaction response instruction after the second pulling end identity authentication is passed; and receiving a play request from the streaming end, carrying out third streaming end identity authentication on the streaming end, and returning a streaming end play response instruction after the third streaming end identity authentication is passed.
102. And sending a video call request to the VOIP gateway server, receiving ringing response information from the VOIP gateway server when the called terminal goes off-hook, playing ringing audio and video to the initiating terminal, and establishing a call.
The called end refers to terminal equipment and software application programs supporting the VOIP gateway server, such as a physical phone, a soft phone, a web phone, a mobile phone app and the like.
103. Detecting whether a first media data stream from an initiating terminal or a second media data stream which is from a VOIP gateway server and transcoded by a media coding and decoding module is received, and when the first media data stream or the second media data stream is received, transcoding the first media data stream and forwarding the first media data stream to the VOIP gateway server so that the VOIP gateway server sends the first media data stream to a called terminal; and forwarding the second media data stream to an initiating terminal.
Further, when the first media data stream is not received, controlling the conversation to sleep for a preset time length and counting the preset times in the sleep preset time length, continuously detecting whether the first media data stream is received or not in the preset time length and the preset times, and ending the conversation when the first media data stream is not detected to be received after the preset time length or the preset times are exceeded.
Example two
Referring to fig. 2, fig. 2 is a schematic block diagram of a VOIP real-time audio/video call system according to an embodiment of the present invention, and fig. 3 is a schematic block diagram of a working principle of the VOIP real-time audio/video call system according to an embodiment of the present invention, referring to fig. 2 and fig. 3, a push terminal, a pull terminal, an RTMP gateway server, a media codec module and a VOIP gateway server of the system are all connected with the RTMP gateway server, the media codec module and the VOIP gateway server are connected, the push terminal is used for initiating a call request command, the pull terminal is used for initiating a data request command, the RTMP gateway server is used for authenticating the push terminal based on the call request command, authenticating the pull terminal based on the data request command, and transmitting a video call request to the VOIP gateway server, and transmitting a first media data stream from the push terminal to the media codec module, and then transmitting the first media data stream from the push terminal to the media codec module to the VOIP gateway server, and transmitting the second media stream to the VOIP gateway server after the media codec module.
Specifically, the push terminal and the pull terminal respectively send handshakes and connection requests to the RTMP gateway server to apply for authentication, and after the authentication passes, a connection success response is returned. And then the plug-flow terminal sends a data interaction request to the RTMP gateway server, and the RTMP gateway server returns a plug-flow terminal interaction response instruction. The push terminal initiates a release request, and the RTMP gateway server returns a release success response and receives media stream data of the push terminal. The streaming terminal sequentially initiates a handshake and connection request, a data interaction request and a play request, and the RTMP gateway server returns a successful response one by one. Meanwhile, the RTMP gateway server sends a signaling request video call to the VOIP gateway server, when the called party rings, the VOIP gateway server returns to a ringing event, and at the moment, the RTMP gateway server plays ringing audio and video to the streaming terminal to inform the streaming terminal that the called party rings. When the called end goes off hook, the VOIP gateway server returns a response event, at the moment, the RTMP gateway server judges whether the current session has received the push terminal media data stream, if not, the RTMP gateway server judges whether the push terminal media data stream is received or not in a mode of dormancy for M seconds and counting N times. And ending the call if the judgment is failed, and transmitting the RTMP media stream data to a media coding and decoding module for transcoding, transcoding into RTP media stream data and transmitting to the VOIP gateway server if the judgment is successful. Similarly, RTP media stream data from the VOIP gateway server is received, transcoding processing is carried out through a media encoding and decoding module, and the RTP media stream data is forwarded to a stream pulling end after transcoding into RTMP media stream. The calling party and the called party see the image of the other party and hear the sound of the other party in real time, and the real-time intercommunication of media stream data for video call based on RTMP is realized.
The audio codec formats of an embodiment include one or more of G711A/G711U/AAC/SPEEX; the video codec includes: one or more of H263/H264/VP 8.
Example III
Referring to fig. 4, fig. 4 is a schematic structural diagram of a VOIP real-time audio-video call device according to an embodiment of the present invention. As shown in fig. 4, the VOIP real-time audio/video telephony device may include: the system comprises an instruction receiving module 401, a call establishing module 402 and a data interaction module 403, wherein the instruction receiving module 401 is used for: the system comprises a receiving module, a receiving module and a sending module, wherein the receiving module is used for receiving a request instruction from an initiating terminal, carrying out identity authentication on the initiating terminal based on the request instruction, and returning an initiating terminal passing instruction after the identity authentication passes; call establishment module 402: the method comprises the steps of sending a video call request to a VOIP gateway server, receiving ringing response information from the VOIP gateway server when a called terminal goes off-hook, playing ringing audio and video to an initiating terminal, and establishing a call; data interaction module 403: the method comprises the steps of detecting whether a first media data stream from an initiating terminal or a second media data stream which is from a VOIP gateway server and transcoded by a media coding and decoding module is received, and when the first media data stream or the second media data stream is received, transcoding the first media data stream and forwarding the first media data stream to the VOIP gateway server so that the VOIP gateway server can send the first media data stream to a called terminal; and forwarding the second media data stream to an initiating terminal.
In the above, the initiating terminal includes a push-flow terminal and a pull-flow terminal. The receiving the request instruction from the initiating terminal, and authenticating the identity of the initiating terminal based on the request instruction comprises the following steps: receiving a call request instruction from a push end, carrying out push end identity authentication on the push end, and returning a push end connection success response instruction after the push end identity authentication is passed; and receiving a data request instruction from the pulling end, carrying out pulling end identity authentication on the pulling end, and returning a pulling end data connection response instruction after the pulling end identity authentication passes.
The data interaction module 403 further includes controlling the session to sleep for a preset duration and counting a preset number of times within the sleep preset duration when the first media data stream is not received, continuously detecting whether the first media data stream is received within the preset duration and the preset number of times, and ending the session when the first media data stream is not detected to be received beyond the preset duration or beyond the preset number of times.
Example five
Referring to fig. 5, fig. 5 is a schematic structural diagram of an electronic device according to an embodiment of the invention. The electronic device may be a computer, a server, or the like, and of course, may also be an intelligent device such as a mobile phone, a tablet computer, a monitor terminal, or the like, and an image acquisition device having a processing function. As shown in fig. 5, the electronic device may include:
a memory 501 in which executable program codes are stored;
a processor 502 coupled to the memory 501;
the processor 502 invokes executable program codes stored in the memory 501 to execute some or all of the steps in the VOIP real-time audio/video call method in the first embodiment.
The embodiment of the invention discloses a computer readable storage medium which stores a computer program, wherein the computer program enables a computer to execute part or all of the steps in the VOIP real-time audio-video call method in the first embodiment.
The embodiment of the invention also discloses a computer program product, wherein when the computer program product runs on a computer, the computer is caused to execute part or all of the steps in the VOIP real-time audio-video call method in the first embodiment.
The embodiment of the invention also discloses an application release platform, wherein the application release platform is used for releasing a computer program product, and when the computer program product runs on a computer, the computer is caused to execute part or all of the steps in the VOIP real-time audio/video call method in the first embodiment.
In various embodiments of the present invention, it should be understood that the size of the sequence numbers of the processes does not mean that the execution sequence of the processes is necessarily sequential, and the execution sequence of the processes should be determined by the functions and internal logic thereof, and should not constitute any limitation on the implementation process of the embodiments of the present invention.
The units described as separate units may or may not be physically separate, and units shown as units may or may not be physical units, may be located in one place, or may be distributed on a plurality of network units. Some or all of the units may be selected according to actual needs to achieve the purpose of the embodiment.
In addition, each functional unit in the embodiments of the present invention may be integrated in one processing unit, or each unit may exist alone physically, or two or more units may be integrated in one unit. The integrated units may be implemented in hardware or in software functional units.
The integrated units, if implemented in the form of software functional units and sold or used as stand-alone products, may be stored in a computer-accessible memory. Based on this understanding, the technical solution of the present invention, or a part contributing to the prior art or all or part of the technical solution, may be embodied in the form of a software product stored in a memory, comprising several requests for a computer device (which may be a personal computer, a server or a network device, etc., in particular may be a processor in a computer device) to execute some or all of the steps of the method according to the embodiments of the present invention.
In the embodiments provided herein, it should be understood that "B corresponding to a" means that B is associated with a, from which B can be determined. It should also be understood that determining B from a does not mean determining B from a alone, but may also determine B from a and/or other information.
Those of ordinary skill in the art will appreciate that some or all of the steps of the various methods of the described embodiments may be implemented by hardware associated with a program that may be stored in a computer-readable storage medium, including Read-Only Memory (ROM), random-access Memory (Random Access Memory, RAM), programmable Read-Only Memory (Programmable Read-Only Memory, PROM), erasable programmable Read-Only Memory (Erasable Programmable Read-Only Memory, EPROM), one-time programmable Read-Only Memory (OTPROM), electrically erasable programmable Read-Only Memory (EEPROM), compact disc Read-Only Memory (Compact Disc Read-Only Memory, CD-ROM), or other optical disk Memory, magnetic disk Memory, tape Memory, or any other medium capable of being used to carry or store data that is readable by a computer.
The VOIP real-time audio and video call method, device, electronic equipment and storage medium disclosed in the embodiments of the present invention are described in detail, and specific examples are applied to the description of the principles and embodiments of the present invention, where the description of the above embodiments is only used to help understand the method and core ideas of the present invention; meanwhile, as those skilled in the art will have variations in the specific embodiments and application scope in accordance with the ideas of the present invention, the present description should not be construed as limiting the present invention in view of the above.
Claims (10)
1. The VOIP real-time audio and video call method is characterized by comprising the following steps:
receiving a request instruction from an initiating terminal, carrying out identity authentication on the initiating terminal based on the request instruction, and returning an initiating terminal passing instruction after the identity authentication is passed; the identity authentication is executed through an RTMP gateway server;
sending a video call request to a VOIP gateway server, receiving ringing response information from the VOIP gateway server when a called end goes off-hook, playing ringing audio and video to an initiating end, and establishing a call;
detecting whether a first media data stream from an initiating terminal or a second media data stream which is from a VOIP gateway server and transcoded by a media coding and decoding module is received, and when the first media data stream or the second media data stream is received, transcoding the first media data stream and forwarding the first media data stream to the VOIP gateway server so that the VOIP gateway server sends the first media data stream to a called terminal; and forwarding the second media data stream to an initiating terminal.
2. The VOIP real-time audio-video call method of claim 1, wherein the originating terminal includes a push-flow terminal and a pull-flow terminal.
3. The VOIP real-time audio-video call method according to claim 2, wherein the receiving the request command from the originating terminal, and authenticating the originating terminal based on the request command, comprises:
receiving a call request instruction from a push end, carrying out push end identity authentication on the push end, and returning a push end connection success response instruction after the push end identity authentication is passed;
and receiving a data request instruction from the pulling end, carrying out pulling end identity authentication on the pulling end, and returning a pulling end data connection response instruction after the pulling end identity authentication passes.
4. The method for VOIP real-time audio/video call according to claim 3, wherein receiving the call request command from the push terminal, and performing push terminal authentication on the push terminal, and returning a push terminal connection success response command after the push terminal authentication is passed, comprises:
receiving handshake and connection requests from a plug-flow end, carrying out first plug-flow end identity authentication on the plug-flow end, and returning a plug-flow end connection success response instruction after the first plug-flow end identity authentication is passed;
receiving a data interaction request from a pushing end, carrying out second pushing end identity authentication on the pushing end, and returning a pushing end interaction response instruction after the second pushing end identity authentication is passed;
and receiving a release request from the pushing end, carrying out third-time pushing end identity authentication on the pushing end, and returning a pushing end release response instruction after the third-time pushing end identity authentication passes.
5. The VOIP real-time audio/video call method according to claim 3, wherein the receiving the data request command from the pull-stream terminal, and performing pull-stream terminal authentication on the pull-stream terminal, returning a pull-stream terminal data connection response command after the pull-stream terminal authentication passes, includes:
receiving handshake and connection requests from a pulling end, carrying out first pulling end identity authentication on the pulling end, and returning a pulling end connection success response instruction after the first pulling end identity authentication is passed;
receiving a data interaction request from a pulling end, carrying out second pulling end identity authentication on the pulling end, and returning a pulling end interaction response instruction after the second pulling end identity authentication is passed;
and receiving a play request from the streaming end, carrying out third streaming end identity authentication on the streaming end, and returning a streaming end play response instruction after the third streaming end identity authentication is passed.
6. The method of claim 1, further comprising controlling a sleep preset duration of the session and counting a preset number of times in the sleep preset duration when the first media data stream is not received, continuously detecting whether the first media data stream is received in the preset duration and the preset number of times, and ending the session when the first media data stream is not detected to be received in the preset duration or the preset number of times is exceeded.
7. The VOIP real-time audio and video call system is characterized by comprising a push terminal, a pull terminal, an RTMP gateway server, a media coding and decoding module and a VOIP gateway server, wherein the push terminal, the pull terminal, the media coding and decoding module and the VOIP gateway server are all connected with the RTMP gateway server, the media coding and decoding module is connected with the VOIP gateway server, the push terminal is used for initiating a call request instruction, the pull terminal is used for initiating a data request instruction, the RTMP gateway server is used for carrying out identity authentication on the push terminal based on the call request instruction, carrying out identity authentication on the pull terminal based on the data request instruction, sending a video call request to the VOIP gateway server, sending a first media data stream from the push terminal to the media coding and decoding module, then forwarding the first media data stream to the VOIP gateway server, and forwarding a second media data stream which is coded and decoded by the media coding and decoding module to the pull terminal.
8. A VOIP real-time audio-video telephony device, comprising:
the instruction receiving module: the system comprises a receiving module, a receiving module and a sending module, wherein the receiving module is used for receiving a request instruction from an initiating terminal, carrying out identity authentication on the initiating terminal based on the request instruction, and returning an initiating terminal passing instruction after the identity authentication passes; the identity authentication is executed through an RTMP gateway server;
and a call establishment module: the method comprises the steps of sending a video call request to a VOIP gateway server, receiving ringing response information from the VOIP gateway server when a called terminal goes off-hook, playing ringing audio and video to an initiating terminal, and establishing a call;
and a data interaction module: the method comprises the steps of detecting whether a first media data stream from an initiating terminal or a second media data stream which is from a VOIP gateway server and transcoded by a media coding and decoding module is received, and when the first media data stream or the second media data stream is received, transcoding the first media data stream and forwarding the first media data stream to the VOIP gateway server so that the VOIP gateway server can send the first media data stream to a called terminal; and forwarding the second media data stream to an initiating terminal.
9. An electronic device, comprising: a memory storing executable program code; a processor coupled to the memory; the processor invokes the executable program code stored in the memory for performing the VOIP real-time audio video call method of any of claims 1 to 6.
10. A computer-readable storage medium storing a computer program, wherein the computer program causes a computer to execute the VOIP real-time audio-video call method of any of claims 1 to 6.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202310032604.3A CN115914178B (en) | 2023-01-10 | 2023-01-10 | VOIP real-time audio and video call method, system and device |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202310032604.3A CN115914178B (en) | 2023-01-10 | 2023-01-10 | VOIP real-time audio and video call method, system and device |
Publications (2)
Publication Number | Publication Date |
---|---|
CN115914178A CN115914178A (en) | 2023-04-04 |
CN115914178B true CN115914178B (en) | 2023-05-02 |
Family
ID=85740850
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202310032604.3A Active CN115914178B (en) | 2023-01-10 | 2023-01-10 | VOIP real-time audio and video call method, system and device |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN115914178B (en) |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102790710A (en) * | 2011-05-16 | 2012-11-21 | 北京新媒传信科技有限公司 | Method and device for audio and video communication between PC (personal computer) terminal and cell phone |
CN106941629A (en) * | 2017-04-05 | 2017-07-11 | 深圳进门财经科技股份有限公司 | Real-time live broadcast method based on SIP+RTP Yu RTMP protocol interconnections |
CN107819725A (en) * | 2016-09-12 | 2018-03-20 | 山东量子科学技术研究院有限公司 | Method and mobile terminal based on VoIP calls |
CN112533006A (en) * | 2020-11-05 | 2021-03-19 | 深圳市咪码科技有限公司 | Communication method and device for live broadcast platform and VOIP terminal |
Family Cites Families (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7613923B2 (en) * | 2004-02-25 | 2009-11-03 | Watchguard Technologies, Inc. | Method and apparatus for controlling unsolicited messaging in real time messaging networks |
US20120124227A1 (en) * | 2010-11-15 | 2012-05-17 | Nabil Al-Khowaiter | Browser-based voip service method and system |
-
2023
- 2023-01-10 CN CN202310032604.3A patent/CN115914178B/en active Active
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102790710A (en) * | 2011-05-16 | 2012-11-21 | 北京新媒传信科技有限公司 | Method and device for audio and video communication between PC (personal computer) terminal and cell phone |
CN107819725A (en) * | 2016-09-12 | 2018-03-20 | 山东量子科学技术研究院有限公司 | Method and mobile terminal based on VoIP calls |
CN106941629A (en) * | 2017-04-05 | 2017-07-11 | 深圳进门财经科技股份有限公司 | Real-time live broadcast method based on SIP+RTP Yu RTMP protocol interconnections |
CN112533006A (en) * | 2020-11-05 | 2021-03-19 | 深圳市咪码科技有限公司 | Communication method and device for live broadcast platform and VOIP terminal |
Non-Patent Citations (1)
Title |
---|
基于RTMP与SIP的可视化交互系统研究与设计;陈天骢;《CNKI优秀硕士学位论文全文库》;全文 * |
Also Published As
Publication number | Publication date |
---|---|
CN115914178A (en) | 2023-04-04 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US7966397B2 (en) | Distributive data capture | |
US7822050B2 (en) | Buffering, pausing and condensing a live phone call | |
CN101909192B (en) | Television terminal and communication method thereof | |
KR20110003491A (en) | Method and apparatus for video service | |
RU2007125542A (en) | METHOD FOR MONITORING VIDEO TELEPHONE SERVICES AND INTENDED FOR THIS SYSTEM | |
WO2014023042A1 (en) | Set top box based video conversation method and system | |
RU2011151783A (en) | SERVICE BACKGROUND BLOCK, AGENCY SYSTEM AND CALL MANAGEMENT METHOD IN SUCH SYSTEM | |
CN101272383B (en) | Real-time audio data transmission method | |
US8493429B2 (en) | Method and terminal for synchronously recording sounds and images of opposite ends based on circuit domain video telephone | |
CN115484235A (en) | Device and method for two-dimension code recognition binding and video interaction | |
US12418570B2 (en) | Real-time and file-based audio data processing | |
CN115914178B (en) | VOIP real-time audio and video call method, system and device | |
WO2021017807A1 (en) | Call connection establishment method, first terminal, server, and storage medium | |
CN101547267B (en) | Network telephony communication integration system and method | |
TWI435589B (en) | Voip integrating system and method thereof | |
CN111787417B (en) | Audio and video transmission control method based on artificial intelligence AI and related equipment | |
CN110602430B (en) | Video synchronization method, terminal, CRBT platform, system and readable storage medium | |
WO2009089097A1 (en) | Methods and apparatus for delivering audio content to a caller placed on hold | |
US20130170404A1 (en) | Control capabilities for information recording sessions | |
CN111865878A (en) | Call method, monitoring device, cloud platform and monitoring system | |
JP2008060752A (en) | Calling method of communication terminal | |
CN105471664A (en) | Automatic packet capturing method and automatic packet capturing device | |
US20110158226A1 (en) | Digital telecommunications system, program product for, and method of managing such a system | |
KR100854883B1 (en) | Caller display method of communication terminal and communication terminal | |
JP2008085808A (en) | Call setting server call method |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |