Flink startup waits 10 minutes

Flink startup waits 10 minutes

守护
Hello, Flink community experts!
      I am using Flink SQL (Flink 1.8.1), and I adjusted the memory parameters in yarn-site.xml on the Hadoop cluster:
        <property>
                <name>yarn.nodemanager.resource.memory-mb</name>
                <value>16384</value>
        </property>
        <property>
                <name>yarn.scheduler.minimum-allocation-mb</name>
                <value>1024</value>
        </property>
        <property>
                <name>yarn.scheduler.maximum-allocation-mb</name>
                <value>8192</value>
        </property>



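Then I start the Flink on YARN job with the following command: ./flink-1.8.1/bin/flink run -m yarn-cluster -p 4 -d -c streaming.StreamKafkaMain -yn 4 -ys 2 -yjm 2048 -ytm 8192  -ynm bigdata6 ./flink_bigdata6.jar
The job does eventually start successfully, but startup takes 10 minutes, which is far too long. Does anyone have a suggestion for how to fix this?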
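
As a rough sanity check on the numbers above, the sketch below works out how many TaskManager containers the command asks for and how many of them fit on one NodeManager. It assumes the YARN scheduler rounds each request up to a multiple of yarn.scheduler.minimum-allocation-mb (the Capacity Scheduler does this; other schedulers may differ), and the helper names are made up for illustration only.

```python
import math

def taskmanagers_needed(parallelism: int, slots_per_tm: int) -> int:
    """Number of TaskManager containers needed to provide `parallelism` slots."""
    return math.ceil(parallelism / slots_per_tm)

def normalized_mb(request_mb: int, min_alloc_mb: int) -> int:
    """Round a request up to a multiple of the minimum allocation.
    Assumption: this mirrors how the scheduler normalizes requests."""
    return math.ceil(request_mb / min_alloc_mb) * min_alloc_mb

def containers_that_fit(free_mb: int, request_mb: int, min_alloc_mb: int) -> int:
    """How many containers of the given size fit into `free_mb` of node memory."""
    return free_mb // normalized_mb(request_mb, min_alloc_mb)

# Values taken from the yarn-site.xml snippet and the launch command in the post.
NODE_MB = 16384                   # yarn.nodemanager.resource.memory-mb
MIN_ALLOC_MB = 1024               # yarn.scheduler.minimum-allocation-mb
MAX_ALLOC_MB = 8192               # yarn.scheduler.maximum-allocation-mb
PARALLELISM, SLOTS_PER_TM = 4, 2  # -p 4, -ys 2
JM_MB, TM_MB = 2048, 8192         # -yjm 2048, -ytm 8192

tms = taskmanagers_needed(PARALLELISM, SLOTS_PER_TM)
print("TaskManagers needed:", tms)
print("TM request within maximum allocation:",
      normalized_mb(TM_MB, MIN_ALLOC_MB) <= MAX_ALLOC_MB)
print("TMs that fit on an empty node:",
      containers_that_fit(NODE_MB, TM_MB, MIN_ALLOC_MB))

# Memory left on the node that also hosts the JobManager container.
free_on_jm_node = NODE_MB - normalized_mb(JM_MB, MIN_ALLOC_MB)
print("TMs that fit next to the JobManager:",
      containers_that_fit(free_on_jm_node, TM_MB, MIN_ALLOC_MB))
```

With these values the TaskManager request sits exactly at the maximum allocation, and the node that hosts the JobManager has room for only one more 8192 MB container, so the other TaskManager has to be placed on a different node.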
Re: Flink startup waits 10 minutes

Yun Tang
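Hi

A Flink on YARN job can be slow to start for many reasons, for example waiting because resources are not yet available, or containers exiting right after they are allocated. The default slot request timeout is 5 minutes, so my guess is that your job hit a slot request timeout and then requested the slots again. It would be best to provide the JobManager log so this can be analyzed further.

Best regards,
Yun Tang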
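
One way to look for such a stall in a JobManager log is to scan it for the largest jumps between consecutive timestamps. The following is only a minimal sketch: it assumes the log lines start with a `yyyy-MM-dd HH:mm:ss,SSS` timestamp (the usual Flink log4j layout), and `largest_gaps` is a hypothetical helper name, not part of Flink.

```python
import re
from datetime import datetime

# Timestamp at the start of a log line, e.g. "2019-08-21 20:00:28,323".
TIMESTAMP = re.compile(r"^(\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2},\d{3})\s")

def largest_gaps(lines, top=3):
    """Return the `top` largest pauses between consecutive timestamped lines.

    Each result is a tuple (gap_in_seconds, line_before_gap, line_after_gap).
    Lines without a leading timestamp (stack traces, wrapped text) are skipped.
    """
    gaps = []
    prev_ts, prev_line = None, None
    for line in lines:
        match = TIMESTAMP.match(line)
        if not match:
            continue
        ts = datetime.strptime(match.group(1), "%Y-%m-%d %H:%M:%S,%f")
        if prev_ts is not None:
            gaps.append(((ts - prev_ts).total_seconds(),
                         prev_line.rstrip(), line.rstrip()))
        prev_ts, prev_line = ts, line
    gaps.sort(key=lambda gap: gap[0], reverse=True)
    return gaps[:top]
```

It could be used as `largest_gaps(open("jobmanager.log", encoding="utf-8"))`. A pause of about 300 seconds just before slots are requested again would be consistent with the slot request timeout described above, while gaps of only a few seconds suggest the delay lies elsewhere.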
________________________________
From: 々守护々 <[hidden email]>
Sent: Thursday, August 22, 2019 11:04
To: user-zh <[hidden email]>
Subject: Flink startup waits 10 minutes

Re: Flink startup waits 10 minutes

守护
Hello, here is my JobManager startup log. Please take a look, thank you!


2019-08-21 20:00:25,428 INFO  org.apache.flink.runtime.entrypoint.ClusterEntrypoint         - --------------------------------------------------------------------------------
2019-08-21 20:00:25,430 INFO  org.apache.flink.runtime.entrypoint.ClusterEntrypoint         - Registered UNIX signal handlers for [TERM, HUP, INT]
2019-08-21 20:00:25,433 INFO  org.apache.flink.runtime.entrypoint.ClusterEntrypoint         - YARN daemon is running as: root Yarn client user obtainer: root
2019-08-21 20:00:25,437 INFO  org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: high-availability.cluster-id, application_1566383236573_0003
2019-08-21 20:00:25,437 INFO  org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: jobmanager.rpc.address, localhost
2019-08-21 20:00:25,437 INFO  org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: jobmanager.rpc.port, 6123
2019-08-21 20:00:25,437 INFO  org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: high-availability.zookeeper.path.root, /flink-yarn
2019-08-21 20:00:25,437 INFO  org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: high-availability.storageDir, hdfs://host51:9000/flink/ha-yarn
2019-08-21 20:00:25,438 INFO  org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: high-availability.zookeeper.quorum, host51:2181
2019-08-21 20:00:25,438 INFO  org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: internal.cluster.execution-mode, DETACHED
2019-08-21 20:00:25,438 INFO  org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: high-availability, zookeeper
2019-08-21 20:00:25,438 INFO  org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: parallelism.default, 1
2019-08-21 20:00:25,438 INFO  org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: yarn.application-attempts, 10
2019-08-21 20:00:25,438 INFO  org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: taskmanager.numberOfTaskSlots, 2
2019-08-21 20:00:25,439 INFO  org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: taskmanager.heap.size, 8192m
2019-08-21 20:00:25,439 INFO  org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: jobmanager.heap.size, 2048m
2019-08-21 20:00:25,465 WARN  org.apache.flink.configuration.Configuration                  - Config uses deprecated configuration key 'web.port' instead of proper key 'rest.bind-port'
2019-08-21 20:00:25,469 INFO  org.apache.flink.runtime.clusterframework.BootstrapTools      - Setting directories for temporary files to: /usr/hadoop/hadoop-2.7.7/tmp/nm-local-dir/usercache/root/appcache/application_1566383236573_0003
2019-08-21 20:00:25,485 INFO  org.apache.flink.runtime.entrypoint.ClusterEntrypoint         - Starting YarnJobClusterEntrypoint.
2019-08-21 20:00:25,485 INFO  org.apache.flink.runtime.entrypoint.ClusterEntrypoint         - Install default filesystem.
2019-08-21 20:00:25,561 INFO  org.apache.flink.runtime.security.modules.HadoopModule        - Hadoop user set to root (auth:SIMPLE)
2019-08-21 20:00:25,580 INFO  org.apache.flink.runtime.entrypoint.ClusterEntrypoint         - Initializing cluster services.
2019-08-21 20:00:25,757 INFO  org.apache.flink.runtime.rpc.akka.AkkaRpcServiceUtils         - Trying to start actor system at host53:0
2019-08-21 20:00:26,187 INFO  akka.event.slf4j.Slf4jLogger                                  - Slf4jLogger started
2019-08-21 20:00:26,251 INFO  akka.remote.Remoting                                          - Starting remoting
2019-08-21 20:00:26,439 INFO  akka.remote.Remoting                                          - Remoting started; listening on addresses :[akka.tcp://flink@host53:59609]
2019-08-21 20:00:26,447 INFO  org.apache.flink.runtime.rpc.akka.AkkaRpcServiceUtils         - Actor system started at akka.tcp://flink@host53:59609
2019-08-21 20:00:27,011 INFO  org.apache.flink.runtime.blob.FileSystemBlobStore             - Creating highly available BLOB storage directory at hdfs://host51:9000/flink/ha-yarn/application_1566383236573_0003/blob
2019-08-21 20:00:27,097 INFO  org.apache.flink.runtime.util.ZooKeeperUtils                  - Enforcing default ACL for ZK connections
2019-08-21 20:00:27,097 INFO  org.apache.flink.runtime.util.ZooKeeperUtils                  - Using '/flink-yarn/application_1566383236573_0003' as Zookeeper namespace.
2019-08-21 20:00:27,161 INFO  org.apache.flink.shaded.curator.org.apache.curator.framework.imps.CuratorFrameworkImpl  - Starting
2019-08-21 20:00:27,169 INFO  org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client environment:zookeeper.version=3.4.10-39d3a4f269333c922ed3db283be479f9deacaa0f, built on 03/23/2017 10:13 GMT
2019-08-21 20:00:27,169 INFO  org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client environment:host.name=host53
2019-08-21 20:00:27,169 INFO  org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client environment:java.version=1.8.0_77
2019-08-21 20:00:27,169 INFO  org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client environment:java.vendor=Oracle Corporation
2019-08-21 20:00:27,169 INFO  org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client
2019-08-21 20:00:27,170 INFO  org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client environment:java.library.path=:/usr/hadoop/hadoop-2.7.7/lib/native:/usr/hadoop/hadoop-2.7.7/lib/native:/usr/java/packages/lib/amd64:/usr/lib64:/lib64:/lib:/usr/lib
2019-08-21 20:00:27,170 INFO  org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client environment:java.io.tmpdir=/tmp
2019-08-21 20:00:27,170 INFO  org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client environment:java.compiler=<NA>
2019-08-21 20:00:27,170 INFO  org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client environment:os.name=Linux
2019-08-21 20:00:27,170 INFO  org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client environment:os.arch=amd64
2019-08-21 20:00:27,170 INFO  org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client environment:os.version=3.16.0-0.bpo.4-amd64
2019-08-21 20:00:27,170 INFO  org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client environment:user.name=root
2019-08-21 20:00:27,170 INFO  org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client environment:user.home=/root
2019-08-21 20:00:27,170 INFO  org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client environment:user.dir=/usr/hadoop/hadoop-2.7.7/tmp/nm-local-dir/usercache/root/appcache/application_1566383236573_0003/container_1566383236573_0003_01_000001
2019-08-21 20:00:27,171 INFO  org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Initiating client connection, connectString=host51:2181 sessionTimeout=60000 watcher=org.apache.flink.shaded.curator.org.apache.curator.ConnectionState@319854f0
2019-08-21 20:00:27,184 WARN  org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ClientCnxn  - SASL configuration failed: javax.security.auth.login.LoginException: No JAAS configuration section named 'Client' was found in specified JAAS configuration file: '/tmp/jaas-4274640356638794035.conf'. Will continue connection to Zookeeper server without SASL authentication, if Zookeeper server allows it.
2019-08-21 20:00:27,186 INFO  org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ClientCnxn  - Opening socket connection to server host51/10.10.10.51:2181
2019-08-21 20:00:27,187 ERROR org.apache.flink.shaded.curator.org.apache.curator.ConnectionState  - Authentication failed
2019-08-21 20:00:27,187 INFO  org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ClientCnxn  - Socket connection established to host51/10.10.10.51:2181, initiating session
2019-08-21 20:00:27,193 INFO  org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ClientCnxn  - Session establishment complete on server host51/10.10.10.51:2181, sessionid = 0x1001a86de810008, negotiated timeout = 40000
2019-08-21 20:00:27,193 INFO  org.apache.flink.runtime.blob.BlobServer                      - Created BLOB server storage directory /usr/hadoop/hadoop-2.7.7/tmp/nm-local-dir/usercache/root/appcache/application_1566383236573_0003/blobStore-eaa2bb2b-cddb-42e3-8ce5-bf2f83e8567a
2019-08-21 20:00:27,195 INFO  org.apache.flink.shaded.curator.org.apache.curator.framework.state.ConnectionStateManager  - State change: CONNECTED
2019-08-21 20:00:27,198 INFO  org.apache.flink.runtime.blob.BlobServer                      - Started BLOB server at 0.0.0.0:58591 - max concurrent requests: 50 - max backlog: 1000
2019-08-21 20:00:27,213 INFO  org.apache.flink.runtime.metrics.MetricRegistryImpl           - No metrics reporter configured, no metrics will be exposed/reported.
2019-08-21 20:00:27,215 INFO  org.apache.flink.runtime.entrypoint.ClusterEntrypoint         - Trying to start actor system at host53:0
2019-08-21 20:00:27,239 INFO  akka.event.slf4j.Slf4jLogger                                  - Slf4jLogger started
2019-08-21 20:00:27,247 INFO  akka.remote.Remoting                                          - Starting remoting
2019-08-21 20:00:27,255 INFO  akka.remote.Remoting                                          - Remoting started; listening on addresses :[akka.tcp://flink-metrics@host53:59941]
2019-08-21 20:00:27,257 INFO  org.apache.flink.runtime.entrypoint.ClusterEntrypoint         - Actor system started at akka.tcp://flink-metrics@host53:59941
2019-08-21 20:00:27,263 INFO  org.apache.flink.runtime.blob.TransientBlobCache              - Created BLOB cache storage directory /usr/hadoop/hadoop-2.7.7/tmp/nm-local-dir/usercache/root/appcache/application_1566383236573_0003/blobStore-405a4797-824f-48da-bb50-2ee2e230d249
2019-08-21 20:00:27,296 WARN  org.apache.flink.configuration.Configuration                  - Config uses deprecated configuration key 'web.port' instead of proper key 'rest.bind-port'
2019-08-21 20:00:27,298 WARN  org.apache.flink.runtime.jobmaster.MiniDispatcherRestEndpoint  - Upload directory /tmp/flink-web-57c9cddd-64af-4e3d-96a9-4f9e78d20b86/flink-web-upload does not exist, or has been deleted externally. Previously uploaded files are no longer available.
2019-08-21 20:00:27,298 INFO  org.apache.flink.runtime.jobmaster.MiniDispatcherRestEndpoint  - Created directory /tmp/flink-web-57c9cddd-64af-4e3d-96a9-4f9e78d20b86/flink-web-upload for file uploads.
2019-08-21 20:00:27,320 INFO  org.apache.flink.runtime.jobmaster.MiniDispatcherRestEndpoint  - Starting rest endpoint.
2019-08-21 20:00:27,616 INFO  org.apache.flink.runtime.webmonitor.WebMonitorUtils           - Determined location of main cluster component log file: /usr/hadoop/hadoop-2.7.7/logs/userlogs/application_1566383236573_0003/container_1566383236573_0003_01_000001/jobmanager.log
2019-08-21 20:00:27,616 INFO  org.apache.flink.runtime.webmonitor.WebMonitorUtils           - Determined location of main cluster component stdout file: /usr/hadoop/hadoop-2.7.7/logs/userlogs/application_1566383236573_0003/container_1566383236573_0003_01_000001/jobmanager.out
2019-08-21 20:00:27,771 INFO  org.apache.flink.runtime.jobmaster.MiniDispatcherRestEndpoint  - Rest endpoint listening at host53:46401
2019-08-21 20:00:27,772 INFO  org.apache.flink.runtime.leaderelection.ZooKeeperLeaderElectionService  - Starting ZooKeeperLeaderElectionService ZooKeeperLeaderElectionService{leaderPath='/leader/rest_server_lock'}.
2019-08-21 20:00:27,795 WARN  org.apache.flink.shaded.curator.org.apache.curator.utils.ZKPaths  - The version of ZooKeeper being used doesn't support Container nodes. CreateMode.PERSISTENT will be used instead.
2019-08-21 20:00:27,807 INFO  org.apache.flink.runtime.jobmaster.MiniDispatcherRestEndpoint  - Web frontend listening at http://host53:46401.
2019-08-21 20:00:27,836 INFO  org.apache.flink.runtime.jobmaster.MiniDispatcherRestEndpoint  - http://host53:46401 was granted leadership with leaderSessionID=8487757f-fd82-42b2-a1a6-adcc69e1eba1
2019-08-21 20:00:27,932 INFO  org.apache.flink.runtime.rpc.akka.AkkaRpcService              - Starting RPC endpoint for org.apache.flink.yarn.YarnResourceManager at akka://flink/user/resourcemanager .
2019-08-21 20:00:28,021 INFO  org.apache.flink.runtime.rpc.akka.AkkaRpcService              - Starting RPC endpoint for org.apache.flink.runtime.dispatcher.MiniDispatcher at akka://flink/user/dispatcher .
2019-08-21 20:00:28,033 INFO  org.apache.flink.runtime.leaderretrieval.ZooKeeperLeaderRetrievalService  - Starting ZooKeeperLeaderRetrievalService /leader/resource_manager_lock.
2019-08-21 20:00:28,033 INFO  org.apache.flink.runtime.leaderretrieval.ZooKeeperLeaderRetrievalService  - Starting ZooKeeperLeaderRetrievalService /leader/dispatcher_lock.
2019-08-21 20:00:28,033 INFO  org.apache.flink.runtime.leaderelection.ZooKeeperLeaderElectionService  - Starting ZooKeeperLeaderElectionService ZooKeeperLeaderElectionService{leaderPath='/leader/dispatcher_lock'}.
2019-08-21 20:00:28,045 INFO  org.apache.flink.runtime.dispatcher.MiniDispatcher            - Dispatcher akka.tcp://flink@host53:59609/user/dispatcher was granted leadership with fencing token 6216d17e-56b1-43bb-9bb3-f9fa5b8921be
2019-08-21 20:00:28,048 INFO  org.apache.flink.runtime.dispatcher.MiniDispatcher            - Recovering all persisted jobs.
2019-08-21 20:00:28,054 INFO  org.apache.hadoop.yarn.client.RMProxy                         - Connecting to ResourceManager at host51/10.10.10.51:8030
2019-08-21 20:00:28,078 INFO  org.apache.flink.runtime.rpc.akka.AkkaRpcService              - Starting RPC endpoint for org.apache.flink.runtime.jobmaster.JobMaster at akka://flink/user/jobmanager_0 .
2019-08-21 20:00:28,086 INFO  org.apache.flink.runtime.jobmaster.JobMaster                  - Initializing job StreamKafkaMain (e5ea83238d721e932b4645589660406c).
2019-08-21 20:00:28,099 INFO  org.apache.flink.runtime.jobmaster.JobMaster                  - Using restart strategy FixedDelayRestartStrategy(maxNumberRestartAttempts=2147483647, delayBetweenRestartAttempts=0) for StreamKafkaMain (e5ea83238d721e932b4645589660406c).
2019-08-21 20:00:28,122 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Job recovers via failover strategy: full graph restart
2019-08-21 20:00:28,153 INFO  org.apache.flink.runtime.jobmaster.JobMaster                  - Running initialization on master for job StreamKafkaMain (e5ea83238d721e932b4645589660406c).
2019-08-21 20:00:28,154 INFO  org.apache.flink.runtime.jobmaster.JobMaster                  - Successfully ran initialization on master in 0 ms.
2019-08-21 20:00:28,187 INFO  org.apache.flink.runtime.util.ZooKeeperUtils                  - Initialized ZooKeeperCompletedCheckpointStore in '/checkpoints/e5ea83238d721e932b4645589660406c'.
2019-08-21 20:00:28,196 INFO  org.apache.flink.yarn.YarnResourceManager                     - Recovered 0 containers from previous attempts ([]).
2019-08-21 20:00:28,199 INFO  org.apache.hadoop.yarn.client.api.impl.ContainerManagementProtocolProxy  - yarn.client.max-cached-nodemanagers-proxies : 0
2019-08-21 20:00:28,200 INFO  org.apache.flink.runtime.leaderelection.ZooKeeperLeaderElectionService  - Starting ZooKeeperLeaderElectionService ZooKeeperLeaderElectionService{leaderPath='/leader/resource_manager_lock'}.
2019-08-21 20:00:28,203 INFO  org.apache.flink.runtime.jobmaster.JobMaster                  - No state backend has been configured, using default (Memory / JobManager) MemoryStateBackend (data in heap memory / checkpoints to JobManager) (checkpoints: 'null', savepoints: 'null', asynchronous: TRUE, maxStateSize: 5242880)
2019-08-21 20:00:28,211 INFO  org.apache.flink.yarn.YarnResourceManager                     - ResourceManager akka.tcp://flink@host53:59609/user/resourcemanager was granted leadership with fencing token 97537c0d61bbf091e37cb6c45b784568
2019-08-21 20:00:28,214 INFO  org.apache.flink.runtime.resourcemanager.slotmanager.SlotManager  - Starting the SlotManager.
2019-08-21 20:00:28,225 INFO  org.apache.flink.runtime.checkpoint.ZooKeeperCompletedCheckpointStore  - Recovering checkpoints from ZooKeeper.
2019-08-21 20:00:28,229 INFO  org.apache.flink.runtime.checkpoint.ZooKeeperCompletedCheckpointStore  - Found 0 checkpoints in ZooKeeper.
2019-08-21 20:00:28,229 INFO  org.apache.flink.runtime.checkpoint.ZooKeeperCompletedCheckpointStore  - Trying to fetch 0 checkpoints from storage.
2019-08-21 20:00:28,233 INFO  org.apache.flink.runtime.leaderelection.ZooKeeperLeaderElectionService  - Starting ZooKeeperLeaderElectionService ZooKeeperLeaderElectionService{leaderPath='/leader/e5ea83238d721e932b4645589660406c/job_manager_lock'}.
2019-08-21 20:00:28,248 INFO  org.apache.flink.runtime.jobmaster.JobManagerRunner           - JobManager runner for job StreamKafkaMain (e5ea83238d721e932b4645589660406c) was granted leadership with session id 12c96c94-ebbd-4991-a457-6f0320d71bc1 at akka.tcp://flink@host53:59609/user/jobmanager_0.
2019-08-21 20:00:28,260 INFO  org.apache.flink.runtime.leaderretrieval.ZooKeeperLeaderRetrievalService  - Starting ZooKeeperLeaderRetrievalService /leader/resource_manager_lock.
2019-08-21 20:00:28,260 INFO  org.apache.flink.runtime.jobmaster.JobMaster                  - Starting execution of job StreamKafkaMain (e5ea83238d721e932b4645589660406c) under job master id a4576f0320d71bc112c96c94ebbd4991.
2019-08-21 20:00:28,261 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Job StreamKafkaMain (e5ea83238d721e932b4645589660406c) switched from state CREATED to RUNNING.
2019-08-21 20:00:28,266 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source: Custom Source -> Flat Map -> Filter -> Filter -> Map -> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) -> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (1/4) (ee39866835f7b7387f518e352787a532) switched from CREATED to SCHEDULED.
2019-08-21 20:00:28,280 INFO  org.apache.flink.runtime.jobmaster.slotpool.SlotPoolImpl      - Cannot serve slot request, no ResourceManager connected. Adding as pending request [SlotRequestId{f6123820ffb8b3e906b876e9dea4497b}]
2019-08-21 20:00:28,287 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source: Custom Source -> Flat Map -> Filter -> Filter -> Map -> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) -> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (2/4) (f1a893fc5e2440331c8e6db1874ad220) switched from CREATED to SCHEDULED.
2019-08-21 20:00:28,288 INFO  org.apache.flink.runtime.jobmaster.slotpool.SlotPoolImpl      - Cannot serve slot request, no ResourceManager connected. Adding as pending request [SlotRequestId{fdbb2e71c9bc795452687396abff7bf8}]
2019-08-21 20:00:28,288 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source: Custom Source -> Flat Map -> Filter -> Filter -> Map -> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) -> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (3/4) (4756a0450881325af5fb396a18dbb8ae) switched from CREATED to SCHEDULED.
2019-08-21 20:00:28,289 INFO  org.apache.flink.runtime.jobmaster.slotpool.SlotPoolImpl      - Cannot serve slot request, no ResourceManager connected. Adding as pending request [SlotRequestId{13970fab9312acf9d5063b1d1a33aa0e}]
2019-08-21 20:00:28,289 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source: Custom Source -> Flat Map -> Filter -> Filter -> Map -> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) -> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (4/4) (78f520582607e26b365fca483fc98d4c) switched from CREATED to SCHEDULED.
2019-08-21 20:00:28,289 INFO  org.apache.flink.runtime.jobmaster.slotpool.SlotPoolImpl      - Cannot serve slot request, no ResourceManager connected. Adding as pending request [SlotRequestId{0621470f796aa3ea4291d1b8b423e428}]
2019-08-21 20:00:28,289 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - window: (TumblingGroupWindow('w$, 'rowtime, 60000.millis)), select: (COUNT(*) AS pv, COUNT(DISTINCT curuserid) AS uv, start('w$) AS w$start, end('w$) AS w$end, rowtime('w$) AS w$rowtime, proctime('w$) AS w$proctime) -> select: (pv, uv, utc2local(w$end) AS EXPR$2) -> to: Row (1/1) (0071e1c7c4f70646914329f81dcbd349) switched from CREATED to SCHEDULED.
2019-08-21 20:00:28,291 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print to Std. Out (1/4) (0a206444b10553cce42fa34ea0b15497) switched from CREATED to SCHEDULED.
2019-08-21 20:00:28,291 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print to Std. Out (2/4) (a5d1bb759ee6c8001313fe39c7982a8b) switched from CREATED to SCHEDULED.
2019-08-21 20:00:28,291 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print to Std. Out (3/4) (9543bbde6538ea9d407af3012fede5c8) switched from CREATED to SCHEDULED.
2019-08-21 20:00:28,291 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print to Std. Out (4/4) (0e8c4210bd81e581fbf354b86397ce41) switched from CREATED to SCHEDULED.
2019-08-21 20:00:28,291 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map -> Sink: Print to Std. Out (1/4) (39d437bceb033f5bd37747e9142c7f0e) switched from CREATED to SCHEDULED.
2019-08-21 20:00:28,291 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map -> Sink: Print to Std. Out (2/4) (aa4f64edc4f10da2e8bdbf6ebf0aeb83) switched from CREATED to SCHEDULED.
2019-08-21 20:00:28,291 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map -> Sink: Print to Std. Out (3/4) (f109918ce195d1c88c2c63a44934217f) switched from CREATED to SCHEDULED.
2019-08-21 20:00:28,292 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map -> Sink: Print to Std. Out (4/4) (9aa5bb28f3ba75afc1de47bd9a4e5d01) switched from CREATED to SCHEDULED.
2019-08-21 20:00:28,295 INFO  org.apache.flink.runtime.jobmaster.JobMaster                  - Connecting to ResourceManager akka.tcp://flink@host53:59609/user/resourcemanager(97537c0d61bbf091e37cb6c45b784568)
2019-08-21 20:00:28,300 INFO  org.apache.flink.runtime.jobmaster.JobMaster                  - Resolved ResourceManager address, beginning registration
2019-08-21 20:00:28,300 INFO  org.apache.flink.runtime.jobmaster.JobMaster                  - Registration at ResourceManager attempt 1 (timeout=100ms)
2019-08-21 20:00:28,304 INFO  org.apache.flink.runtime.leaderretrieval.ZooKeeperLeaderRetrievalService  - Starting ZooKeeperLeaderRetrievalService /leader/e5ea83238d721e932b4645589660406c/job_manager_lock.
2019-08-21 20:00:28,304 INFO  org.apache.flink.yarn.YarnResourceManager                     - Registering job manager a4576f0320d71bc112c96c94ebbd4991@akka.tcp://flink@host53:59609/user/jobmanager_0 for job e5ea83238d721e932b4645589660406c.
2019-08-21 20:00:28,310 INFO  org.apache.flink.yarn.YarnResourceManager                     - Registered job manager a4576f0320d71bc112c96c94ebbd4991@akka.tcp://flink@host53:59609/user/jobmanager_0 for job e5ea83238d721e932b4645589660406c.
2019-08-21 20:00:28,313 INFO  org.apache.flink.runtime.jobmaster.JobMaster                  - JobManager successfully registered at ResourceManager, leader id: 97537c0d61bbf091e37cb6c45b784568.
2019-08-21 20:00:28,314 INFO  org.apache.flink.runtime.jobmaster.slotpool.SlotPoolImpl      - Requesting new slot [SlotRequestId{13970fab9312acf9d5063b1d1a33aa0e}] and profile ResourceProfile{cpuCores=-1.0, heapMemoryInMB=-1, directMemoryInMB=0, nativeMemoryInMB=0, networkMemoryInMB=0} from resource manager.
2019-08-21 20:00:28,315 INFO  org.apache.flink.yarn.YarnResourceManager                     - Request slot with profile ResourceProfile{cpuCores=-1.0, heapMemoryInMB=-1, directMemoryInMB=0, nativeMemoryInMB=0, networkMemoryInMB=0} for job e5ea83238d721e932b4645589660406c with allocation id faa6a521758e5be4b3dc1955ca0f51b4.
2019-08-21 20:00:28,316 INFO  org.apache.flink.runtime.jobmaster.slotpool.SlotPoolImpl      - Requesting new slot [SlotRequestId{fdbb2e71c9bc795452687396abff7bf8}] and profile ResourceProfile{cpuCores=-1.0, heapMemoryInMB=-1, directMemoryInMB=0, nativeMemoryInMB=0, networkMemoryInMB=0} from resource manager.
2019-08-21 20:00:28,316 INFO  org.apache.flink.runtime.jobmaster.slotpool.SlotPoolImpl      - Requesting new slot [SlotRequestId{0621470f796aa3ea4291d1b8b423e428}] and profile ResourceProfile{cpuCores=-1.0, heapMemoryInMB=-1, directMemoryInMB=0, nativeMemoryInMB=0, networkMemoryInMB=0} from resource manager.
2019-08-21 20:00:28,316 INFO  org.apache.flink.runtime.jobmaster.slotpool.SlotPoolImpl      - Requesting new slot [SlotRequestId{f6123820ffb8b3e906b876e9dea4497b}] and profile ResourceProfile{cpuCores=-1.0, heapMemoryInMB=-1, directMemoryInMB=0, nativeMemoryInMB=0, networkMemoryInMB=0} from resource manager.
2019-08-21 20:00:28,321 INFO  org.apache.flink.yarn.YarnResourceManager                     - Requesting new TaskExecutor container with resources <memory:8192, vCores:2>. Number pending requests 1.
2019-08-21 20:00:28,322 INFO  org.apache.flink.yarn.YarnResourceManager                     - Request slot with profile ResourceProfile{cpuCores=-1.0, heapMemoryInMB=-1, directMemoryInMB=0, nativeMemoryInMB=0, networkMemoryInMB=0} for job e5ea83238d721e932b4645589660406c with allocation id a2b419aea4a373686eeed6c46163a528.
2019-08-21 20:00:28,323 INFO  org.apache.flink.yarn.YarnResourceManager                     - Request slot with profile ResourceProfile{cpuCores=-1.0, heapMemoryInMB=-1, directMemoryInMB=0, nativeMemoryInMB=0, networkMemoryInMB=0} for job e5ea83238d721e932b4645589660406c with allocation id cbc11737ce6729cfe45f24c43180b5c8.
2019-08-21 20:00:28,323 INFO  org.apache.flink.yarn.YarnResourceManager                     - Requesting new TaskExecutor container with resources <memory:8192, vCores:2>. Number pending requests 2.
2019-08-21 20:00:28,323 INFO  org.apache.flink.yarn.YarnResourceManager                     - Request slot with profile ResourceProfile{cpuCores=-1.0, heapMemoryInMB=-1, directMemoryInMB=0, nativeMemoryInMB=0, networkMemoryInMB=0} for job e5ea83238d721e932b4645589660406c with allocation id f6566567c173a55b2bc52b72bcb24ff8.
2019-08-21 20:00:31,597 INFO  org.apache.flink.runtime.checkpoint.CheckpointCoordinator     - Checkpoint triggering task Source: Custom Source -> Flat Map -> Filter -> Filter -> Map -> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) -> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (1/4) of job e5ea83238d721e932b4645589660406c is not in state RUNNING but SCHEDULED instead. Aborting checkpoint.
2019-08-21 20:00:33,749 INFO  org.apache.hadoop.yarn.client.api.impl.AMRMClientImpl         - Received new token for : host53:59938
2019-08-21 20:00:33,749 INFO  org.apache.hadoop.yarn.client.api.impl.AMRMClientImpl         - Received new token for : host52:33246
2019-08-21 20:00:33,765 INFO  org.apache.flink.yarn.YarnResourceManager                     - Received new container: container_1566383236573_0003_01_000002 - Remaining pending container requests: 2
2019-08-21 20:00:33,765 INFO  org.apache.flink.yarn.YarnResourceManager                     - Removing container request Capability[<memory:8192, vCores:2>]Priority[1]. Pending container requests 1.
2019-08-21 20:00:34,009 INFO  org.apache.flink.yarn.YarnResourceManager                     - Creating container launch context for TaskManagers
2019-08-21 20:00:34,010 INFO  org.apache.flink.yarn.YarnResourceManager                     - Starting TaskManagers
2019-08-21 20:00:34,021 INFO  org.apache.hadoop.yarn.client.api.impl.ContainerManagementProtocolProxy  - Opening proxy : host53:59938
2019-08-21 20:00:34,066 INFO  org.apache.flink.yarn.YarnResourceManager                     - Received new container: container_1566383236573_0003_01_000003 - Remaining pending container requests: 1
2019-08-21 20:00:34,066 INFO  org.apache.flink.yarn.YarnResourceManager                     - Removing container request Capability[<memory:8192, vCores:2>]Priority[1]. Pending container requests 0.
2019-08-21 20:00:34,088 INFO  org.apache.flink.yarn.YarnResourceManager                     - Creating container launch context for TaskManagers
2019-08-21 20:00:34,089 INFO  org.apache.flink.yarn.YarnResourceManager                     - Starting TaskManagers
2019-08-21 20:00:34,090 INFO  org.apache.hadoop.yarn.client.api.impl.ContainerManagementProtocolProxy  - Opening proxy : host52:33246
2019-08-21 20:00:36,597 INFO  org.apache.flink.runtime.checkpoint.CheckpointCoordinator     - Checkpoint triggering task Source: Custom Source -> Flat Map -> Filter -> Filter -> Map -> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) -> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (1/4) of job e5ea83238d721e932b4645589660406c is not in state RUNNING but SCHEDULED instead. Aborting checkpoint.
2019-08-21 20:00:37,856 INFO  org.apache.flink.yarn.YarnResourceManager                     - Registering TaskManager with ResourceID container_1566383236573_0003_01_000002 (akka.tcp://flink@host53:34608/user/taskmanager_0) at ResourceManager
2019-08-21 20:00:39,573 INFO  org.apache.flink.yarn.YarnResourceManager                     - Registering TaskManager with ResourceID container_1566383236573_0003_01_000003 (akka.tcp://flink@host52:32931/user/taskmanager_0) at ResourceManager
2019-08-21 20:00:39,659 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source: Custom Source -> Flat Map -> Filter -> Filter -> Map -> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) -> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (1/4) (ee39866835f7b7387f518e352787a532) switched from SCHEDULED to DEPLOYING.
2019-08-21 20:00:39,659 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying Source: Custom Source -> Flat Map -> Filter -> Filter -> Map -> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) -> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (1/4) (attempt #0) to container_1566383236573_0003_01_000002 @ host53 (dataPort=52456)
2019-08-21 20:00:39,664 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source: Custom Source -> Flat Map -> Filter -> Filter -> Map -> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) -> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (2/4) (f1a893fc5e2440331c8e6db1874ad220) switched from SCHEDULED to DEPLOYING.
2019-08-21 20:00:39,664 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying Source: Custom Source -> Flat Map -> Filter -> Filter -> Map -> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) -> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (2/4) (attempt #0) to container_1566383236573_0003_01_000002 @ host53 (dataPort=52456)
2019-08-21 20:00:39,665 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source: Custom Source -> Flat Map -> Filter -> Filter -> Map -> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) -> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (3/4) (4756a0450881325af5fb396a18dbb8ae) switched from SCHEDULED to DEPLOYING.
2019-08-21 20:00:39,666 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying Source: Custom Source -> Flat Map -> Filter -> Filter -> Map -> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) -> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (3/4) (attempt #0) to container_1566383236573_0003_01_000003 @ host52 (dataPort=39650)
2019-08-21 20:00:39,666 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source: Custom Source -> Flat Map -> Filter -> Filter -> Map -> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) -> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (4/4) (78f520582607e26b365fca483fc98d4c) switched from SCHEDULED to DEPLOYING.
2019-08-21 20:00:39,666 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying Source: Custom Source -> Flat Map -> Filter -> Filter -> Map -> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) -> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (4/4) (attempt #0) to container_1566383236573_0003_01_000003 @ host52 (dataPort=39650)
2019-08-21 20:00:39,666 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - window: (TumblingGroupWindow('w$, 'rowtime, 60000.millis)), select: (COUNT(*) AS pv, COUNT(DISTINCT curuserid) AS uv, start('w$) AS w$start, end('w$) AS w$end, rowtime('w$) AS w$rowtime, proctime('w$) AS w$proctime) -> select: (pv, uv, utc2local(w$end) AS EXPR$2) -> to: Row (1/1) (0071e1c7c4f70646914329f81dcbd349) switched from SCHEDULED to DEPLOYING.
2019-08-21 20:00:39,666 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying window: (TumblingGroupWindow('w$, 'rowtime, 60000.millis)), select: (COUNT(*) AS pv, COUNT(DISTINCT curuserid) AS uv, start('w$) AS w$start, end('w$) AS w$end, rowtime('w$) AS w$rowtime, proctime('w$) AS w$proctime) -> select: (pv, uv, utc2local(w$end) AS EXPR$2) -> to: Row (1/1) (attempt #0) to container_1566383236573_0003_01_000002 @ host53 (dataPort=52456)
2019-08-21 20:00:39,670 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print to Std. Out (1/4) (0a206444b10553cce42fa34ea0b15497) switched from SCHEDULED to DEPLOYING.
2019-08-21 20:00:39,670 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying Sink: Print to Std. Out (1/4) (attempt #0) to container_1566383236573_0003_01_000003 @ host52 (dataPort=39650)
2019-08-21 20:00:39,671 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print to Std. Out (2/4) (a5d1bb759ee6c8001313fe39c7982a8b) switched from SCHEDULED to DEPLOYING.
2019-08-21 20:00:39,672 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying Sink: Print to Std. Out (2/4) (attempt #0) to container_1566383236573_0003_01_000003 @ host52 (dataPort=39650)
2019-08-21 20:00:39,672 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print to Std. Out (3/4) (9543bbde6538ea9d407af3012fede5c8) switched from SCHEDULED to DEPLOYING.
2019-08-21 20:00:39,672 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying Sink: Print to Std. Out (3/4) (attempt #0) to container_1566383236573_0003_01_000002 @ host53 (dataPort=52456)
2019-08-21 20:00:39,672 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print to Std. Out (4/4) (0e8c4210bd81e581fbf354b86397ce41) switched from SCHEDULED to DEPLOYING.
2019-08-21 20:00:39,673 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying Sink: Print to Std. Out (4/4) (attempt #0) to container_1566383236573_0003_01_000002 @ host53 (dataPort=52456)
2019-08-21 20:00:39,673 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map -> Sink: Print to Std. Out (1/4) (39d437bceb033f5bd37747e9142c7f0e) switched from SCHEDULED to DEPLOYING.
2019-08-21 20:00:39,673 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying Map -> Sink: Print to Std. Out (1/4) (attempt #0) to container_1566383236573_0003_01_000003 @ host52 (dataPort=39650)
2019-08-21 20:00:39,675 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map -> Sink: Print to Std. Out (2/4) (aa4f64edc4f10da2e8bdbf6ebf0aeb83) switched from SCHEDULED to DEPLOYING.
2019-08-21 20:00:39,675 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying Map -> Sink: Print to Std. Out (2/4) (attempt #0) to container_1566383236573_0003_01_000003 @ host52 (dataPort=39650)
2019-08-21 20:00:39,675 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map -> Sink: Print to Std. Out (3/4) (f109918ce195d1c88c2c63a44934217f) switched from SCHEDULED to DEPLOYING.
2019-08-21 20:00:39,675 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying Map -> Sink: Print to Std. Out (3/4) (attempt #0) to container_1566383236573_0003_01_000002 @ host53 (dataPort=52456)
2019-08-21 20:00:39,676 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map -> Sink: Print to Std. Out (4/4) (9aa5bb28f3ba75afc1de47bd9a4e5d01) switched from SCHEDULED to DEPLOYING.
2019-08-21 20:00:39,676 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying Map -> Sink: Print to Std. Out (4/4) (attempt #0) to container_1566383236573_0003_01_000002 @ host53 (dataPort=52456)
2019-08-21 20:00:39,782 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print to Std. Out (4/4) (0e8c4210bd81e581fbf354b86397ce41) switched from DEPLOYING to RUNNING.
2019-08-21 20:00:39,783 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print to Std. Out (3/4) (9543bbde6538ea9d407af3012fede5c8) switched from DEPLOYING to RUNNING.
2019-08-21 20:00:39,784 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map -> Sink: Print to Std. Out (4/4) (9aa5bb28f3ba75afc1de47bd9a4e5d01) switched from DEPLOYING to RUNNING.
2019-08-21 20:00:39,787 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map -> Sink: Print to Std. Out (3/4) (f109918ce195d1c88c2c63a44934217f) switched from DEPLOYING to RUNNING.
2019-08-21 20:00:39,806 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source: Custom Source -> Flat Map -> Filter -> Filter -> Map -> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) -> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (2/4) (f1a893fc5e2440331c8e6db1874ad220) switched from DEPLOYING to RUNNING.
2019-08-21 20:00:39,809 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source: Custom Source -> Flat Map -> Filter -> Filter -> Map -> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) -> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (1/4) (ee39866835f7b7387f518e352787a532) switched from DEPLOYING to RUNNING.
2019-08-21 20:00:39,809 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - window: (TumblingGroupWindow('w$, 'rowtime, 60000.millis)), select: (COUNT(*) AS pv, COUNT(DISTINCT curuserid) AS uv, start('w$) AS w$start, end('w$) AS w$end, rowtime('w$) AS w$rowtime, proctime('w$) AS w$proctime) -> select: (pv, uv, utc2local(w$end) AS EXPR$2) -> to: Row (1/1) (0071e1c7c4f70646914329f81dcbd349) switched from DEPLOYING to RUNNING.
2019-08-21 20:00:39,991 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print to Std. Out (1/4) (0a206444b10553cce42fa34ea0b15497) switched from DEPLOYING to RUNNING.
2019-08-21 20:00:39,992 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print to Std. Out (2/4) (a5d1bb759ee6c8001313fe39c7982a8b) switched from DEPLOYING to RUNNING.
2019-08-21 20:00:39,996 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map -> Sink: Print to Std. Out (1/4) (39d437bceb033f5bd37747e9142c7f0e) switched from DEPLOYING to RUNNING.
2019-08-21 20:00:39,997 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map -> Sink: Print to Std. Out (2/4) (aa4f64edc4f10da2e8bdbf6ebf0aeb83) switched from DEPLOYING to RUNNING.
2019-08-21 20:00:40,005 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source: Custom Source -> Flat Map -> Filter -> Filter -> Map -> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) -> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (4/4) (78f520582607e26b365fca483fc98d4c) switched from DEPLOYING to RUNNING.
2019-08-21 20:00:40,006 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source: Custom Source -> Flat Map -> Filter -> Filter -> Map -> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) -> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (3/4) (4756a0450881325af5fb396a18dbb8ae) switched from DEPLOYING to RUNNING.
2019-08-21 20:00:41,605 INFO  org.apache.flink.runtime.checkpoint.CheckpointCoordinator     - Triggering checkpoint 1 @ 1566388841597 for job e5ea83238d721e932b4645589660406c.
2019-08-21 20:00:42,466 INFO  org.apache.flink.runtime.checkpoint.CheckpointCoordinator     - Completed checkpoint 1 for job e5ea83238d721e932b4645589660406c (7994 bytes in 821 ms).



------------------ Original Message ------------------
From: "Yun Tang"<[hidden email]>;
Sent: Thursday, August 22, 2019, 11:08 AM
To: "user-zh"<[hidden email]>;

Subject: Re: Flink job waits 10 minutes at startup



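Hi

A Flink on YARN job can take a long time to start for many reasons, for example waiting because resources are insufficient, or containers exiting right after they are requested. The default slot request timeout is 5 minutes, so my guess is that your job hit a slot request timeout and then requested the slots again. Please provide the JobManager log so that we can analyze this further.

Best regards
Yun Tang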
________________________________
From: 々守护々 <[hidden email]>
Sent: Thursday, August 22, 2019 11:04
To: user-zh <[hidden email]>
Subject: Flink job waits 10 minutes at startup

Hello, Flink community experts!
      I am using Flink SQL (Flink 1.8.1) and adjusted the memory parameters in the yarn-site.xml file of my Hadoop cluster:
        <property>
                <name>yarn.nodemanager.resource.memory-mb</name>
                <value>16384</value>
        </property>
        <property>
                <name>yarn.scheduler.minimum-allocation-mb</name>
                <value>1024</value>
        </property>
        <property>
                <name>yarn.scheduler.maximum-allocation-mb</name>
                <value>8192</value>
        </property>



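Then I started a Flink on YARN job with these launch parameters: ./flink-1.8.1/bin/flink run -m yarn-cluster -p 4 -d -c streaming.StreamKafkaMain -yn 4 -ys 2 -yjm 2048 -ytm 8192  -ynm bigdata6 ./flink_bigdata6.jar
The job does eventually start, but only after waiting 10 minutes, which is far too long. Does anyone know how to fix this?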

Re: Reply: Flink job waits 10 minutes at startup

Yun Tang
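The startup time is 20:00:25, and by 20:00:42 all tasks were RUNNING and even the first checkpoint had completed. That is only 17 seconds in total, so where does the 10-minute problem come from?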
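
The 17-second figure can be checked directly from the log timestamps. Below is a minimal Python sketch, not part of Flink: the helper names `parse_ts` and `startup_summary` are made up for illustration, and it assumes every relevant line starts with a timestamp in the `yyyy-MM-dd HH:mm:ss,SSS` layout seen in this thread. It reports the total elapsed time and the largest pause between consecutive lines, which is where a slot request timeout would likely show up if one had happened.

```python
from datetime import datetime

# Timestamp layout of the log lines in this thread, e.g. "2019-08-21 20:00:25,430".
LOG_TS_FORMAT = "%Y-%m-%d %H:%M:%S,%f"
TS_LENGTH = len("2019-08-21 20:00:25,430")


def parse_ts(line):
    """Return the leading timestamp of a log line, or None if it has none."""
    try:
        return datetime.strptime(line[:TS_LENGTH], LOG_TS_FORMAT)
    except ValueError:
        return None


def startup_summary(lines):
    """Return (total_seconds, largest_gap_seconds, line_after_largest_gap)."""
    stamped = []
    for line in lines:
        ts = parse_ts(line)
        if ts is not None:  # skip continuation lines such as stack traces
            stamped.append((ts, line))
    if len(stamped) < 2:
        raise ValueError("need at least two timestamped log lines")

    total = (stamped[-1][0] - stamped[0][0]).total_seconds()
    largest_gap, gap_line = 0.0, stamped[0][1]
    for (prev_ts, _), (cur_ts, cur_line) in zip(stamped, stamped[1:]):
        gap = (cur_ts - prev_ts).total_seconds()
        if gap > largest_gap:
            largest_gap, gap_line = gap, cur_line
    return total, largest_gap, gap_line


# Three lines copied from the JobManager log in this thread. With only three
# lines the "largest gap" is an artifact of the sampling; feed the whole
# jobmanager.log to get a meaningful value.
sample_lines = [
    "2019-08-21 20:00:25,430 INFO  org.apache.flink.runtime.entrypoint.ClusterEntrypoint         - Registered UNIX signal handlers for [TERM, HUP, INT]",
    "2019-08-21 20:00:28,261 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Job StreamKafkaMain (e5ea83238d721e932b4645589660406c) switched from state CREATED to RUNNING.",
    "2019-08-21 20:00:42,466 INFO  org.apache.flink.runtime.checkpoint.CheckpointCoordinator     - Completed checkpoint 1 for job e5ea83238d721e932b4645589660406c (7994 bytes in 821 ms).",
]

total, gap, line = startup_summary(sample_lines)
print(f"total startup time: {total:.3f} s")
print(f"largest pause: {gap:.3f} s, ending at: {line[:TS_LENGTH]}")
```

To run it against a real log, pass `open("jobmanager.log", encoding="utf-8")` instead of `sample_lines`. A pause close to 300 seconds would be consistent with the 5-minute slot request timeout mentioned earlier in the thread, while a total of about 17 seconds suggests the wait happens somewhere outside the JobManager.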
________________________________
From: 々守护々 <[hidden email]>
Sent: Thursday, August 22, 2019 11:18
To: user-zh <[hidden email]>
Subject: Reply: Flink job waits 10 minutes at startup

Hello, this is my JobManager startup log. Please take a look, thank you!


2019-08-21 20:00:25,428 INFO  org.apache.flink.runtime.entrypoint.ClusterEntrypoint         - --------------------------------------------------------------------------------
2019-08-21 20:00:25,430 INFO  org.apache.flink.runtime.entrypoint.ClusterEntrypoint         - Registered UNIX signal handlers for [TERM, HUP, INT]
2019-08-21 20:00:25,433 INFO  org.apache.flink.runtime.entrypoint.ClusterEntrypoint         - YARN daemon is running as: root Yarn client user obtainer: root
2019-08-21 20:00:25,437 INFO  org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: high-availability.cluster-id, application_1566383236573_0003
2019-08-21 20:00:25,437 INFO  org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: jobmanager.rpc.address, localhost
2019-08-21 20:00:25,437 INFO  org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: jobmanager.rpc.port, 6123
2019-08-21 20:00:25,437 INFO  org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: high-availability.zookeeper.path.root, /flink-yarn
2019-08-21 20:00:25,437 INFO  org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: high-availability.storageDir, hdfs://host51:9000/flink/ha-yarn
2019-08-21 20:00:25,438 INFO  org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: high-availability.zookeeper.quorum, host51:2181
2019-08-21 20:00:25,438 INFO  org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: internal.cluster.execution-mode, DETACHED
2019-08-21 20:00:25,438 INFO  org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: high-availability, zookeeper
2019-08-21 20:00:25,438 INFO  org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: parallelism.default, 1
2019-08-21 20:00:25,438 INFO  org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: yarn.application-attempts, 10
2019-08-21 20:00:25,438 INFO  org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: taskmanager.numberOfTaskSlots, 2
2019-08-21 20:00:25,439 INFO  org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: taskmanager.heap.size, 8192m
2019-08-21 20:00:25,439 INFO  org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: jobmanager.heap.size, 2048m
2019-08-21 20:00:25,465 WARN  org.apache.flink.configuration.Configuration                  - Config uses deprecated configuration key 'web.port' instead of proper key 'rest.bind-port'
2019-08-21 20:00:25,469 INFO  org.apache.flink.runtime.clusterframework.BootstrapTools      - Setting directories for temporary files to: /usr/hadoop/hadoop-2.7.7/tmp/nm-local-dir/usercache/root/appcache/application_1566383236573_0003
2019-08-21 20:00:25,485 INFO  org.apache.flink.runtime.entrypoint.ClusterEntrypoint         - Starting YarnJobClusterEntrypoint.
2019-08-21 20:00:25,485 INFO  org.apache.flink.runtime.entrypoint.ClusterEntrypoint         - Install default filesystem.
2019-08-21 20:00:25,561 INFO  org.apache.flink.runtime.security.modules.HadoopModule        - Hadoop user set to root (auth:SIMPLE)
2019-08-21 20:00:25,580 INFO  org.apache.flink.runtime.entrypoint.ClusterEntrypoint         - Initializing cluster services.
2019-08-21 20:00:25,757 INFO  org.apache.flink.runtime.rpc.akka.AkkaRpcServiceUtils         - Trying to start actor system at host53:0
2019-08-21 20:00:26,187 INFO  akka.event.slf4j.Slf4jLogger                                  - Slf4jLogger started
2019-08-21 20:00:26,251 INFO  akka.remote.Remoting                                          - Starting remoting
2019-08-21 20:00:26,439 INFO  akka.remote.Remoting                                          - Remoting started; listening on addresses :[akka.tcp://flink@host53:59609]
2019-08-21 20:00:26,447 INFO  org.apache.flink.runtime.rpc.akka.AkkaRpcServiceUtils         - Actor system started at akka.tcp://flink@host53:59609
2019-08-21 20:00:27,011 INFO  org.apache.flink.runtime.blob.FileSystemBlobStore             - Creating highly available BLOB storage directory at hdfs://host51:9000/flink/ha-yarn/application_1566383236573_0003/blob
2019-08-21 20:00:27,097 INFO  org.apache.flink.runtime.util.ZooKeeperUtils                  - Enforcing default ACL for ZK connections
2019-08-21 20:00:27,097 INFO  org.apache.flink.runtime.util.ZooKeeperUtils                  - Using '/flink-yarn/application_1566383236573_0003' as Zookeeper namespace.
2019-08-21 20:00:27,161 INFO  org.apache.flink.shaded.curator.org.apache.curator.framework.imps.CuratorFrameworkImpl  - Starting
2019-08-21 20:00:27,169 INFO  org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client environment:zookeeper.version=3.4.10-39d3a4f269333c922ed3db283be479f9deacaa0f, built on 03/23/2017 10:13 GMT
2019-08-21 20:00:27,169 INFO  org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client environment:host.name=host53
2019-08-21 20:00:27,169 INFO  org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client environment:java.version=1.8.0_77
2019-08-21 20:00:27,169 INFO  org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client environment:java.vendor=Oracle Corporation
2019-08-21 20:00:27,169 INFO  org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client
2019-08-21 20:00:27,170 INFO  org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client environment:java.library.path=:/usr/hadoop/hadoop-2.7.7/lib/native:/usr/hadoop/hadoop-2.7.7/lib/native:/usr/java/packages/lib/amd64:/usr/lib64:/lib64:/lib:/usr/lib
2019-08-21 20:00:27,170 INFO  org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client environment:java.io.tmpdir=/tmp
2019-08-21 20:00:27,170 INFO  org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client environment:java.compiler=<NA>
2019-08-21 20:00:27,170 INFO  org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client environment:os.name=Linux
2019-08-21 20:00:27,170 INFO  org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client environment:os.arch=amd64
2019-08-21 20:00:27,170 INFO  org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client environment:os.version=3.16.0-0.bpo.4-amd64
2019-08-21 20:00:27,170 INFO  org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client environment:user.name=root
2019-08-21 20:00:27,170 INFO  org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client environment:user.home=/root
2019-08-21 20:00:27,170 INFO  org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client environment:user.dir=/usr/hadoop/hadoop-2.7.7/tmp/nm-local-dir/usercache/root/appcache/application_1566383236573_0003/container_1566383236573_0003_01_000001
2019-08-21 20:00:27,171 INFO  org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Initiating client connection, connectString=host51:2181 sessionTimeout=60000 watcher=org.apache.flink.shaded.curator.org.apache.curator.ConnectionState@319854f0
2019-08-21 20:00:27,184 WARN  org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ClientCnxn  - SASL configuration failed: javax.security.auth.login.LoginException: No JAAS configuration section named 'Client' was found in specified JAAS configuration file: '/tmp/jaas-4274640356638794035.conf'. Will continue connection to Zookeeper server without SASL authentication, if Zookeeper server allows it.
2019-08-21 20:00:27,186 INFO  org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ClientCnxn  - Opening socket connection to server host51/10.10.10.51:2181
2019-08-21 20:00:27,187 ERROR org.apache.flink.shaded.curator.org.apache.curator.ConnectionState  - Authentication failed
2019-08-21 20:00:27,187 INFO  org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ClientCnxn  - Socket connection established to host51/10.10.10.51:2181, initiating session
2019-08-21 20:00:27,193 INFO  org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ClientCnxn  - Session establishment complete on server host51/10.10.10.51:2181, sessionid = 0x1001a86de810008, negotiated timeout = 40000
2019-08-21 20:00:27,193 INFO  org.apache.flink.runtime.blob.BlobServer                      - Created BLOB server storage directory /usr/hadoop/hadoop-2.7.7/tmp/nm-local-dir/usercache/root/appcache/application_1566383236573_0003/blobStore-eaa2bb2b-cddb-42e3-8ce5-bf2f83e8567a
2019-08-21 20:00:27,195 INFO  org.apache.flink.shaded.curator.org.apache.curator.framework.state.ConnectionStateManager  - State change: CONNECTED
2019-08-21 20:00:27,198 INFO  org.apache.flink.runtime.blob.BlobServer                      - Started BLOB server at 0.0.0.0:58591 - max concurrent requests: 50 - max backlog: 1000
2019-08-21 20:00:27,213 INFO  org.apache.flink.runtime.metrics.MetricRegistryImpl           - No metrics reporter configured, no metrics will be exposed/reported.
2019-08-21 20:00:27,215 INFO  org.apache.flink.runtime.entrypoint.ClusterEntrypoint         - Trying to start actor system at host53:0
2019-08-21 20:00:27,239 INFO  akka.event.slf4j.Slf4jLogger                                  - Slf4jLogger started
2019-08-21 20:00:27,247 INFO  akka.remote.Remoting                                          - Starting remoting
2019-08-21 20:00:27,255 INFO  akka.remote.Remoting                                          - Remoting started; listening on addresses :[akka.tcp://flink-metrics@host53:59941]
2019-08-21 20:00:27,257 INFO  org.apache.flink.runtime.entrypoint.ClusterEntrypoint         - Actor system started at akka.tcp://flink-metrics@host53:59941
2019-08-21 20:00:27,263 INFO  org.apache.flink.runtime.blob.TransientBlobCache              - Created BLOB cache storage directory /usr/hadoop/hadoop-2.7.7/tmp/nm-local-dir/usercache/root/appcache/application_1566383236573_0003/blobStore-405a4797-824f-48da-bb50-2ee2e230d249
2019-08-21 20:00:27,296 WARN  org.apache.flink.configuration.Configuration                  - Config uses deprecated configuration key 'web.port' instead of proper key 'rest.bind-port'
2019-08-21 20:00:27,298 WARN  org.apache.flink.runtime.jobmaster.MiniDispatcherRestEndpoint  - Upload directory /tmp/flink-web-57c9cddd-64af-4e3d-96a9-4f9e78d20b86/flink-web-upload does not exist, or has been deleted externally. Previously uploaded files are no longer available.
2019-08-21 20:00:27,298 INFO  org.apache.flink.runtime.jobmaster.MiniDispatcherRestEndpoint  - Created directory /tmp/flink-web-57c9cddd-64af-4e3d-96a9-4f9e78d20b86/flink-web-upload for file uploads.
2019-08-21 20:00:27,320 INFO  org.apache.flink.runtime.jobmaster.MiniDispatcherRestEndpoint  - Starting rest endpoint.
2019-08-21 20:00:27,616 INFO  org.apache.flink.runtime.webmonitor.WebMonitorUtils           - Determined location of main cluster component log file: /usr/hadoop/hadoop-2.7.7/logs/userlogs/application_1566383236573_0003/container_1566383236573_0003_01_000001/jobmanager.log
2019-08-21 20:00:27,616 INFO  org.apache.flink.runtime.webmonitor.WebMonitorUtils           - Determined location of main cluster component stdout file: /usr/hadoop/hadoop-2.7.7/logs/userlogs/application_1566383236573_0003/container_1566383236573_0003_01_000001/jobmanager.out
2019-08-21 20:00:27,771 INFO  org.apache.flink.runtime.jobmaster.MiniDispatcherRestEndpoint  - Rest endpoint listening at host53:46401
2019-08-21 20:00:27,772 INFO  org.apache.flink.runtime.leaderelection.ZooKeeperLeaderElectionService  - Starting ZooKeeperLeaderElectionService ZooKeeperLeaderElectionService{leaderPath='/leader/rest_server_lock'}.
2019-08-21 20:00:27,795 WARN  org.apache.flink.shaded.curator.org.apache.curator.utils.ZKPaths  - The version of ZooKeeper being used doesn't support Container nodes. CreateMode.PERSISTENT will be used instead.
2019-08-21 20:00:27,807 INFO  org.apache.flink.runtime.jobmaster.MiniDispatcherRestEndpoint  - Web frontend listening at http://host53:46401.
2019-08-21 20:00:27,836 INFO  org.apache.flink.runtime.jobmaster.MiniDispatcherRestEndpoint  - http://host53:46401 was granted leadership with leaderSessionID=8487757f-fd82-42b2-a1a6-adcc69e1eba1
2019-08-21 20:00:27,932 INFO  org.apache.flink.runtime.rpc.akka.AkkaRpcService              - Starting RPC endpoint for org.apache.flink.yarn.YarnResourceManager at akka://flink/user/resourcemanager .
2019-08-21 20:00:28,021 INFO  org.apache.flink.runtime.rpc.akka.AkkaRpcService              - Starting RPC endpoint for org.apache.flink.runtime.dispatcher.MiniDispatcher at akka://flink/user/dispatcher .
2019-08-21 20:00:28,033 INFO  org.apache.flink.runtime.leaderretrieval.ZooKeeperLeaderRetrievalService  - Starting ZooKeeperLeaderRetrievalService /leader/resource_manager_lock.
2019-08-21 20:00:28,033 INFO  org.apache.flink.runtime.leaderretrieval.ZooKeeperLeaderRetrievalService  - Starting ZooKeeperLeaderRetrievalService /leader/dispatcher_lock.
2019-08-21 20:00:28,033 INFO  org.apache.flink.runtime.leaderelection.ZooKeeperLeaderElectionService  - Starting ZooKeeperLeaderElectionService ZooKeeperLeaderElectionService{leaderPath='/leader/dispatcher_lock'}.
2019-08-21 20:00:28,045 INFO  org.apache.flink.runtime.dispatcher.MiniDispatcher            - Dispatcher akka.tcp://flink@host53:59609/user/dispatcher was granted leadership with fencing token 6216d17e-56b1-43bb-9bb3-f9fa5b8921be
2019-08-21 20:00:28,048 INFO  org.apache.flink.runtime.dispatcher.MiniDispatcher            - Recovering all persisted jobs.
2019-08-21 20:00:28,054 INFO  org.apache.hadoop.yarn.client.RMProxy                         - Connecting to ResourceManager at host51/10.10.10.51:8030
2019-08-21 20:00:28,078 INFO  org.apache.flink.runtime.rpc.akka.AkkaRpcService              - Starting RPC endpoint for org.apache.flink.runtime.jobmaster.JobMaster at akka://flink/user/jobmanager_0 .
2019-08-21 20:00:28,086 INFO  org.apache.flink.runtime.jobmaster.JobMaster                  - Initializing job StreamKafkaMain (e5ea83238d721e932b4645589660406c).
2019-08-21 20:00:28,099 INFO  org.apache.flink.runtime.jobmaster.JobMaster                  - Using restart strategy FixedDelayRestartStrategy(maxNumberRestartAttempts=2147483647, delayBetweenRestartAttempts=0) for StreamKafkaMain (e5ea83238d721e932b4645589660406c).
2019-08-21 20:00:28,122 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Job recovers via failover strategy: full graph restart
2019-08-21 20:00:28,153 INFO  org.apache.flink.runtime.jobmaster.JobMaster                  - Running initialization on master for job StreamKafkaMain (e5ea83238d721e932b4645589660406c).
2019-08-21 20:00:28,154 INFO  org.apache.flink.runtime.jobmaster.JobMaster                  - Successfully ran initialization on master in 0 ms.
2019-08-21 20:00:28,187 INFO  org.apache.flink.runtime.util.ZooKeeperUtils                  - Initialized ZooKeeperCompletedCheckpointStore in '/checkpoints/e5ea83238d721e932b4645589660406c'.
2019-08-21 20:00:28,196 INFO  org.apache.flink.yarn.YarnResourceManager                     - Recovered 0 containers from previous attempts ([]).
2019-08-21 20:00:28,199 INFO  org.apache.hadoop.yarn.client.api.impl.ContainerManagementProtocolProxy  - yarn.client.max-cached-nodemanagers-proxies : 0
2019-08-21 20:00:28,200 INFO  org.apache.flink.runtime.leaderelection.ZooKeeperLeaderElectionService  - Starting ZooKeeperLeaderElectionService ZooKeeperLeaderElectionService{leaderPath='/leader/resource_manager_lock'}.
2019-08-21 20:00:28,203 INFO  org.apache.flink.runtime.jobmaster.JobMaster                  - No state backend has been configured, using default (Memory / JobManager) MemoryStateBackend (data in heap memory / checkpoints to JobManager) (checkpoints: 'null', savepoints: 'null', asynchronous: TRUE, maxStateSize: 5242880)
2019-08-21 20:00:28,211 INFO  org.apache.flink.yarn.YarnResourceManager                     - ResourceManager akka.tcp://flink@host53:59609/user/resourcemanager was granted leadership with fencing token 97537c0d61bbf091e37cb6c45b784568
2019-08-21 20:00:28,214 INFO  org.apache.flink.runtime.resourcemanager.slotmanager.SlotManager  - Starting the SlotManager.
2019-08-21 20:00:28,225 INFO  org.apache.flink.runtime.checkpoint.ZooKeeperCompletedCheckpointStore  - Recovering checkpoints from ZooKeeper.
2019-08-21 20:00:28,229 INFO  org.apache.flink.runtime.checkpoint.ZooKeeperCompletedCheckpointStore  - Found 0 checkpoints in ZooKeeper.
2019-08-21 20:00:28,229 INFO  org.apache.flink.runtime.checkpoint.ZooKeeperCompletedCheckpointStore  - Trying to fetch 0 checkpoints from storage.
2019-08-21 20:00:28,233 INFO  org.apache.flink.runtime.leaderelection.ZooKeeperLeaderElectionService  - Starting ZooKeeperLeaderElectionService ZooKeeperLeaderElectionService{leaderPath='/leader/e5ea83238d721e932b4645589660406c/job_manager_lock'}.
2019-08-21 20:00:28,248 INFO  org.apache.flink.runtime.jobmaster.JobManagerRunner           - JobManager runner for job StreamKafkaMain (e5ea83238d721e932b4645589660406c) was granted leadership with session id 12c96c94-ebbd-4991-a457-6f0320d71bc1 at akka.tcp://flink@host53:59609/user/jobmanager_0.
2019-08-21 20:00:28,260 INFO  org.apache.flink.runtime.leaderretrieval.ZooKeeperLeaderRetrievalService  - Starting ZooKeeperLeaderRetrievalService /leader/resource_manager_lock.
2019-08-21 20:00:28,260 INFO  org.apache.flink.runtime.jobmaster.JobMaster                  - Starting execution of job StreamKafkaMain (e5ea83238d721e932b4645589660406c) under job master id a4576f0320d71bc112c96c94ebbd4991.
2019-08-21 20:00:28,261 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Job StreamKafkaMain (e5ea83238d721e932b4645589660406c) switched from state CREATED to RUNNING.
2019-08-21 20:00:28,266 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source: Custom Source -> Flat Map -> Filter -> Filter -> Map -> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) -> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (1/4) (ee39866835f7b7387f518e352787a532) switched from CREATED to SCHEDULED.
2019-08-21 20:00:28,280 INFO  org.apache.flink.runtime.jobmaster.slotpool.SlotPoolImpl      - Cannot serve slot request, no ResourceManager connected. Adding as pending request [SlotRequestId{f6123820ffb8b3e906b876e9dea4497b}]
2019-08-21 20:00:28,287 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source: Custom Source -> Flat Map -> Filter -> Filter -> Map -> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) -> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (2/4) (f1a893fc5e2440331c8e6db1874ad220) switched from CREATED to SCHEDULED.
2019-08-21 20:00:28,288 INFO  org.apache.flink.runtime.jobmaster.slotpool.SlotPoolImpl      - Cannot serve slot request, no ResourceManager connected. Adding as pending request [SlotRequestId{fdbb2e71c9bc795452687396abff7bf8}]
2019-08-21 20:00:28,288 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source: Custom Source -> Flat Map -> Filter -> Filter -> Map -> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) -> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (3/4) (4756a0450881325af5fb396a18dbb8ae) switched from CREATED to SCHEDULED.
2019-08-21 20:00:28,289 INFO  org.apache.flink.runtime.jobmaster.slotpool.SlotPoolImpl      - Cannot serve slot request, no ResourceManager connected. Adding as pending request [SlotRequestId{13970fab9312acf9d5063b1d1a33aa0e}]
2019-08-21 20:00:28,289 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source: Custom Source -> Flat Map -> Filter -> Filter -> Map -> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) -> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (4/4) (78f520582607e26b365fca483fc98d4c) switched from CREATED to SCHEDULED.
2019-08-21 20:00:28,289 INFO  org.apache.flink.runtime.jobmaster.slotpool.SlotPoolImpl      - Cannot serve slot request, no ResourceManager connected. Adding as pending request [SlotRequestId{0621470f796aa3ea4291d1b8b423e428}]
2019-08-21 20:00:28,289 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - window: (TumblingGroupWindow('w$, 'rowtime, 60000.millis)), select: (COUNT(*) AS pv, COUNT(DISTINCT curuserid) AS uv, start('w$) AS w$start, end('w$) AS w$end, rowtime('w$) AS w$rowtime, proctime('w$) AS w$proctime) -> select: (pv, uv, utc2local(w$end) AS EXPR$2) -> to: Row (1/1) (0071e1c7c4f70646914329f81dcbd349) switched from CREATED to SCHEDULED.
2019-08-21 20:00:28,291 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print to Std. Out (1/4) (0a206444b10553cce42fa34ea0b15497) switched from CREATED to SCHEDULED.
2019-08-21 20:00:28,291 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print to Std. Out (2/4) (a5d1bb759ee6c8001313fe39c7982a8b) switched from CREATED to SCHEDULED.
2019-08-21 20:00:28,291 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print to Std. Out (3/4) (9543bbde6538ea9d407af3012fede5c8) switched from CREATED to SCHEDULED.
2019-08-21 20:00:28,291 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print to Std. Out (4/4) (0e8c4210bd81e581fbf354b86397ce41) switched from CREATED to SCHEDULED.
2019-08-21 20:00:28,291 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map -> Sink: Print to Std. Out (1/4) (39d437bceb033f5bd37747e9142c7f0e) switched from CREATED to SCHEDULED.
2019-08-21 20:00:28,291 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map -> Sink: Print to Std. Out (2/4) (aa4f64edc4f10da2e8bdbf6ebf0aeb83) switched from CREATED to SCHEDULED.
2019-08-21 20:00:28,291 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map -> Sink: Print to Std. Out (3/4) (f109918ce195d1c88c2c63a44934217f) switched from CREATED to SCHEDULED.
2019-08-21 20:00:28,292 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map -> Sink: Print to Std. Out (4/4) (9aa5bb28f3ba75afc1de47bd9a4e5d01) switched from CREATED to SCHEDULED.
2019-08-21 20:00:28,295 INFO  org.apache.flink.runtime.jobmaster.JobMaster                  - Connecting to ResourceManager akka.tcp://flink@host53:59609/user/resourcemanager(97537c0d61bbf091e37cb6c45b784568)
2019-08-21 20:00:28,300 INFO  org.apache.flink.runtime.jobmaster.JobMaster                  - Resolved ResourceManager address, beginning registration
2019-08-21 20:00:28,300 INFO  org.apache.flink.runtime.jobmaster.JobMaster                  - Registration at ResourceManager attempt 1 (timeout=100ms)
2019-08-21 20:00:28,304 INFO  org.apache.flink.runtime.leaderretrieval.ZooKeeperLeaderRetrievalService  - Starting ZooKeeperLeaderRetrievalService /leader/e5ea83238d721e932b4645589660406c/job_manager_lock.
2019-08-21 20:00:28,304 INFO  org.apache.flink.yarn.YarnResourceManager                     - Registering job manager [hidden email]://flink@host53:59609/user/jobmanager_0 for job e5ea83238d721e932b4645589660406c.
2019-08-21 20:00:28,310 INFO  org.apache.flink.yarn.YarnResourceManager                     - Registered job manager [hidden email]://flink@host53:59609/user/jobmanager_0 for job e5ea83238d721e932b4645589660406c.
2019-08-21 20:00:28,313 INFO  org.apache.flink.runtime.jobmaster.JobMaster                  - JobManager successfully registered at ResourceManager, leader id: 97537c0d61bbf091e37cb6c45b784568.
2019-08-21 20:00:28,314 INFO  org.apache.flink.runtime.jobmaster.slotpool.SlotPoolImpl      - Requesting new slot [SlotRequestId{13970fab9312acf9d5063b1d1a33aa0e}] and profile ResourceProfile{cpuCores=-1.0, heapMemoryInMB=-1, directMemoryInMB=0, nativeMemoryInMB=0, networkMemoryInMB=0} from resource manager.
2019-08-21 20:00:28,315 INFO  org.apache.flink.yarn.YarnResourceManager                     - Request slot with profile ResourceProfile{cpuCores=-1.0, heapMemoryInMB=-1, directMemoryInMB=0, nativeMemoryInMB=0, networkMemoryInMB=0} for job e5ea83238d721e932b4645589660406c with allocation id faa6a521758e5be4b3dc1955ca0f51b4.
2019-08-21 20:00:28,316 INFO  org.apache.flink.runtime.jobmaster.slotpool.SlotPoolImpl      - Requesting new slot [SlotRequestId{fdbb2e71c9bc795452687396abff7bf8}] and profile ResourceProfile{cpuCores=-1.0, heapMemoryInMB=-1, directMemoryInMB=0, nativeMemoryInMB=0, networkMemoryInMB=0} from resource manager.
2019-08-21 20:00:28,316 INFO  org.apache.flink.runtime.jobmaster.slotpool.SlotPoolImpl      - Requesting new slot [SlotRequestId{0621470f796aa3ea4291d1b8b423e428}] and profile ResourceProfile{cpuCores=-1.0, heapMemoryInMB=-1, directMemoryInMB=0, nativeMemoryInMB=0, networkMemoryInMB=0} from resource manager.
2019-08-21 20:00:28,316 INFO  org.apache.flink.runtime.jobmaster.slotpool.SlotPoolImpl      - Requesting new slot [SlotRequestId{f6123820ffb8b3e906b876e9dea4497b}] and profile ResourceProfile{cpuCores=-1.0, heapMemoryInMB=-1, directMemoryInMB=0, nativeMemoryInMB=0, networkMemoryInMB=0} from resource manager.
2019-08-21 20:00:28,321 INFO  org.apache.flink.yarn.YarnResourceManager                     - Requesting new TaskExecutor container with resources <memory:8192, vCores:2>. Number pending requests 1.
2019-08-21 20:00:28,322 INFO  org.apache.flink.yarn.YarnResourceManager                     - Request slot with profile ResourceProfile{cpuCores=-1.0, heapMemoryInMB=-1, directMemoryInMB=0, nativeMemoryInMB=0, networkMemoryInMB=0} for job e5ea83238d721e932b4645589660406c with allocation id a2b419aea4a373686eeed6c46163a528.
2019-08-21 20:00:28,323 INFO  org.apache.flink.yarn.YarnResourceManager                     - Request slot with profile ResourceProfile{cpuCores=-1.0, heapMemoryInMB=-1, directMemoryInMB=0, nativeMemoryInMB=0, networkMemoryInMB=0} for job e5ea83238d721e932b4645589660406c with allocation id cbc11737ce6729cfe45f24c43180b5c8.
2019-08-21 20:00:28,323 INFO  org.apache.flink.yarn.YarnResourceManager                     - Requesting new TaskExecutor container with resources <memory:8192, vCores:2>. Number pending requests 2.
2019-08-21 20:00:28,323 INFO  org.apache.flink.yarn.YarnResourceManager                     - Request slot with profile ResourceProfile{cpuCores=-1.0, heapMemoryInMB=-1, directMemoryInMB=0, nativeMemoryInMB=0, networkMemoryInMB=0} for job e5ea83238d721e932b4645589660406c with allocation id f6566567c173a55b2bc52b72bcb24ff8.
2019-08-21 20:00:31,597 INFO  org.apache.flink.runtime.checkpoint.CheckpointCoordinator     - Checkpoint triggering task Source: Custom Source -> Flat Map -> Filter -> Filter -> Map -> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) -> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (1/4) of job e5ea83238d721e932b4645589660406c is not in state RUNNING but SCHEDULED instead. Aborting checkpoint.
2019-08-21 20:00:33,749 INFO  org.apache.hadoop.yarn.client.api.impl.AMRMClientImpl         - Received new token for : host53:59938
2019-08-21 20:00:33,749 INFO  org.apache.hadoop.yarn.client.api.impl.AMRMClientImpl         - Received new token for : host52:33246
2019-08-21 20:00:33,765 INFO  org.apache.flink.yarn.YarnResourceManager                     - Received new container: container_1566383236573_0003_01_000002 - Remaining pending container requests: 2
2019-08-21 20:00:33,765 INFO  org.apache.flink.yarn.YarnResourceManager                     - Removing container request Capability[<memory:8192, vCores:2>]Priority[1]. Pending container requests 1.
2019-08-21 20:00:34,009 INFO  org.apache.flink.yarn.YarnResourceManager                     - Creating container launch context for TaskManagers
2019-08-21 20:00:34,010 INFO  org.apache.flink.yarn.YarnResourceManager                     - Starting TaskManagers
2019-08-21 20:00:34,021 INFO  org.apache.hadoop.yarn.client.api.impl.ContainerManagementProtocolProxy  - Opening proxy : host53:59938
2019-08-21 20:00:34,066 INFO  org.apache.flink.yarn.YarnResourceManager                     - Received new container: container_1566383236573_0003_01_000003 - Remaining pending container requests: 1
2019-08-21 20:00:34,066 INFO  org.apache.flink.yarn.YarnResourceManager                     - Removing container request Capability[<memory:8192, vCores:2>]Priority[1]. Pending container requests 0.
2019-08-21 20:00:34,088 INFO  org.apache.flink.yarn.YarnResourceManager                     - Creating container launch context for TaskManagers
2019-08-21 20:00:34,089 INFO  org.apache.flink.yarn.YarnResourceManager                     - Starting TaskManagers
2019-08-21 20:00:34,090 INFO  org.apache.hadoop.yarn.client.api.impl.ContainerManagementProtocolProxy  - Opening proxy : host52:33246
2019-08-21 20:00:36,597 INFO  org.apache.flink.runtime.checkpoint.CheckpointCoordinator     - Checkpoint triggering task Source: Custom Source -> Flat Map -> Filter -> Filter -> Map -> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) -> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (1/4) of job e5ea83238d721e932b4645589660406c is not in state RUNNING but SCHEDULED instead. Aborting checkpoint.
2019-08-21 20:00:37,856 INFO  org.apache.flink.yarn.YarnResourceManager                     - Registering TaskManager with ResourceID container_1566383236573_0003_01_000002 (akka.tcp://flink@host53:34608/user/taskmanager_0) at ResourceManager
2019-08-21 20:00:39,573 INFO  org.apache.flink.yarn.YarnResourceManager                     - Registering TaskManager with ResourceID container_1566383236573_0003_01_000003 (akka.tcp://flink@host52:32931/user/taskmanager_0) at ResourceManager
2019-08-21 20:00:39,659 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source: Custom Source -> Flat Map -> Filter -> Filter -> Map -> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) -> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (1/4) (ee39866835f7b7387f518e352787a532) switched from SCHEDULED to DEPLOYING.
2019-08-21 20:00:39,659 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying Source: Custom Source -> Flat Map -> Filter -> Filter -> Map -> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) -> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (1/4) (attempt #0) to container_1566383236573_0003_01_000002 @ host53 (dataPort=52456)
2019-08-21 20:00:39,664 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source: Custom Source -> Flat Map -> Filter -> Filter -> Map -> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) -> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (2/4) (f1a893fc5e2440331c8e6db1874ad220) switched from SCHEDULED to DEPLOYING.
2019-08-21 20:00:39,664 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying Source: Custom Source -> Flat Map -> Filter -> Filter -> Map -> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) -> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (2/4) (attempt #0) to container_1566383236573_0003_01_000002 @ host53 (dataPort=52456)
2019-08-21 20:00:39,665 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source: Custom Source -> Flat Map -> Filter -> Filter -> Map -> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) -> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (3/4) (4756a0450881325af5fb396a18dbb8ae) switched from SCHEDULED to DEPLOYING.
2019-08-21 20:00:39,666 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying Source: Custom Source -> Flat Map -> Filter -> Filter -> Map -> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) -> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (3/4) (attempt #0) to container_1566383236573_0003_01_000003 @ host52 (dataPort=39650)
2019-08-21 20:00:39,666 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source: Custom Source -> Flat Map -> Filter -> Filter -> Map -> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) -> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (4/4) (78f520582607e26b365fca483fc98d4c) switched from SCHEDULED to DEPLOYING.
2019-08-21 20:00:39,666 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying Source: Custom Source -> Flat Map -> Filter -> Filter -> Map -> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) -> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (4/4) (attempt #0) to container_1566383236573_0003_01_000003 @ host52 (dataPort=39650)
2019-08-21 20:00:39,666 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - window: (TumblingGroupWindow('w$, 'rowtime, 60000.millis)), select: (COUNT(*) AS pv, COUNT(DISTINCT curuserid) AS uv, start('w$) AS w$start, end('w$) AS w$end, rowtime('w$) AS w$rowtime, proctime('w$) AS w$proctime) -> select: (pv, uv, utc2local(w$end) AS EXPR$2) -> to: Row (1/1) (0071e1c7c4f70646914329f81dcbd349) switched from SCHEDULED to DEPLOYING.
2019-08-21 20:00:39,666 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying window: (TumblingGroupWindow('w$, 'rowtime, 60000.millis)), select: (COUNT(*) AS pv, COUNT(DISTINCT curuserid) AS uv, start('w$) AS w$start, end('w$) AS w$end, rowtime('w$) AS w$rowtime, proctime('w$) AS w$proctime) -> select: (pv, uv, utc2local(w$end) AS EXPR$2) -> to: Row (1/1) (attempt #0) to container_1566383236573_0003_01_000002 @ host53 (dataPort=52456)
2019-08-21 20:00:39,670 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print to Std. Out (1/4) (0a206444b10553cce42fa34ea0b15497) switched from SCHEDULED to DEPLOYING.
2019-08-21 20:00:39,670 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying Sink: Print to Std. Out (1/4) (attempt #0) to container_1566383236573_0003_01_000003 @ host52 (dataPort=39650)
2019-08-21 20:00:39,671 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print to Std. Out (2/4) (a5d1bb759ee6c8001313fe39c7982a8b) switched from SCHEDULED to DEPLOYING.
2019-08-21 20:00:39,672 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying Sink: Print to Std. Out (2/4) (attempt #0) to container_1566383236573_0003_01_000003 @ host52 (dataPort=39650)
2019-08-21 20:00:39,672 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print to Std. Out (3/4) (9543bbde6538ea9d407af3012fede5c8) switched from SCHEDULED to DEPLOYING.
2019-08-21 20:00:39,672 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying Sink: Print to Std. Out (3/4) (attempt #0) to container_1566383236573_0003_01_000002 @ host53 (dataPort=52456)
2019-08-21 20:00:39,672 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print to Std. Out (4/4) (0e8c4210bd81e581fbf354b86397ce41) switched from SCHEDULED to DEPLOYING.
2019-08-21 20:00:39,673 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying Sink: Print to Std. Out (4/4) (attempt #0) to container_1566383236573_0003_01_000002 @ host53 (dataPort=52456)
2019-08-21 20:00:39,673 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map -> Sink: Print to Std. Out (1/4) (39d437bceb033f5bd37747e9142c7f0e) switched from SCHEDULED to DEPLOYING.
2019-08-21 20:00:39,673 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying Map -> Sink: Print to Std. Out (1/4) (attempt #0) to container_1566383236573_0003_01_000003 @ host52 (dataPort=39650)
2019-08-21 20:00:39,675 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map -> Sink: Print to Std. Out (2/4) (aa4f64edc4f10da2e8bdbf6ebf0aeb83) switched from SCHEDULED to DEPLOYING.
2019-08-21 20:00:39,675 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying Map -> Sink: Print to Std. Out (2/4) (attempt #0) to container_1566383236573_0003_01_000003 @ host52 (dataPort=39650)
2019-08-21 20:00:39,675 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map -> Sink: Print to Std. Out (3/4) (f109918ce195d1c88c2c63a44934217f) switched from SCHEDULED to DEPLOYING.
2019-08-21 20:00:39,675 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying Map -> Sink: Print to Std. Out (3/4) (attempt #0) to container_1566383236573_0003_01_000002 @ host53 (dataPort=52456)
2019-08-21 20:00:39,676 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map -> Sink: Print to Std. Out (4/4) (9aa5bb28f3ba75afc1de47bd9a4e5d01) switched from SCHEDULED to DEPLOYING.
2019-08-21 20:00:39,676 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying Map -> Sink: Print to Std. Out (4/4) (attempt #0) to container_1566383236573_0003_01_000002 @ host53 (dataPort=52456)
2019-08-21 20:00:39,782 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print to Std. Out (4/4) (0e8c4210bd81e581fbf354b86397ce41) switched from DEPLOYING to RUNNING.
2019-08-21 20:00:39,783 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print to Std. Out (3/4) (9543bbde6538ea9d407af3012fede5c8) switched from DEPLOYING to RUNNING.
2019-08-21 20:00:39,784 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map -> Sink: Print to Std. Out (4/4) (9aa5bb28f3ba75afc1de47bd9a4e5d01) switched from DEPLOYING to RUNNING.
2019-08-21 20:00:39,787 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map -> Sink: Print to Std. Out (3/4) (f109918ce195d1c88c2c63a44934217f) switched from DEPLOYING to RUNNING.
2019-08-21 20:00:39,806 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source: Custom Source -> Flat Map -> Filter -> Filter -> Map -> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) -> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (2/4) (f1a893fc5e2440331c8e6db1874ad220) switched from DEPLOYING to RUNNING.
2019-08-21 20:00:39,809 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source: Custom Source -> Flat Map -> Filter -> Filter -> Map -> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) -> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (1/4) (ee39866835f7b7387f518e352787a532) switched from DEPLOYING to RUNNING.
2019-08-21 20:00:39,809 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - window: (TumblingGroupWindow('w$, 'rowtime, 60000.millis)), select: (COUNT(*) AS pv, COUNT(DISTINCT curuserid) AS uv, start('w$) AS w$start, end('w$) AS w$end, rowtime('w$) AS w$rowtime, proctime('w$) AS w$proctime) -> select: (pv, uv, utc2local(w$end) AS EXPR$2) -> to: Row (1/1) (0071e1c7c4f70646914329f81dcbd349) switched from DEPLOYING to RUNNING.
2019-08-21 20:00:39,991 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print to Std. Out (1/4) (0a206444b10553cce42fa34ea0b15497) switched from DEPLOYING to RUNNING.
2019-08-21 20:00:39,992 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print to Std. Out (2/4) (a5d1bb759ee6c8001313fe39c7982a8b) switched from DEPLOYING to RUNNING.
2019-08-21 20:00:39,996 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map -> Sink: Print to Std. Out (1/4) (39d437bceb033f5bd37747e9142c7f0e) switched from DEPLOYING to RUNNING.
2019-08-21 20:00:39,997 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map -> Sink: Print to Std. Out (2/4) (aa4f64edc4f10da2e8bdbf6ebf0aeb83) switched from DEPLOYING to RUNNING.
2019-08-21 20:00:40,005 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source: Custom Source -> Flat Map -> Filter -> Filter -> Map -> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) -> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (4/4) (78f520582607e26b365fca483fc98d4c) switched from DEPLOYING to RUNNING.
2019-08-21 20:00:40,006 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source: Custom Source -> Flat Map -> Filter -> Filter -> Map -> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) -> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (3/4) (4756a0450881325af5fb396a18dbb8ae) switched from DEPLOYING to RUNNING.
2019-08-21 20:00:41,605 INFO  org.apache.flink.runtime.checkpoint.CheckpointCoordinator     - Triggering checkpoint 1 @ 1566388841597 for job e5ea83238d721e932b4645589660406c.
2019-08-21 20:00:42,466 INFO  org.apache.flink.runtime.checkpoint.CheckpointCoordinator     - Completed checkpoint 1 for job e5ea83238d721e932b4645589660406c (7994 bytes in 821 ms).
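
As a rough aid for reading the log above, here is a minimal Python sketch that measures how long one task attempt stayed in each state. It assumes the "switched from X to Y" line format shown in this log. The helper name `time_in_states` is hypothetical, and the three sample lines are copied from the log for attempt 0a206444b10553cce42fa34ea0b15497.

```python
import re
from datetime import datetime

# Three lines copied verbatim from the JobManager log above (one task attempt).
LOG = """\
2019-08-21 20:00:28,291 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print to Std. Out (1/4) (0a206444b10553cce42fa34ea0b15497) switched from CREATED to SCHEDULED.
2019-08-21 20:00:39,670 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print to Std. Out (1/4) (0a206444b10553cce42fa34ea0b15497) switched from SCHEDULED to DEPLOYING.
2019-08-21 20:00:39,991 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print to Std. Out (1/4) (0a206444b10553cce42fa34ea0b15497) switched from DEPLOYING to RUNNING.
"""

# Assumed line shape: "<timestamp> ... (<32-hex attempt id>) switched from A to B."
TRANSITION = re.compile(
    r"^(?P<ts>\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2},\d{3}) .*"
    r"\((?P<attempt>[0-9a-f]{32})\) switched from (?P<src>\w+) to (?P<dst>\w+)\.$"
)


def time_in_states(log_text):
    """Map (attempt id, state) -> seconds spent in that state before leaving it."""
    entered = {}  # (attempt, state) -> time at which the state was entered
    spent = {}
    for line in log_text.splitlines():
        m = TRANSITION.match(line)
        if not m:
            continue  # not a task state transition
        ts = datetime.strptime(m["ts"], "%Y-%m-%d %H:%M:%S,%f")
        attempt, src, dst = m["attempt"], m["src"], m["dst"]
        if (attempt, src) in entered:
            spent[(attempt, src)] = (ts - entered[(attempt, src)]).total_seconds()
        entered[(attempt, dst)] = ts
    return spent


for (attempt, state), seconds in sorted(time_in_states(LOG).items()):
    print(f"{attempt[:8]} {state:<10} {seconds:.3f}s")
```

For this attempt the sketch reports roughly 11.4 s in SCHEDULED (the period in which the log shows containers being requested and TaskManagers registering) and about 0.3 s in DEPLOYING, so nothing in this log accounts for a wait of minutes.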




Re: Flink startup waits 10 minutes

tison
In reply to this post by 守护
Hi,

Judging from your log, from the moment the cluster started

2019-08-21 20:00:25,485 INFO org.apache.flink.runtime.entrypoint.ClusterEntrypoint - Starting YarnJobClusterEntrypoint.

until all tasks were RUNNING

2019-08-21 20:00:40,006 INFO org.apache.flink.runtime.executiongraph.ExecutionGraph - Source: Custom Source -> Flat Map -> Filter -> Filter -> Map -> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) -> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (3/4) (4756a0450881325af5fb396a18dbb8ae) switched from DEPLOYING to RUNNING.
2019-08-21 20:00:41,605 INFO org.apache.flink.runtime.checkpoint.CheckpointCoordinator - Triggering checkpoint 1 @ 1566388841597 for job e5ea83238d721e932b4645589660406c.

only about 16 seconds elapsed in total. Is the 10 minutes you observed measured from the moment you ran flink run? Perhaps network latency is high while the job is being submitted to YARN?
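
The elapsed time can be checked directly from the timestamps quoted above. This is only a small Python sketch; the helper name `elapsed_seconds` is hypothetical, and the timestamp format is assumed from the log lines in this thread.

```python
from datetime import datetime

# Timestamp format assumed from the JobManager log lines in this thread.
LOG_TS_FORMAT = "%Y-%m-%d %H:%M:%S,%f"


def elapsed_seconds(start, end):
    """Return the number of seconds between two log timestamps."""
    t0 = datetime.strptime(start, LOG_TS_FORMAT)
    t1 = datetime.strptime(end, LOG_TS_FORMAT)
    return (t1 - t0).total_seconds()


cluster_start = "2019-08-21 20:00:25,485"     # Starting YarnJobClusterEntrypoint
last_running = "2019-08-21 20:00:40,006"      # last task switched to RUNNING
first_checkpoint = "2019-08-21 20:00:41,605"  # Triggering checkpoint 1

print(elapsed_seconds(cluster_start, last_running))      # start -> all RUNNING
print(elapsed_seconds(cluster_start, first_checkpoint))  # start -> first checkpoint
```

The first difference is about 14.5 s and the second about 16.1 s, which matches the figure of 16 seconds above.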

Best,
tison.


々守护々 <[hidden email]> wrote on Thu, Aug 22, 2019 at 11:18 AM:

> Hello, this is my JobManager startup log. Please take a look, thanks!
>
>
> 2019-08-21 20:00:25,428 INFO
> org.apache.flink.runtime.entrypoint.ClusterEntrypoint         -
> --------------------------------------------------------------------------------
> 2019-08-21 20:00:25,430 INFO
> org.apache.flink.runtime.entrypoint.ClusterEntrypoint         - Registered
> UNIX signal handlers for [TERM, HUP, INT]
> 2019-08-21 20:00:25,433 INFO
> org.apache.flink.runtime.entrypoint.ClusterEntrypoint         - YARN daemon
> is running as: root Yarn client user obtainer: root
> 2019-08-21 20:00:25,437 INFO
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: high-availability.cluster-id,
> application_1566383236573_0003
> 2019-08-21 20:00:25,437 INFO
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: jobmanager.rpc.address, localhost
> 2019-08-21 20:00:25,437 INFO
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: jobmanager.rpc.port, 6123
> 2019-08-21 20:00:25,437 INFO
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: high-availability.zookeeper.path.root, /flink-yarn
> 2019-08-21 20:00:25,437 INFO
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: high-availability.storageDir,
> hdfs://host51:9000/flink/ha-yarn
> 2019-08-21 20:00:25,438 INFO
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: high-availability.zookeeper.quorum, host51:2181
> 2019-08-21 20:00:25,438 INFO
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: internal.cluster.execution-mode, DETACHED
> 2019-08-21 20:00:25,438 INFO
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: high-availability, zookeeper
> 2019-08-21 20:00:25,438 INFO
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: parallelism.default, 1
> 2019-08-21 20:00:25,438 INFO
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: yarn.application-attempts, 10
> 2019-08-21 20:00:25,438 INFO
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: taskmanager.numberOfTaskSlots, 2
> 2019-08-21 20:00:25,439 INFO
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: taskmanager.heap.size, 8192m
> 2019-08-21 20:00:25,439 INFO
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: jobmanager.heap.size, 2048m
> 2019-08-21 20:00:25,465 WARN
> org.apache.flink.configuration.Configuration                  - Config uses
> deprecated configuration key 'web.port' instead of proper key
> 'rest.bind-port'
> 2019-08-21 20:00:25,469 INFO
> org.apache.flink.runtime.clusterframework.BootstrapTools      - Setting
> directories for temporary files to:
> /usr/hadoop/hadoop-2.7.7/tmp/nm-local-dir/usercache/root/appcache/application_1566383236573_0003
> 2019-08-21 20:00:25,485 INFO
> org.apache.flink.runtime.entrypoint.ClusterEntrypoint         - Starting
> YarnJobClusterEntrypoint.
> 2019-08-21 20:00:25,485 INFO
> org.apache.flink.runtime.entrypoint.ClusterEntrypoint         - Install
> default filesystem.
> 2019-08-21 20:00:25,561 INFO
> org.apache.flink.runtime.security.modules.HadoopModule        - Hadoop user
> set to root (auth:SIMPLE)
> 2019-08-21 20:00:25,580 INFO
> org.apache.flink.runtime.entrypoint.ClusterEntrypoint         -
> Initializing cluster services.
> 2019-08-21 20:00:25,757 INFO
> org.apache.flink.runtime.rpc.akka.AkkaRpcServiceUtils         - Trying to
> start actor system at host53:0
> 2019-08-21 20:00:26,187 INFO  akka.event.slf4j.Slf4jLogger
>                   - Slf4jLogger started
> 2019-08-21 20:00:26,251 INFO  akka.remote.Remoting
>                   - Starting remoting
> 2019-08-21 20:00:26,439 INFO  akka.remote.Remoting
>                   - Remoting started; listening on addresses
> :[akka.tcp://flink@host53:59609]
> 2019-08-21 20:00:26,447 INFO
> org.apache.flink.runtime.rpc.akka.AkkaRpcServiceUtils         - Actor
> system started at akka.tcp://flink@host53:59609
> 2019-08-21 20:00:27,011 INFO
> org.apache.flink.runtime.blob.FileSystemBlobStore             - Creating
> highly available BLOB storage directory at
> hdfs://host51:9000/flink/ha-yarn/application_1566383236573_0003/blob
> 2019-08-21 20:00:27,097 INFO
> org.apache.flink.runtime.util.ZooKeeperUtils                  - Enforcing
> default ACL for ZK connections
> 2019-08-21 20:00:27,097 INFO
> org.apache.flink.runtime.util.ZooKeeperUtils                  - Using
> '/flink-yarn/application_1566383236573_0003' as Zookeeper namespace.
> 2019-08-21 20:00:27,161 INFO
> org.apache.flink.shaded.curator.org.apache.curator.framework.imps.CuratorFrameworkImpl
> - Starting
> 2019-08-21 20:00:27,169 INFO
> org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client
> environment:zookeeper.version=3.4.10-39d3a4f269333c922ed3db283be479f9deacaa0f,
> built on 03/23/2017 10:13 GMT
> 2019-08-21 20:00:27,169 INFO
> org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client
> environment:host.name=host53
> 2019-08-21 20:00:27,169 INFO
> org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client
> environment:java.version=1.8.0_77
> 2019-08-21 20:00:27,169 INFO
> org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client
> environment:java.vendor=Oracle Corporation
> 2019-08-21 20:00:27,169 INFO
> org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client
> 2019-08-21 20:00:27,170 INFO
> org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client
> environment:java.library.path=:/usr/hadoop/hadoop-2.7.7/lib/native:/usr/hadoop/hadoop-2.7.7/lib/native:/usr/java/packages/lib/amd64:/usr/lib64:/lib64:/lib:/usr/lib
> 2019-08-21 20:00:27,170 INFO
> org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client
> environment:java.io.tmpdir=/tmp
> 2019-08-21 20:00:27,170 INFO
> org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client
> environment:java.compiler=<NA>
> 2019-08-21 20:00:27,170 INFO
> org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client
> environment:os.name=Linux
> 2019-08-21 20:00:27,170 INFO
> org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client
> environment:os.arch=amd64
> 2019-08-21 20:00:27,170 INFO
> org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client
> environment:os.version=3.16.0-0.bpo.4-amd64
> 2019-08-21 20:00:27,170 INFO
> org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client
> environment:user.name=root
> 2019-08-21 20:00:27,170 INFO
> org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client
> environment:user.home=/root
> 2019-08-21 20:00:27,170 INFO
> org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client
> environment:user.dir=/usr/hadoop/hadoop-2.7.7/tmp/nm-local-dir/usercache/root/appcache/application_1566383236573_0003/container_1566383236573_0003_01_000001
> 2019-08-21 20:00:27,171 INFO
> org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  -
> Initiating client connection, connectString=host51:2181
> sessionTimeout=60000
> watcher=org.apache.flink.shaded.curator.org.apache.curator.ConnectionState@319854f0
> 2019-08-21 20:00:27,184 WARN
> org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ClientCnxn  - SASL
> configuration failed: javax.security.auth.login.LoginException: No JAAS
> configuration section named 'Client' was found in specified JAAS
> configuration file: '/tmp/jaas-4274640356638794035.conf'. Will continue
> connection to Zookeeper server without SASL authentication, if Zookeeper
> server allows it.
> 2019-08-21 20:00:27,186 INFO
> org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ClientCnxn  -
> Opening socket connection to server host51/10.10.10.51:2181
> 2019-08-21 20:00:27,187 ERROR
> org.apache.flink.shaded.curator.org.apache.curator.ConnectionState  -
> Authentication failed
> 2019-08-21 20:00:27,187 INFO
> org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ClientCnxn  - Socket
> connection established to host51/10.10.10.51:2181, initiating session
> 2019-08-21 20:00:27,193 INFO
> org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ClientCnxn  -
> Session establishment complete on server host51/10.10.10.51:2181,
> sessionid = 0x1001a86de810008, negotiated timeout = 40000
> 2019-08-21 20:00:27,193 INFO  org.apache.flink.runtime.blob.BlobServer
>                   - Created BLOB server storage directory
> /usr/hadoop/hadoop-2.7.7/tmp/nm-local-dir/usercache/root/appcache/application_1566383236573_0003/blobStore-eaa2bb2b-cddb-42e3-8ce5-bf2f83e8567a
> 2019-08-21 20:00:27,195 INFO
> org.apache.flink.shaded.curator.org.apache.curator.framework.state.ConnectionStateManager
> - State change: CONNECTED
> 2019-08-21 20:00:27,198 INFO  org.apache.flink.runtime.blob.BlobServer
>                   - Started BLOB server at 0.0.0.0:58591 - max concurrent
> requests: 50 - max backlog: 1000
> 2019-08-21 20:00:27,213 INFO
> org.apache.flink.runtime.metrics.MetricRegistryImpl           - No metrics
> reporter configured, no metrics will be exposed/reported.
> 2019-08-21 20:00:27,215 INFO
> org.apache.flink.runtime.entrypoint.ClusterEntrypoint         - Trying to
> start actor system at host53:0
> 2019-08-21 20:00:27,239 INFO  akka.event.slf4j.Slf4jLogger
>                   - Slf4jLogger started
> 2019-08-21 20:00:27,247 INFO  akka.remote.Remoting
>                   - Starting remoting
> 2019-08-21 20:00:27,255 INFO  akka.remote.Remoting
>                   - Remoting started; listening on addresses
> :[akka.tcp://flink-metrics@host53:59941]
> 2019-08-21 20:00:27,257 INFO
> org.apache.flink.runtime.entrypoint.ClusterEntrypoint         - Actor
> system started at akka.tcp://flink-metrics@host53:59941
> 2019-08-21 20:00:27,263 INFO
> org.apache.flink.runtime.blob.TransientBlobCache              - Created
> BLOB cache storage directory
> /usr/hadoop/hadoop-2.7.7/tmp/nm-local-dir/usercache/root/appcache/application_1566383236573_0003/blobStore-405a4797-824f-48da-bb50-2ee2e230d249
> 2019-08-21 20:00:27,296 WARN
> org.apache.flink.configuration.Configuration                  - Config uses
> deprecated configuration key 'web.port' instead of proper key
> 'rest.bind-port'
> 2019-08-21 20:00:27,298 WARN
> org.apache.flink.runtime.jobmaster.MiniDispatcherRestEndpoint  - Upload
> directory
> /tmp/flink-web-57c9cddd-64af-4e3d-96a9-4f9e78d20b86/flink-web-upload does
> not exist, or has been deleted externally. Previously uploaded files are no
> longer available.
> 2019-08-21 20:00:27,298 INFO
> org.apache.flink.runtime.jobmaster.MiniDispatcherRestEndpoint  - Created
> directory
> /tmp/flink-web-57c9cddd-64af-4e3d-96a9-4f9e78d20b86/flink-web-upload for
> file uploads.
> 2019-08-21 20:00:27,320 INFO
> org.apache.flink.runtime.jobmaster.MiniDispatcherRestEndpoint  - Starting
> rest endpoint.
> 2019-08-21 20:00:27,616 INFO
> org.apache.flink.runtime.webmonitor.WebMonitorUtils           - Determined
> location of main cluster component log file:
> /usr/hadoop/hadoop-2.7.7/logs/userlogs/application_1566383236573_0003/container_1566383236573_0003_01_000001/jobmanager.log
> 2019-08-21 20:00:27,616 INFO
> org.apache.flink.runtime.webmonitor.WebMonitorUtils           - Determined
> location of main cluster component stdout file:
> /usr/hadoop/hadoop-2.7.7/logs/userlogs/application_1566383236573_0003/container_1566383236573_0003_01_000001/jobmanager.out
> 2019-08-21 20:00:27,771 INFO
> org.apache.flink.runtime.jobmaster.MiniDispatcherRestEndpoint  - Rest
> endpoint listening at host53:46401
> 2019-08-21 20:00:27,772 INFO
> org.apache.flink.runtime.leaderelection.ZooKeeperLeaderElectionService  -
> Starting ZooKeeperLeaderElectionService
> ZooKeeperLeaderElectionService{leaderPath='/leader/rest_server_lock'}.
> 2019-08-21 20:00:27,795 WARN
> org.apache.flink.shaded.curator.org.apache.curator.utils.ZKPaths  - The
> version of ZooKeeper being used doesn't support Container nodes.
> CreateMode.PERSISTENT will be used instead.
> 2019-08-21 20:00:27,807 INFO
> org.apache.flink.runtime.jobmaster.MiniDispatcherRestEndpoint  - Web
> frontend listening at http://host53:46401.
> 2019-08-21 20:00:27,836 INFO
> org.apache.flink.runtime.jobmaster.MiniDispatcherRestEndpoint  -
> http://host53:46401 was granted leadership with
> leaderSessionID=8487757f-fd82-42b2-a1a6-adcc69e1eba1
> 2019-08-21 20:00:27,932 INFO
> org.apache.flink.runtime.rpc.akka.AkkaRpcService              - Starting
> RPC endpoint for org.apache.flink.yarn.YarnResourceManager at
> akka://flink/user/resourcemanager .
> 2019-08-21 20:00:28,021 INFO
> org.apache.flink.runtime.rpc.akka.AkkaRpcService              - Starting
> RPC endpoint for org.apache.flink.runtime.dispatcher.MiniDispatcher at
> akka://flink/user/dispatcher .
> 2019-08-21 20:00:28,033 INFO
> org.apache.flink.runtime.leaderretrieval.ZooKeeperLeaderRetrievalService  -
> Starting ZooKeeperLeaderRetrievalService /leader/resource_manager_lock.
> 2019-08-21 20:00:28,033 INFO
> org.apache.flink.runtime.leaderretrieval.ZooKeeperLeaderRetrievalService  -
> Starting ZooKeeperLeaderRetrievalService /leader/dispatcher_lock.
> 2019-08-21 20:00:28,033 INFO
> org.apache.flink.runtime.leaderelection.ZooKeeperLeaderElectionService  -
> Starting ZooKeeperLeaderElectionService
> ZooKeeperLeaderElectionService{leaderPath='/leader/dispatcher_lock'}.
> 2019-08-21 20:00:28,045 INFO
> org.apache.flink.runtime.dispatcher.MiniDispatcher            - Dispatcher
> akka.tcp://flink@host53:59609/user/dispatcher was granted leadership with
> fencing token 6216d17e-56b1-43bb-9bb3-f9fa5b8921be
> 2019-08-21 20:00:28,048 INFO
> org.apache.flink.runtime.dispatcher.MiniDispatcher            - Recovering
> all persisted jobs.
> 2019-08-21 20:00:28,054 INFO  org.apache.hadoop.yarn.client.RMProxy
>                  - Connecting to ResourceManager at
> host51/10.10.10.51:8030
> 2019-08-21 20:00:28,078 INFO
> org.apache.flink.runtime.rpc.akka.AkkaRpcService              - Starting
> RPC endpoint for org.apache.flink.runtime.jobmaster.JobMaster at
> akka://flink/user/jobmanager_0 .
> 2019-08-21 20:00:28,086 INFO
> org.apache.flink.runtime.jobmaster.JobMaster                  -
> Initializing job StreamKafkaMain (e5ea83238d721e932b4645589660406c).
> 2019-08-21 20:00:28,099 INFO
> org.apache.flink.runtime.jobmaster.JobMaster                  - Using
> restart strategy
> FixedDelayRestartStrategy(maxNumberRestartAttempts=2147483647,
> delayBetweenRestartAttempts=0) for StreamKafkaMain
> (e5ea83238d721e932b4645589660406c).
> 2019-08-21 20:00:28,122 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Job
> recovers via failover strategy: full graph restart
> 2019-08-21 20:00:28,153 INFO
> org.apache.flink.runtime.jobmaster.JobMaster                  - Running
> initialization on master for job StreamKafkaMain
> (e5ea83238d721e932b4645589660406c).
> 2019-08-21 20:00:28,154 INFO
> org.apache.flink.runtime.jobmaster.JobMaster                  -
> Successfully ran initialization on master in 0 ms.
> 2019-08-21 20:00:28,187 INFO
> org.apache.flink.runtime.util.ZooKeeperUtils                  - Initialized
> ZooKeeperCompletedCheckpointStore in
> '/checkpoints/e5ea83238d721e932b4645589660406c'.
> 2019-08-21 20:00:28,196 INFO  org.apache.flink.yarn.YarnResourceManager
>                  - Recovered 0 containers from previous attempts ([]).
> 2019-08-21 20:00:28,199 INFO
> org.apache.hadoop.yarn.client.api.impl.ContainerManagementProtocolProxy  -
> yarn.client.max-cached-nodemanagers-proxies : 0
> 2019-08-21 20:00:28,200 INFO
> org.apache.flink.runtime.leaderelection.ZooKeeperLeaderElectionService  -
> Starting ZooKeeperLeaderElectionService
> ZooKeeperLeaderElectionService{leaderPath='/leader/resource_manager_lock'}.
> 2019-08-21 20:00:28,203 INFO
> org.apache.flink.runtime.jobmaster.JobMaster                  - No state
> backend has been configured, using default (Memory / JobManager)
> MemoryStateBackend (data in heap memory / checkpoints to JobManager)
> (checkpoints: 'null', savepoints: 'null', asynchronous: TRUE, maxStateSize:
> 5242880)
> 2019-08-21 20:00:28,211 INFO  org.apache.flink.yarn.YarnResourceManager
>                  - ResourceManager akka.tcp://flink@host53:59609/user/resourcemanager
> was granted leadership with fencing token 97537c0d61bbf091e37cb6c45b784568
> 2019-08-21 20:00:28,214 INFO
> org.apache.flink.runtime.resourcemanager.slotmanager.SlotManager  -
> Starting the SlotManager.
> 2019-08-21 20:00:28,225 INFO
> org.apache.flink.runtime.checkpoint.ZooKeeperCompletedCheckpointStore  -
> Recovering checkpoints from ZooKeeper.
> 2019-08-21 20:00:28,229 INFO
> org.apache.flink.runtime.checkpoint.ZooKeeperCompletedCheckpointStore  -
> Found 0 checkpoints in ZooKeeper.
> 2019-08-21 20:00:28,229 INFO
> org.apache.flink.runtime.checkpoint.ZooKeeperCompletedCheckpointStore  -
> Trying to fetch 0 checkpoints from storage.
> 2019-08-21 20:00:28,233 INFO
> org.apache.flink.runtime.leaderelection.ZooKeeperLeaderElectionService  -
> Starting ZooKeeperLeaderElectionService
> ZooKeeperLeaderElectionService{leaderPath='/leader/e5ea83238d721e932b4645589660406c/job_manager_lock'}.
> 2019-08-21 20:00:28,248 INFO
> org.apache.flink.runtime.jobmaster.JobManagerRunner           - JobManager
> runner for job StreamKafkaMain (e5ea83238d721e932b4645589660406c) was
> granted leadership with session id 12c96c94-ebbd-4991-a457-6f0320d71bc1 at
> akka.tcp://flink@host53:59609/user/jobmanager_0.
> 2019-08-21 20:00:28,260 INFO
> org.apache.flink.runtime.leaderretrieval.ZooKeeperLeaderRetrievalService  -
> Starting ZooKeeperLeaderRetrievalService /leader/resource_manager_lock.
> 2019-08-21 20:00:28,260 INFO
> org.apache.flink.runtime.jobmaster.JobMaster                  - Starting
> execution of job StreamKafkaMain (e5ea83238d721e932b4645589660406c) under
> job master id a4576f0320d71bc112c96c94ebbd4991.
> 2019-08-21 20:00:28,261 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Job
> StreamKafkaMain (e5ea83238d721e932b4645589660406c) switched from state
> CREATED to RUNNING.
> 2019-08-21 20:00:28,266 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source:
> Custom Source -> Flat Map -> Filter -> Filter -> Map ->
> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) ->
> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (1/4)
> (ee39866835f7b7387f518e352787a532) switched from CREATED to SCHEDULED.
> 2019-08-21 20:00:28,280 INFO
> org.apache.flink.runtime.jobmaster.slotpool.SlotPoolImpl      - Cannot
> serve slot request, no ResourceManager connected. Adding as pending request
> [SlotRequestId{f6123820ffb8b3e906b876e9dea4497b}]
> 2019-08-21 20:00:28,287 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source:
> Custom Source -> Flat Map -> Filter -> Filter -> Map ->
> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) ->
> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (2/4)
> (f1a893fc5e2440331c8e6db1874ad220) switched from CREATED to SCHEDULED.
> 2019-08-21 20:00:28,288 INFO
> org.apache.flink.runtime.jobmaster.slotpool.SlotPoolImpl      - Cannot
> serve slot request, no ResourceManager connected. Adding as pending request
> [SlotRequestId{fdbb2e71c9bc795452687396abff7bf8}]
> 2019-08-21 20:00:28,288 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source:
> Custom Source -> Flat Map -> Filter -> Filter -> Map ->
> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) ->
> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (3/4)
> (4756a0450881325af5fb396a18dbb8ae) switched from CREATED to SCHEDULED.
> 2019-08-21 20:00:28,289 INFO
> org.apache.flink.runtime.jobmaster.slotpool.SlotPoolImpl      - Cannot
> serve slot request, no ResourceManager connected. Adding as pending request
> [SlotRequestId{13970fab9312acf9d5063b1d1a33aa0e}]
> 2019-08-21 20:00:28,289 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source:
> Custom Source -> Flat Map -> Filter -> Filter -> Map ->
> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) ->
> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (4/4)
> (78f520582607e26b365fca483fc98d4c) switched from CREATED to SCHEDULED.
> 2019-08-21 20:00:28,289 INFO
> org.apache.flink.runtime.jobmaster.slotpool.SlotPoolImpl      - Cannot
> serve slot request, no ResourceManager connected. Adding as pending request
> [SlotRequestId{0621470f796aa3ea4291d1b8b423e428}]
> 2019-08-21 20:00:28,289 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - window:
> (TumblingGroupWindow('w$, 'rowtime, 60000.millis)), select: (COUNT(*) AS
> pv, COUNT(DISTINCT curuserid) AS uv, start('w$) AS w$start, end('w$) AS
> w$end, rowtime('w$) AS w$rowtime, proctime('w$) AS w$proctime) -> select:
> (pv, uv, utc2local(w$end) AS EXPR$2) -> to: Row (1/1)
> (0071e1c7c4f70646914329f81dcbd349) switched from CREATED to SCHEDULED.
> 2019-08-21 20:00:28,291 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print
> to Std. Out (1/4) (0a206444b10553cce42fa34ea0b15497) switched from CREATED
> to SCHEDULED.
> 2019-08-21 20:00:28,291 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print
> to Std. Out (2/4) (a5d1bb759ee6c8001313fe39c7982a8b) switched from CREATED
> to SCHEDULED.
> 2019-08-21 20:00:28,291 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print
> to Std. Out (3/4) (9543bbde6538ea9d407af3012fede5c8) switched from CREATED
> to SCHEDULED.
> 2019-08-21 20:00:28,291 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print
> to Std. Out (4/4) (0e8c4210bd81e581fbf354b86397ce41) switched from CREATED
> to SCHEDULED.
> 2019-08-21 20:00:28,291 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map ->
> Sink: Print to Std. Out (1/4) (39d437bceb033f5bd37747e9142c7f0e) switched
> from CREATED to SCHEDULED.
> 2019-08-21 20:00:28,291 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map ->
> Sink: Print to Std. Out (2/4) (aa4f64edc4f10da2e8bdbf6ebf0aeb83) switched
> from CREATED to SCHEDULED.
> 2019-08-21 20:00:28,291 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map ->
> Sink: Print to Std. Out (3/4) (f109918ce195d1c88c2c63a44934217f) switched
> from CREATED to SCHEDULED.
> 2019-08-21 20:00:28,292 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map ->
> Sink: Print to Std. Out (4/4) (9aa5bb28f3ba75afc1de47bd9a4e5d01) switched
> from CREATED to SCHEDULED.
> 2019-08-21 20:00:28,295 INFO
> org.apache.flink.runtime.jobmaster.JobMaster                  - Connecting
> to ResourceManager
> akka.tcp://flink@host53:59609/user/resourcemanager(97537c0d61bbf091e37cb6c45b784568)
> 2019-08-21 20:00:28,300 INFO
> org.apache.flink.runtime.jobmaster.JobMaster                  - Resolved
> ResourceManager address, beginning registration
> 2019-08-21 20:00:28,300 INFO
> org.apache.flink.runtime.jobmaster.JobMaster                  -
> Registration at ResourceManager attempt 1 (timeout=100ms)
> 2019-08-21 20:00:28,304 INFO
> org.apache.flink.runtime.leaderretrieval.ZooKeeperLeaderRetrievalService  -
> Starting ZooKeeperLeaderRetrievalService
> /leader/e5ea83238d721e932b4645589660406c/job_manager_lock.
> 2019-08-21 20:00:28,304 INFO  org.apache.flink.yarn.YarnResourceManager
>                  - Registering job manager
> a4576f0320d71bc112c96c94ebbd4991@akka.tcp://flink@host53:59609/user/jobmanager_0
> for job e5ea83238d721e932b4645589660406c.
> 2019-08-21 20:00:28,310 INFO  org.apache.flink.yarn.YarnResourceManager
>                  - Registered job manager
> a4576f0320d71bc112c96c94ebbd4991@akka.tcp://flink@host53:59609/user/jobmanager_0
> for job e5ea83238d721e932b4645589660406c.
> 2019-08-21 20:00:28,313 INFO
> org.apache.flink.runtime.jobmaster.JobMaster                  - JobManager
> successfully registered at ResourceManager, leader id:
> 97537c0d61bbf091e37cb6c45b784568.
> 2019-08-21 20:00:28,314 INFO
> org.apache.flink.runtime.jobmaster.slotpool.SlotPoolImpl      - Requesting
> new slot [SlotRequestId{13970fab9312acf9d5063b1d1a33aa0e}] and profile
> ResourceProfile{cpuCores=-1.0, heapMemoryInMB=-1, directMemoryInMB=0,
> nativeMemoryInMB=0, networkMemoryInMB=0} from resource manager.
> 2019-08-21 20:00:28,315 INFO  org.apache.flink.yarn.YarnResourceManager
>                  - Request slot with profile ResourceProfile{cpuCores=-1.0,
> heapMemoryInMB=-1, directMemoryInMB=0, nativeMemoryInMB=0,
> networkMemoryInMB=0} for job e5ea83238d721e932b4645589660406c with
> allocation id faa6a521758e5be4b3dc1955ca0f51b4.
> 2019-08-21 20:00:28,316 INFO
> org.apache.flink.runtime.jobmaster.slotpool.SlotPoolImpl      - Requesting
> new slot [SlotRequestId{fdbb2e71c9bc795452687396abff7bf8}] and profile
> ResourceProfile{cpuCores=-1.0, heapMemoryInMB=-1, directMemoryInMB=0,
> nativeMemoryInMB=0, networkMemoryInMB=0} from resource manager.
> 2019-08-21 20:00:28,316 INFO
> org.apache.flink.runtime.jobmaster.slotpool.SlotPoolImpl      - Requesting
> new slot [SlotRequestId{0621470f796aa3ea4291d1b8b423e428}] and profile
> ResourceProfile{cpuCores=-1.0, heapMemoryInMB=-1, directMemoryInMB=0,
> nativeMemoryInMB=0, networkMemoryInMB=0} from resource manager.
> 2019-08-21 20:00:28,316 INFO
> org.apache.flink.runtime.jobmaster.slotpool.SlotPoolImpl      - Requesting
> new slot [SlotRequestId{f6123820ffb8b3e906b876e9dea4497b}] and profile
> ResourceProfile{cpuCores=-1.0, heapMemoryInMB=-1, directMemoryInMB=0,
> nativeMemoryInMB=0, networkMemoryInMB=0} from resource manager.
> 2019-08-21 20:00:28,321 INFO  org.apache.flink.yarn.YarnResourceManager
>                  - Requesting new TaskExecutor container with resources
> <memory:8192, vCores:2>. Number pending requests 1.
> 2019-08-21 20:00:28,322 INFO  org.apache.flink.yarn.YarnResourceManager
>                  - Request slot with profile ResourceProfile{cpuCores=-1.0,
> heapMemoryInMB=-1, directMemoryInMB=0, nativeMemoryInMB=0,
> networkMemoryInMB=0} for job e5ea83238d721e932b4645589660406c with
> allocation id a2b419aea4a373686eeed6c46163a528.
> 2019-08-21 20:00:28,323 INFO  org.apache.flink.yarn.YarnResourceManager
>                  - Request slot with profile ResourceProfile{cpuCores=-1.0,
> heapMemoryInMB=-1, directMemoryInMB=0, nativeMemoryInMB=0,
> networkMemoryInMB=0} for job e5ea83238d721e932b4645589660406c with
> allocation id cbc11737ce6729cfe45f24c43180b5c8.
> 2019-08-21 20:00:28,323 INFO  org.apache.flink.yarn.YarnResourceManager
>                  - Requesting new TaskExecutor container with resources
> <memory:8192, vCores:2>. Number pending requests 2.
> 2019-08-21 20:00:28,323 INFO  org.apache.flink.yarn.YarnResourceManager
>                  - Request slot with profile ResourceProfile{cpuCores=-1.0,
> heapMemoryInMB=-1, directMemoryInMB=0, nativeMemoryInMB=0,
> networkMemoryInMB=0} for job e5ea83238d721e932b4645589660406c with
> allocation id f6566567c173a55b2bc52b72bcb24ff8.
> 2019-08-21 20:00:31,597 INFO
> org.apache.flink.runtime.checkpoint.CheckpointCoordinator     - Checkpoint
> triggering task Source: Custom Source -> Flat Map -> Filter -> Filter ->
> Map -> Timestamps/Watermarks -> from: (request, curuserid, timelong,
> rowtime) -> select: (rowtime, 0 AS $f1, curuserid) -> time attribute:
> (rowtime) (1/4) of job e5ea83238d721e932b4645589660406c is not in state
> RUNNING but SCHEDULED instead. Aborting checkpoint.
> 2019-08-21 20:00:33,749 INFO
> org.apache.hadoop.yarn.client.api.impl.AMRMClientImpl         - Received
> new token for : host53:59938
> 2019-08-21 20:00:33,749 INFO
> org.apache.hadoop.yarn.client.api.impl.AMRMClientImpl         - Received
> new token for : host52:33246
> 2019-08-21 20:00:33,765 INFO  org.apache.flink.yarn.YarnResourceManager
>                  - Received new container:
> container_1566383236573_0003_01_000002 - Remaining pending container
> requests: 2
> 2019-08-21 20:00:33,765 INFO  org.apache.flink.yarn.YarnResourceManager
>                  - Removing container request Capability[<memory:8192,
> vCores:2>]Priority[1]. Pending container requests 1.
> 2019-08-21 20:00:34,009 INFO  org.apache.flink.yarn.YarnResourceManager
>                  - Creating container launch context for TaskManagers
> 2019-08-21 20:00:34,010 INFO  org.apache.flink.yarn.YarnResourceManager
>                  - Starting TaskManagers
> 2019-08-21 20:00:34,021 INFO
> org.apache.hadoop.yarn.client.api.impl.ContainerManagementProtocolProxy  -
> Opening proxy : host53:59938
> 2019-08-21 20:00:34,066 INFO  org.apache.flink.yarn.YarnResourceManager
>                  - Received new container:
> container_1566383236573_0003_01_000003 - Remaining pending container
> requests: 1
> 2019-08-21 20:00:34,066 INFO  org.apache.flink.yarn.YarnResourceManager
>                  - Removing container request Capability[<memory:8192,
> vCores:2>]Priority[1]. Pending container requests 0.
> 2019-08-21 20:00:34,088 INFO  org.apache.flink.yarn.YarnResourceManager
>                  - Creating container launch context for TaskManagers
> 2019-08-21 20:00:34,089 INFO  org.apache.flink.yarn.YarnResourceManager
>                  - Starting TaskManagers
> 2019-08-21 20:00:34,090 INFO
> org.apache.hadoop.yarn.client.api.impl.ContainerManagementProtocolProxy  -
> Opening proxy : host52:33246
> 2019-08-21 20:00:36,597 INFO
> org.apache.flink.runtime.checkpoint.CheckpointCoordinator     - Checkpoint
> triggering task Source: Custom Source -> Flat Map -> Filter -> Filter ->
> Map -> Timestamps/Watermarks -> from: (request, curuserid, timelong,
> rowtime) -> select: (rowtime, 0 AS $f1, curuserid) -> time attribute:
> (rowtime) (1/4) of job e5ea83238d721e932b4645589660406c is not in state
> RUNNING but SCHEDULED instead. Aborting checkpoint.
> 2019-08-21 20:00:37,856 INFO  org.apache.flink.yarn.YarnResourceManager
>                  - Registering TaskManager with ResourceID
> container_1566383236573_0003_01_000002 (akka.tcp://flink@host53:34608/user/taskmanager_0)
> at ResourceManager
> 2019-08-21 20:00:39,573 INFO  org.apache.flink.yarn.YarnResourceManager
>                  - Registering TaskManager with ResourceID
> container_1566383236573_0003_01_000003 (akka.tcp://flink@host52:32931/user/taskmanager_0)
> at ResourceManager
> 2019-08-21 20:00:39,659 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source:
> Custom Source -> Flat Map -> Filter -> Filter -> Map ->
> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) ->
> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (1/4)
> (ee39866835f7b7387f518e352787a532) switched from SCHEDULED to DEPLOYING.
> 2019-08-21 20:00:39,659 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying
> Source: Custom Source -> Flat Map -> Filter -> Filter -> Map ->
> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) ->
> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (1/4)
> (attempt #0) to container_1566383236573_0003_01_000002 @ host53
> (dataPort=52456)
> 2019-08-21 20:00:39,664 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source:
> Custom Source -> Flat Map -> Filter -> Filter -> Map ->
> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) ->
> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (2/4)
> (f1a893fc5e2440331c8e6db1874ad220) switched from SCHEDULED to DEPLOYING.
> 2019-08-21 20:00:39,664 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying
> Source: Custom Source -> Flat Map -> Filter -> Filter -> Map ->
> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) ->
> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (2/4)
> (attempt #0) to container_1566383236573_0003_01_000002 @ host53
> (dataPort=52456)
> 2019-08-21 20:00:39,665 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source:
> Custom Source -> Flat Map -> Filter -> Filter -> Map ->
> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) ->
> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (3/4)
> (4756a0450881325af5fb396a18dbb8ae) switched from SCHEDULED to DEPLOYING.
> 2019-08-21 20:00:39,666 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying
> Source: Custom Source -> Flat Map -> Filter -> Filter -> Map ->
> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) ->
> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (3/4)
> (attempt #0) to container_1566383236573_0003_01_000003 @ host52
> (dataPort=39650)
> 2019-08-21 20:00:39,666 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source:
> Custom Source -> Flat Map -> Filter -> Filter -> Map ->
> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) ->
> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (4/4)
> (78f520582607e26b365fca483fc98d4c) switched from SCHEDULED to DEPLOYING.
> 2019-08-21 20:00:39,666 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying
> Source: Custom Source -> Flat Map -> Filter -> Filter -> Map ->
> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) ->
> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (4/4)
> (attempt #0) to container_1566383236573_0003_01_000003 @ host52
> (dataPort=39650)
> 2019-08-21 20:00:39,666 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - window:
> (TumblingGroupWindow('w$, 'rowtime, 60000.millis)), select: (COUNT(*) AS
> pv, COUNT(DISTINCT curuserid) AS uv, start('w$) AS w$start, end('w$) AS
> w$end, rowtime('w$) AS w$rowtime, proctime('w$) AS w$proctime) -> select:
> (pv, uv, utc2local(w$end) AS EXPR$2) -> to: Row (1/1)
> (0071e1c7c4f70646914329f81dcbd349) switched from SCHEDULED to DEPLOYING.
> 2019-08-21 20:00:39,666 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying
> window: (TumblingGroupWindow('w$, 'rowtime, 60000.millis)), select:
> (COUNT(*) AS pv, COUNT(DISTINCT curuserid) AS uv, start('w$) AS w$start,
> end('w$) AS w$end, rowtime('w$) AS w$rowtime, proctime('w$) AS w$proctime)
> -> select: (pv, uv, utc2local(w$end) AS EXPR$2) -> to: Row (1/1) (attempt
> #0) to container_1566383236573_0003_01_000002 @ host53 (dataPort=52456)
> 2019-08-21 20:00:39,670 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print
> to Std. Out (1/4) (0a206444b10553cce42fa34ea0b15497) switched from
> SCHEDULED to DEPLOYING.
> 2019-08-21 20:00:39,670 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying
> Sink: Print to Std. Out (1/4) (attempt #0) to
> container_1566383236573_0003_01_000003 @ host52 (dataPort=39650)
> 2019-08-21 20:00:39,671 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print
> to Std. Out (2/4) (a5d1bb759ee6c8001313fe39c7982a8b) switched from
> SCHEDULED to DEPLOYING.
> 2019-08-21 20:00:39,672 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying
> Sink: Print to Std. Out (2/4) (attempt #0) to
> container_1566383236573_0003_01_000003 @ host52 (dataPort=39650)
> 2019-08-21 20:00:39,672 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print
> to Std. Out (3/4) (9543bbde6538ea9d407af3012fede5c8) switched from
> SCHEDULED to DEPLOYING.
> 2019-08-21 20:00:39,672 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying
> Sink: Print to Std. Out (3/4) (attempt #0) to
> container_1566383236573_0003_01_000002 @ host53 (dataPort=52456)
> 2019-08-21 20:00:39,672 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print
> to Std. Out (4/4) (0e8c4210bd81e581fbf354b86397ce41) switched from
> SCHEDULED to DEPLOYING.
> 2019-08-21 20:00:39,673 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying
> Sink: Print to Std. Out (4/4) (attempt #0) to
> container_1566383236573_0003_01_000002 @ host53 (dataPort=52456)
> 2019-08-21 20:00:39,673 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map ->
> Sink: Print to Std. Out (1/4) (39d437bceb033f5bd37747e9142c7f0e) switched
> from SCHEDULED to DEPLOYING.
> 2019-08-21 20:00:39,673 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying
> Map -> Sink: Print to Std. Out (1/4) (attempt #0) to
> container_1566383236573_0003_01_000003 @ host52 (dataPort=39650)
> 2019-08-21 20:00:39,675 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map ->
> Sink: Print to Std. Out (2/4) (aa4f64edc4f10da2e8bdbf6ebf0aeb83) switched
> from SCHEDULED to DEPLOYING.
> 2019-08-21 20:00:39,675 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying
> Map -> Sink: Print to Std. Out (2/4) (attempt #0) to
> container_1566383236573_0003_01_000003 @ host52 (dataPort=39650)
> 2019-08-21 20:00:39,675 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map ->
> Sink: Print to Std. Out (3/4) (f109918ce195d1c88c2c63a44934217f) switched
> from SCHEDULED to DEPLOYING.
> 2019-08-21 20:00:39,675 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying
> Map -> Sink: Print to Std. Out (3/4) (attempt #0) to
> container_1566383236573_0003_01_000002 @ host53 (dataPort=52456)
> 2019-08-21 20:00:39,676 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map ->
> Sink: Print to Std. Out (4/4) (9aa5bb28f3ba75afc1de47bd9a4e5d01) switched
> from SCHEDULED to DEPLOYING.
> 2019-08-21 20:00:39,676 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying
> Map -> Sink: Print to Std. Out (4/4) (attempt #0) to
> container_1566383236573_0003_01_000002 @ host53 (dataPort=52456)
> 2019-08-21 20:00:39,782 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print
> to Std. Out (4/4) (0e8c4210bd81e581fbf354b86397ce41) switched from
> DEPLOYING to RUNNING.
> 2019-08-21 20:00:39,783 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print
> to Std. Out (3/4) (9543bbde6538ea9d407af3012fede5c8) switched from
> DEPLOYING to RUNNING.
> 2019-08-21 20:00:39,784 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map ->
> Sink: Print to Std. Out (4/4) (9aa5bb28f3ba75afc1de47bd9a4e5d01) switched
> from DEPLOYING to RUNNING.
> 2019-08-21 20:00:39,787 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map ->
> Sink: Print to Std. Out (3/4) (f109918ce195d1c88c2c63a44934217f) switched
> from DEPLOYING to RUNNING.
> 2019-08-21 20:00:39,806 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source:
> Custom Source -> Flat Map -> Filter -> Filter -> Map ->
> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) ->
> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (2/4)
> (f1a893fc5e2440331c8e6db1874ad220) switched from DEPLOYING to RUNNING.
> 2019-08-21 20:00:39,809 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source:
> Custom Source -> Flat Map -> Filter -> Filter -> Map ->
> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) ->
> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (1/4)
> (ee39866835f7b7387f518e352787a532) switched from DEPLOYING to RUNNING.
> 2019-08-21 20:00:39,809 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - window:
> (TumblingGroupWindow('w$, 'rowtime, 60000.millis)), select: (COUNT(*) AS
> pv, COUNT(DISTINCT curuserid) AS uv, start('w$) AS w$start, end('w$) AS
> w$end, rowtime('w$) AS w$rowtime, proctime('w$) AS w$proctime) -> select:
> (pv, uv, utc2local(w$end) AS EXPR$2) -> to: Row (1/1)
> (0071e1c7c4f70646914329f81dcbd349) switched from DEPLOYING to RUNNING.
> 2019-08-21 20:00:39,991 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print
> to Std. Out (1/4) (0a206444b10553cce42fa34ea0b15497) switched from
> DEPLOYING to RUNNING.
> 2019-08-21 20:00:39,992 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print
> to Std. Out (2/4) (a5d1bb759ee6c8001313fe39c7982a8b) switched from
> DEPLOYING to RUNNING.
> 2019-08-21 20:00:39,996 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map ->
> Sink: Print to Std. Out (1/4) (39d437bceb033f5bd37747e9142c7f0e) switched
> from DEPLOYING to RUNNING.
> 2019-08-21 20:00:39,997 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map ->
> Sink: Print to Std. Out (2/4) (aa4f64edc4f10da2e8bdbf6ebf0aeb83) switched
> from DEPLOYING to RUNNING.
> 2019-08-21 20:00:40,005 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source:
> Custom Source -> Flat Map -> Filter -> Filter -> Map ->
> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) ->
> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (4/4)
> (78f520582607e26b365fca483fc98d4c) switched from DEPLOYING to RUNNING.
> 2019-08-21 20:00:40,006 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source:
> Custom Source -> Flat Map -> Filter -> Filter -> Map ->
> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) ->
> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (3/4)
> (4756a0450881325af5fb396a18dbb8ae) switched from DEPLOYING to RUNNING.
> 2019-08-21 20:00:41,605 INFO
> org.apache.flink.runtime.checkpoint.CheckpointCoordinator     - Triggering
> checkpoint 1 @ 1566388841597 for job e5ea83238d721e932b4645589660406c.
> 2019-08-21 20:00:42,466 INFO
> org.apache.flink.runtime.checkpoint.CheckpointCoordinator     - Completed
> checkpoint 1 for job e5ea83238d721e932b4645589660406c (7994 bytes in 821
> ms).
>
>
>

Re: Re: Flink startup takes 10 minutes

守护
In reply to this post by Yun Tang
Here is what happens: at startup the client log simply stops, and the YARN application is only created after a 10-minute wait.


------------------ Original message ------------------
From: "Yun Tang"<[hidden email]>;
Sent: Thursday, August 22, 2019, 11:23
To: "user-zh"<[hidden email]>;
Subject: Re: Re: Flink startup takes 10 minutes

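The start time is 20:00:25, and by 20:00:42 all tasks were RUNNING and even the first checkpoint had completed. That is only 17 seconds in total, so where does the 10-minute problem come from?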
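
The 17-second figure can be checked directly from the log timestamps. The following is only a minimal sketch, assuming every log line starts with a `yyyy-MM-dd HH:mm:ss,SSS` timestamp as in the jobmanager log in this thread. The helper names `parse_log_timestamp` and `elapsed_seconds` are made up for illustration, and the two sample lines are trimmed copies of the first log line and the "Completed checkpoint 1" line.

```python
from datetime import datetime

# Assumed timestamp layout, matching the jobmanager log in this thread:
# "2019-08-21 20:00:25,428" (23 characters, comma before the milliseconds).
LOG_TS_FORMAT = "%Y-%m-%d %H:%M:%S,%f"
LOG_TS_LENGTH = 23


def parse_log_timestamp(line: str) -> datetime:
    """Parse the timestamp at the start of a single log line."""
    return datetime.strptime(line[:LOG_TS_LENGTH], LOG_TS_FORMAT)


def elapsed_seconds(first_line: str, last_line: str) -> float:
    """Return the time in seconds between two log lines."""
    delta = parse_log_timestamp(last_line) - parse_log_timestamp(first_line)
    return delta.total_seconds()


# Trimmed copies of two lines from the jobmanager log.
start_line = "2019-08-21 20:00:25,428 INFO  org.apache.flink.runtime.entrypoint.ClusterEntrypoint"
done_line = "2019-08-21 20:00:42,466 INFO  org.apache.flink.runtime.checkpoint.CheckpointCoordinator - Completed checkpoint 1"

print(f"{elapsed_seconds(start_line, done_line):.3f} s")  # prints 17.038 s
```

The same helper can be pointed at the client-side log: if the jobmanager itself only needs about 17 seconds, a 10-minute wait would have to show up as a gap between two consecutive lines somewhere before the application is submitted.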
________________________________
From: 々守护々 <[hidden email]>
Sent: Thursday, August 22, 2019 11:18
To: user-zh <[hidden email]>
Subject: Re: Flink startup takes 10 minutes

Hello, here is my jobmanager startup log. Could you please take a look? Thanks!


2019-08-21 20:00:25,428 INFO  org.apache.flink.runtime.entrypoint.ClusterEntrypoint         - --------------------------------------------------------------------------------
2019-08-21 20:00:25,430 INFO  org.apache.flink.runtime.entrypoint.ClusterEntrypoint         - Registered UNIX signal handlers for [TERM, HUP, INT]
2019-08-21 20:00:25,433 INFO  org.apache.flink.runtime.entrypoint.ClusterEntrypoint         - YARN daemon is running as: root Yarn client user obtainer: root
2019-08-21 20:00:25,437 INFO  org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: high-availability.cluster-id, application_1566383236573_0003
2019-08-21 20:00:25,437 INFO  org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: jobmanager.rpc.address, localhost
2019-08-21 20:00:25,437 INFO  org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: jobmanager.rpc.port, 6123
2019-08-21 20:00:25,437 INFO  org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: high-availability.zookeeper.path.root, /flink-yarn
2019-08-21 20:00:25,437 INFO  org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: high-availability.storageDir, hdfs://host51:9000/flink/ha-yarn
2019-08-21 20:00:25,438 INFO  org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: high-availability.zookeeper.quorum, host51:2181
2019-08-21 20:00:25,438 INFO  org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: internal.cluster.execution-mode, DETACHED
2019-08-21 20:00:25,438 INFO  org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: high-availability, zookeeper
2019-08-21 20:00:25,438 INFO  org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: parallelism.default, 1
2019-08-21 20:00:25,438 INFO  org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: yarn.application-attempts, 10
2019-08-21 20:00:25,438 INFO  org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: taskmanager.numberOfTaskSlots, 2
2019-08-21 20:00:25,439 INFO  org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: taskmanager.heap.size, 8192m
2019-08-21 20:00:25,439 INFO  org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: jobmanager.heap.size, 2048m
2019-08-21 20:00:25,465 WARN  org.apache.flink.configuration.Configuration                  - Config uses deprecated configuration key 'web.port' instead of proper key 'rest.bind-port'
2019-08-21 20:00:25,469 INFO  org.apache.flink.runtime.clusterframework.BootstrapTools      - Setting directories for temporary files to: /usr/hadoop/hadoop-2.7.7/tmp/nm-local-dir/usercache/root/appcache/application_1566383236573_0003
2019-08-21 20:00:25,485 INFO  org.apache.flink.runtime.entrypoint.ClusterEntrypoint         - Starting YarnJobClusterEntrypoint.
2019-08-21 20:00:25,485 INFO  org.apache.flink.runtime.entrypoint.ClusterEntrypoint         - Install default filesystem.
2019-08-21 20:00:25,561 INFO  org.apache.flink.runtime.security.modules.HadoopModule        - Hadoop user set to root (auth:SIMPLE)
2019-08-21 20:00:25,580 INFO  org.apache.flink.runtime.entrypoint.ClusterEntrypoint         - Initializing cluster services.
2019-08-21 20:00:25,757 INFO  org.apache.flink.runtime.rpc.akka.AkkaRpcServiceUtils         - Trying to start actor system at host53:0
2019-08-21 20:00:26,187 INFO  akka.event.slf4j.Slf4jLogger                                  - Slf4jLogger started
2019-08-21 20:00:26,251 INFO  akka.remote.Remoting                                          - Starting remoting
2019-08-21 20:00:26,439 INFO  akka.remote.Remoting                                          - Remoting started; listening on addresses :[akka.tcp://flink@host53:59609]
2019-08-21 20:00:26,447 INFO  org.apache.flink.runtime.rpc.akka.AkkaRpcServiceUtils         - Actor system started at akka.tcp://flink@host53:59609
2019-08-21 20:00:27,011 INFO  org.apache.flink.runtime.blob.FileSystemBlobStore             - Creating highly available BLOB storage directory at hdfs://host51:9000/flink/ha-yarn/application_1566383236573_0003/blob
2019-08-21 20:00:27,097 INFO  org.apache.flink.runtime.util.ZooKeeperUtils                  - Enforcing default ACL for ZK connections
2019-08-21 20:00:27,097 INFO  org.apache.flink.runtime.util.ZooKeeperUtils                  - Using '/flink-yarn/application_1566383236573_0003' as Zookeeper namespace.
2019-08-21 20:00:27,161 INFO  org.apache.flink.shaded.curator.org.apache.curator.framework.imps.CuratorFrameworkImpl  - Starting
2019-08-21 20:00:27,169 INFO  org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client environment:zookeeper.version=3.4.10-39d3a4f269333c922ed3db283be479f9deacaa0f, built on 03/23/2017 10:13 GMT
2019-08-21 20:00:27,169 INFO  org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client environment:host.name=host53
2019-08-21 20:00:27,169 INFO  org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client environment:java.version=1.8.0_77
2019-08-21 20:00:27,169 INFO  org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client environment:java.vendor=Oracle Corporation
2019-08-21 20:00:27,169 INFO  org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client
2019-08-21 20:00:27,170 INFO  org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client environment:java.library.path=:/usr/hadoop/hadoop-2.7.7/lib/native:/usr/hadoop/hadoop-2.7.7/lib/native:/usr/java/packages/lib/amd64:/usr/lib64:/lib64:/lib:/usr/lib
2019-08-21 20:00:27,170 INFO  org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client environment:java.io.tmpdir=/tmp
2019-08-21 20:00:27,170 INFO  org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client environment:java.compiler=<NA>
2019-08-21 20:00:27,170 INFO  org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client environment:os.name=Linux
2019-08-21 20:00:27,170 INFO  org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client environment:os.arch=amd64
2019-08-21 20:00:27,170 INFO  org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client environment:os.version=3.16.0-0.bpo.4-amd64
2019-08-21 20:00:27,170 INFO  org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client environment:user.name=root
2019-08-21 20:00:27,170 INFO  org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client environment:user.home=/root
2019-08-21 20:00:27,170 INFO  org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client environment:user.dir=/usr/hadoop/hadoop-2.7.7/tmp/nm-local-dir/usercache/root/appcache/application_1566383236573_0003/container_1566383236573_0003_01_000001
2019-08-21 20:00:27,171 INFO  org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Initiating client connection, connectString=host51:2181 sessionTimeout=60000 watcher=org.apache.flink.shaded.curator.org.apache.curator.ConnectionState@319854f0
2019-08-21 20:00:27,184 WARN  org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ClientCnxn  - SASL configuration failed: javax.security.auth.login.LoginException: No JAAS configuration section named 'Client' was found in specified JAAS configuration file: '/tmp/jaas-4274640356638794035.conf'. Will continue connection to Zookeeper server without SASL authentication, if Zookeeper server allows it.
2019-08-21 20:00:27,186 INFO  org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ClientCnxn  - Opening socket connection to server host51/10.10.10.51:2181
2019-08-21 20:00:27,187 ERROR org.apache.flink.shaded.curator.org.apache.curator.ConnectionState  - Authentication failed
2019-08-21 20:00:27,187 INFO  org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ClientCnxn  - Socket connection established to host51/10.10.10.51:2181, initiating session
2019-08-21 20:00:27,193 INFO  org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ClientCnxn  - Session establishment complete on server host51/10.10.10.51:2181, sessionid = 0x1001a86de810008, negotiated timeout = 40000
2019-08-21 20:00:27,193 INFO  org.apache.flink.runtime.blob.BlobServer                      - Created BLOB server storage directory /usr/hadoop/hadoop-2.7.7/tmp/nm-local-dir/usercache/root/appcache/application_1566383236573_0003/blobStore-eaa2bb2b-cddb-42e3-8ce5-bf2f83e8567a
2019-08-21 20:00:27,195 INFO  org.apache.flink.shaded.curator.org.apache.curator.framework.state.ConnectionStateManager  - State change: CONNECTED
2019-08-21 20:00:27,198 INFO  org.apache.flink.runtime.blob.BlobServer                      - Started BLOB server at 0.0.0.0:58591 - max concurrent requests: 50 - max backlog: 1000
2019-08-21 20:00:27,213 INFO  org.apache.flink.runtime.metrics.MetricRegistryImpl           - No metrics reporter configured, no metrics will be exposed/reported.
2019-08-21 20:00:27,215 INFO  org.apache.flink.runtime.entrypoint.ClusterEntrypoint         - Trying to start actor system at host53:0
2019-08-21 20:00:27,239 INFO  akka.event.slf4j.Slf4jLogger                                  - Slf4jLogger started
2019-08-21 20:00:27,247 INFO  akka.remote.Remoting                                          - Starting remoting
2019-08-21 20:00:27,255 INFO  akka.remote.Remoting                                          - Remoting started; listening on addresses :[akka.tcp://flink-metrics@host53:59941]
2019-08-21 20:00:27,257 INFO  org.apache.flink.runtime.entrypoint.ClusterEntrypoint         - Actor system started at akka.tcp://flink-metrics@host53:59941
2019-08-21 20:00:27,263 INFO  org.apache.flink.runtime.blob.TransientBlobCache              - Created BLOB cache storage directory /usr/hadoop/hadoop-2.7.7/tmp/nm-local-dir/usercache/root/appcache/application_1566383236573_0003/blobStore-405a4797-824f-48da-bb50-2ee2e230d249
2019-08-21 20:00:27,296 WARN  org.apache.flink.configuration.Configuration                  - Config uses deprecated configuration key 'web.port' instead of proper key 'rest.bind-port'
2019-08-21 20:00:27,298 WARN  org.apache.flink.runtime.jobmaster.MiniDispatcherRestEndpoint  - Upload directory /tmp/flink-web-57c9cddd-64af-4e3d-96a9-4f9e78d20b86/flink-web-upload does not exist, or has been deleted externally. Previously uploaded files are no longer available.
2019-08-21 20:00:27,298 INFO  org.apache.flink.runtime.jobmaster.MiniDispatcherRestEndpoint  - Created directory /tmp/flink-web-57c9cddd-64af-4e3d-96a9-4f9e78d20b86/flink-web-upload for file uploads.
2019-08-21 20:00:27,320 INFO  org.apache.flink.runtime.jobmaster.MiniDispatcherRestEndpoint  - Starting rest endpoint.
2019-08-21 20:00:27,616 INFO  org.apache.flink.runtime.webmonitor.WebMonitorUtils           - Determined location of main cluster component log file: /usr/hadoop/hadoop-2.7.7/logs/userlogs/application_1566383236573_0003/container_1566383236573_0003_01_000001/jobmanager.log
2019-08-21 20:00:27,616 INFO  org.apache.flink.runtime.webmonitor.WebMonitorUtils           - Determined location of main cluster component stdout file: /usr/hadoop/hadoop-2.7.7/logs/userlogs/application_1566383236573_0003/container_1566383236573_0003_01_000001/jobmanager.out
2019-08-21 20:00:27,771 INFO  org.apache.flink.runtime.jobmaster.MiniDispatcherRestEndpoint  - Rest endpoint listening at host53:46401
2019-08-21 20:00:27,772 INFO  org.apache.flink.runtime.leaderelection.ZooKeeperLeaderElectionService  - Starting ZooKeeperLeaderElectionService ZooKeeperLeaderElectionService{leaderPath='/leader/rest_server_lock'}.
2019-08-21 20:00:27,795 WARN  org.apache.flink.shaded.curator.org.apache.curator.utils.ZKPaths  - The version of ZooKeeper being used doesn't support Container nodes. CreateMode.PERSISTENT will be used instead.
2019-08-21 20:00:27,807 INFO  org.apache.flink.runtime.jobmaster.MiniDispatcherRestEndpoint  - Web frontend listening at http://host53:46401.
2019-08-21 20:00:27,836 INFO  org.apache.flink.runtime.jobmaster.MiniDispatcherRestEndpoint  - http://host53:46401 was granted leadership with leaderSessionID=8487757f-fd82-42b2-a1a6-adcc69e1eba1
2019-08-21 20:00:27,932 INFO  org.apache.flink.runtime.rpc.akka.AkkaRpcService              - Starting RPC endpoint for org.apache.flink.yarn.YarnResourceManager at akka://flink/user/resourcemanager .
2019-08-21 20:00:28,021 INFO  org.apache.flink.runtime.rpc.akka.AkkaRpcService              - Starting RPC endpoint for org.apache.flink.runtime.dispatcher.MiniDispatcher at akka://flink/user/dispatcher .
2019-08-21 20:00:28,033 INFO  org.apache.flink.runtime.leaderretrieval.ZooKeeperLeaderRetrievalService  - Starting ZooKeeperLeaderRetrievalService /leader/resource_manager_lock.
2019-08-21 20:00:28,033 INFO  org.apache.flink.runtime.leaderretrieval.ZooKeeperLeaderRetrievalService  - Starting ZooKeeperLeaderRetrievalService /leader/dispatcher_lock.
2019-08-21 20:00:28,033 INFO  org.apache.flink.runtime.leaderelection.ZooKeeperLeaderElectionService  - Starting ZooKeeperLeaderElectionService ZooKeeperLeaderElectionService{leaderPath='/leader/dispatcher_lock'}.
2019-08-21 20:00:28,045 INFO  org.apache.flink.runtime.dispatcher.MiniDispatcher            - Dispatcher akka.tcp://flink@host53:59609/user/dispatcher was granted leadership with fencing token 6216d17e-56b1-43bb-9bb3-f9fa5b8921be
2019-08-21 20:00:28,048 INFO  org.apache.flink.runtime.dispatcher.MiniDispatcher            - Recovering all persisted jobs.
2019-08-21 20:00:28,054 INFO  org.apache.hadoop.yarn.client.RMProxy                         - Connecting to ResourceManager at host51/10.10.10.51:8030
2019-08-21 20:00:28,078 INFO  org.apache.flink.runtime.rpc.akka.AkkaRpcService              - Starting RPC endpoint for org.apache.flink.runtime.jobmaster.JobMaster at akka://flink/user/jobmanager_0 .
2019-08-21 20:00:28,086 INFO  org.apache.flink.runtime.jobmaster.JobMaster                  - Initializing job StreamKafkaMain (e5ea83238d721e932b4645589660406c).
2019-08-21 20:00:28,099 INFO  org.apache.flink.runtime.jobmaster.JobMaster                  - Using restart strategy FixedDelayRestartStrategy(maxNumberRestartAttempts=2147483647, delayBetweenRestartAttempts=0) for StreamKafkaMain (e5ea83238d721e932b4645589660406c).
2019-08-21 20:00:28,122 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Job recovers via failover strategy: full graph restart
2019-08-21 20:00:28,153 INFO  org.apache.flink.runtime.jobmaster.JobMaster                  - Running initialization on master for job StreamKafkaMain (e5ea83238d721e932b4645589660406c).
2019-08-21 20:00:28,154 INFO  org.apache.flink.runtime.jobmaster.JobMaster                  - Successfully ran initialization on master in 0 ms.
2019-08-21 20:00:28,187 INFO  org.apache.flink.runtime.util.ZooKeeperUtils                  - Initialized ZooKeeperCompletedCheckpointStore in '/checkpoints/e5ea83238d721e932b4645589660406c'.
2019-08-21 20:00:28,196 INFO  org.apache.flink.yarn.YarnResourceManager                     - Recovered 0 containers from previous attempts ([]).
2019-08-21 20:00:28,199 INFO  org.apache.hadoop.yarn.client.api.impl.ContainerManagementProtocolProxy  - yarn.client.max-cached-nodemanagers-proxies : 0
2019-08-21 20:00:28,200 INFO  org.apache.flink.runtime.leaderelection.ZooKeeperLeaderElectionService  - Starting ZooKeeperLeaderElectionService ZooKeeperLeaderElectionService{leaderPath='/leader/resource_manager_lock'}.
2019-08-21 20:00:28,203 INFO  org.apache.flink.runtime.jobmaster.JobMaster                  - No state backend has been configured, using default (Memory / JobManager) MemoryStateBackend (data in heap memory / checkpoints to JobManager) (checkpoints: 'null', savepoints: 'null', asynchronous: TRUE, maxStateSize: 5242880)
2019-08-21 20:00:28,211 INFO  org.apache.flink.yarn.YarnResourceManager                     - ResourceManager akka.tcp://flink@host53:59609/user/resourcemanager was granted leadership with fencing token 97537c0d61bbf091e37cb6c45b784568
2019-08-21 20:00:28,214 INFO  org.apache.flink.runtime.resourcemanager.slotmanager.SlotManager  - Starting the SlotManager.
2019-08-21 20:00:28,225 INFO  org.apache.flink.runtime.checkpoint.ZooKeeperCompletedCheckpointStore  - Recovering checkpoints from ZooKeeper.
2019-08-21 20:00:28,229 INFO  org.apache.flink.runtime.checkpoint.ZooKeeperCompletedCheckpointStore  - Found 0 checkpoints in ZooKeeper.
2019-08-21 20:00:28,229 INFO  org.apache.flink.runtime.checkpoint.ZooKeeperCompletedCheckpointStore  - Trying to fetch 0 checkpoints from storage.
2019-08-21 20:00:28,233 INFO  org.apache.flink.runtime.leaderelection.ZooKeeperLeaderElectionService  - Starting ZooKeeperLeaderElectionService ZooKeeperLeaderElectionService{leaderPath='/leader/e5ea83238d721e932b4645589660406c/job_manager_lock'}.
2019-08-21 20:00:28,248 INFO  org.apache.flink.runtime.jobmaster.JobManagerRunner           - JobManager runner for job StreamKafkaMain (e5ea83238d721e932b4645589660406c) was granted leadership with session id 12c96c94-ebbd-4991-a457-6f0320d71bc1 at akka.tcp://flink@host53:59609/user/jobmanager_0.
2019-08-21 20:00:28,260 INFO  org.apache.flink.runtime.leaderretrieval.ZooKeeperLeaderRetrievalService  - Starting ZooKeeperLeaderRetrievalService /leader/resource_manager_lock.
2019-08-21 20:00:28,260 INFO  org.apache.flink.runtime.jobmaster.JobMaster                  - Starting execution of job StreamKafkaMain (e5ea83238d721e932b4645589660406c) under job master id a4576f0320d71bc112c96c94ebbd4991.
2019-08-21 20:00:28,261 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Job StreamKafkaMain (e5ea83238d721e932b4645589660406c) switched from state CREATED to RUNNING.
2019-08-21 20:00:28,266 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source: Custom Source -> Flat Map -> Filter -> Filter -> Map -> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) -> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (1/4) (ee39866835f7b7387f518e352787a532) switched from CREATED to SCHEDULED.
2019-08-21 20:00:28,280 INFO  org.apache.flink.runtime.jobmaster.slotpool.SlotPoolImpl      - Cannot serve slot request, no ResourceManager connected. Adding as pending request [SlotRequestId{f6123820ffb8b3e906b876e9dea4497b}]
2019-08-21 20:00:28,287 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source: Custom Source -> Flat Map -> Filter -> Filter -> Map -> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) -> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (2/4) (f1a893fc5e2440331c8e6db1874ad220) switched from CREATED to SCHEDULED.
2019-08-21 20:00:28,288 INFO  org.apache.flink.runtime.jobmaster.slotpool.SlotPoolImpl      - Cannot serve slot request, no ResourceManager connected. Adding as pending request [SlotRequestId{fdbb2e71c9bc795452687396abff7bf8}]
2019-08-21 20:00:28,288 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source: Custom Source -> Flat Map -> Filter -> Filter -> Map -> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) -> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (3/4) (4756a0450881325af5fb396a18dbb8ae) switched from CREATED to SCHEDULED.
2019-08-21 20:00:28,289 INFO  org.apache.flink.runtime.jobmaster.slotpool.SlotPoolImpl      - Cannot serve slot request, no ResourceManager connected. Adding as pending request [SlotRequestId{13970fab9312acf9d5063b1d1a33aa0e}]
2019-08-21 20:00:28,289 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source: Custom Source -> Flat Map -> Filter -> Filter -> Map -> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) -> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (4/4) (78f520582607e26b365fca483fc98d4c) switched from CREATED to SCHEDULED.
2019-08-21 20:00:28,289 INFO  org.apache.flink.runtime.jobmaster.slotpool.SlotPoolImpl      - Cannot serve slot request, no ResourceManager connected. Adding as pending request [SlotRequestId{0621470f796aa3ea4291d1b8b423e428}]
2019-08-21 20:00:28,289 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - window: (TumblingGroupWindow('w$, 'rowtime, 60000.millis)), select: (COUNT(*) AS pv, COUNT(DISTINCT curuserid) AS uv, start('w$) AS w$start, end('w$) AS w$end, rowtime('w$) AS w$rowtime, proctime('w$) AS w$proctime) -> select: (pv, uv, utc2local(w$end) AS EXPR$2) -> to: Row (1/1) (0071e1c7c4f70646914329f81dcbd349) switched from CREATED to SCHEDULED.
2019-08-21 20:00:28,291 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print to Std. Out (1/4) (0a206444b10553cce42fa34ea0b15497) switched from CREATED to SCHEDULED.
2019-08-21 20:00:28,291 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print to Std. Out (2/4) (a5d1bb759ee6c8001313fe39c7982a8b) switched from CREATED to SCHEDULED.
2019-08-21 20:00:28,291 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print to Std. Out (3/4) (9543bbde6538ea9d407af3012fede5c8) switched from CREATED to SCHEDULED.
2019-08-21 20:00:28,291 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print to Std. Out (4/4) (0e8c4210bd81e581fbf354b86397ce41) switched from CREATED to SCHEDULED.
2019-08-21 20:00:28,291 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map -> Sink: Print to Std. Out (1/4) (39d437bceb033f5bd37747e9142c7f0e) switched from CREATED to SCHEDULED.
2019-08-21 20:00:28,291 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map -> Sink: Print to Std. Out (2/4) (aa4f64edc4f10da2e8bdbf6ebf0aeb83) switched from CREATED to SCHEDULED.
2019-08-21 20:00:28,291 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map -> Sink: Print to Std. Out (3/4) (f109918ce195d1c88c2c63a44934217f) switched from CREATED to SCHEDULED.
2019-08-21 20:00:28,292 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map -> Sink: Print to Std. Out (4/4) (9aa5bb28f3ba75afc1de47bd9a4e5d01) switched from CREATED to SCHEDULED.
2019-08-21 20:00:28,295 INFO  org.apache.flink.runtime.jobmaster.JobMaster                  - Connecting to ResourceManager akka.tcp://flink@host53:59609/user/resourcemanager(97537c0d61bbf091e37cb6c45b784568)
2019-08-21 20:00:28,300 INFO  org.apache.flink.runtime.jobmaster.JobMaster                  - Resolved ResourceManager address, beginning registration
2019-08-21 20:00:28,300 INFO  org.apache.flink.runtime.jobmaster.JobMaster                  - Registration at ResourceManager attempt 1 (timeout=100ms)
2019-08-21 20:00:28,304 INFO  org.apache.flink.runtime.leaderretrieval.ZooKeeperLeaderRetrievalService  - Starting ZooKeeperLeaderRetrievalService /leader/e5ea83238d721e932b4645589660406c/job_manager_lock.
2019-08-21 20:00:28,304 INFO  org.apache.flink.yarn.YarnResourceManager                     - Registering job manager a4576f0320d71bc112c96c94ebbd4991@akka.tcp://flink@host53:59609/user/jobmanager_0 for job e5ea83238d721e932b4645589660406c.
2019-08-21 20:00:28,310 INFO  org.apache.flink.yarn.YarnResourceManager                     - Registered job manager a4576f0320d71bc112c96c94ebbd4991@akka.tcp://flink@host53:59609/user/jobmanager_0 for job e5ea83238d721e932b4645589660406c.
2019-08-21 20:00:28,313 INFO  org.apache.flink.runtime.jobmaster.JobMaster                  - JobManager successfully registered at ResourceManager, leader id: 97537c0d61bbf091e37cb6c45b784568.
2019-08-21 20:00:28,314 INFO  org.apache.flink.runtime.jobmaster.slotpool.SlotPoolImpl      - Requesting new slot [SlotRequestId{13970fab9312acf9d5063b1d1a33aa0e}] and profile ResourceProfile{cpuCores=-1.0, heapMemoryInMB=-1, directMemoryInMB=0, nativeMemoryInMB=0, networkMemoryInMB=0} from resource manager.
2019-08-21 20:00:28,315 INFO  org.apache.flink.yarn.YarnResourceManager                     - Request slot with profile ResourceProfile{cpuCores=-1.0, heapMemoryInMB=-1, directMemoryInMB=0, nativeMemoryInMB=0, networkMemoryInMB=0} for job e5ea83238d721e932b4645589660406c with allocation id faa6a521758e5be4b3dc1955ca0f51b4.
2019-08-21 20:00:28,316 INFO  org.apache.flink.runtime.jobmaster.slotpool.SlotPoolImpl      - Requesting new slot [SlotRequestId{fdbb2e71c9bc795452687396abff7bf8}] and profile ResourceProfile{cpuCores=-1.0, heapMemoryInMB=-1, directMemoryInMB=0, nativeMemoryInMB=0, networkMemoryInMB=0} from resource manager.
2019-08-21 20:00:28,316 INFO  org.apache.flink.runtime.jobmaster.slotpool.SlotPoolImpl      - Requesting new slot [SlotRequestId{0621470f796aa3ea4291d1b8b423e428}] and profile ResourceProfile{cpuCores=-1.0, heapMemoryInMB=-1, directMemoryInMB=0, nativeMemoryInMB=0, networkMemoryInMB=0} from resource manager.
2019-08-21 20:00:28,316 INFO  org.apache.flink.runtime.jobmaster.slotpool.SlotPoolImpl      - Requesting new slot [SlotRequestId{f6123820ffb8b3e906b876e9dea4497b}] and profile ResourceProfile{cpuCores=-1.0, heapMemoryInMB=-1, directMemoryInMB=0, nativeMemoryInMB=0, networkMemoryInMB=0} from resource manager.
2019-08-21 20:00:28,321 INFO  org.apache.flink.yarn.YarnResourceManager                     - Requesting new TaskExecutor container with resources <memory:8192, vCores:2>. Number pending requests 1.
2019-08-21 20:00:28,322 INFO  org.apache.flink.yarn.YarnResourceManager                     - Request slot with profile ResourceProfile{cpuCores=-1.0, heapMemoryInMB=-1, directMemoryInMB=0, nativeMemoryInMB=0, networkMemoryInMB=0} for job e5ea83238d721e932b4645589660406c with allocation id a2b419aea4a373686eeed6c46163a528.
2019-08-21 20:00:28,323 INFO  org.apache.flink.yarn.YarnResourceManager                     - Request slot with profile ResourceProfile{cpuCores=-1.0, heapMemoryInMB=-1, directMemoryInMB=0, nativeMemoryInMB=0, networkMemoryInMB=0} for job e5ea83238d721e932b4645589660406c with allocation id cbc11737ce6729cfe45f24c43180b5c8.
2019-08-21 20:00:28,323 INFO  org.apache.flink.yarn.YarnResourceManager                     - Requesting new TaskExecutor container with resources <memory:8192, vCores:2>. Number pending requests 2.
2019-08-21 20:00:28,323 INFO  org.apache.flink.yarn.YarnResourceManager                     - Request slot with profile ResourceProfile{cpuCores=-1.0, heapMemoryInMB=-1, directMemoryInMB=0, nativeMemoryInMB=0, networkMemoryInMB=0} for job e5ea83238d721e932b4645589660406c with allocation id f6566567c173a55b2bc52b72bcb24ff8.
2019-08-21 20:00:31,597 INFO  org.apache.flink.runtime.checkpoint.CheckpointCoordinator     - Checkpoint triggering task Source: Custom Source -> Flat Map -> Filter -> Filter -> Map -> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) -> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (1/4) of job e5ea83238d721e932b4645589660406c is not in state RUNNING but SCHEDULED instead. Aborting checkpoint.
2019-08-21 20:00:33,749 INFO  org.apache.hadoop.yarn.client.api.impl.AMRMClientImpl         - Received new token for : host53:59938
2019-08-21 20:00:33,749 INFO  org.apache.hadoop.yarn.client.api.impl.AMRMClientImpl         - Received new token for : host52:33246
2019-08-21 20:00:33,765 INFO  org.apache.flink.yarn.YarnResourceManager                     - Received new container: container_1566383236573_0003_01_000002 - Remaining pending container requests: 2
2019-08-21 20:00:33,765 INFO  org.apache.flink.yarn.YarnResourceManager                     - Removing container request Capability[<memory:8192, vCores:2>]Priority[1]. Pending container requests 1.
2019-08-21 20:00:34,009 INFO  org.apache.flink.yarn.YarnResourceManager                     - Creating container launch context for TaskManagers
2019-08-21 20:00:34,010 INFO  org.apache.flink.yarn.YarnResourceManager                     - Starting TaskManagers
2019-08-21 20:00:34,021 INFO  org.apache.hadoop.yarn.client.api.impl.ContainerManagementProtocolProxy  - Opening proxy : host53:59938
2019-08-21 20:00:34,066 INFO  org.apache.flink.yarn.YarnResourceManager                     - Received new container: container_1566383236573_0003_01_000003 - Remaining pending container requests: 1
2019-08-21 20:00:34,066 INFO  org.apache.flink.yarn.YarnResourceManager                     - Removing container request Capability[<memory:8192, vCores:2>]Priority[1]. Pending container requests 0.
2019-08-21 20:00:34,088 INFO  org.apache.flink.yarn.YarnResourceManager                     - Creating container launch context for TaskManagers
2019-08-21 20:00:34,089 INFO  org.apache.flink.yarn.YarnResourceManager                     - Starting TaskManagers
2019-08-21 20:00:34,090 INFO  org.apache.hadoop.yarn.client.api.impl.ContainerManagementProtocolProxy  - Opening proxy : host52:33246
2019-08-21 20:00:36,597 INFO  org.apache.flink.runtime.checkpoint.CheckpointCoordinator     - Checkpoint triggering task Source: Custom Source -> Flat Map -> Filter -> Filter -> Map -> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) -> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (1/4) of job e5ea83238d721e932b4645589660406c is not in state RUNNING but SCHEDULED instead. Aborting checkpoint.
2019-08-21 20:00:37,856 INFO  org.apache.flink.yarn.YarnResourceManager                     - Registering TaskManager with ResourceID container_1566383236573_0003_01_000002 (akka.tcp://flink@host53:34608/user/taskmanager_0) at ResourceManager
2019-08-21 20:00:39,573 INFO  org.apache.flink.yarn.YarnResourceManager                     - Registering TaskManager with ResourceID container_1566383236573_0003_01_000003 (akka.tcp://flink@host52:32931/user/taskmanager_0) at ResourceManager
2019-08-21 20:00:39,659 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source: Custom Source -> Flat Map -> Filter -> Filter -> Map -> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) -> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (1/4) (ee39866835f7b7387f518e352787a532) switched from SCHEDULED to DEPLOYING.
2019-08-21 20:00:39,659 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying Source: Custom Source -> Flat Map -> Filter -> Filter -> Map -> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) -> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (1/4) (attempt #0) to container_1566383236573_0003_01_000002 @ host53 (dataPort=52456)
2019-08-21 20:00:39,664 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source: Custom Source -> Flat Map -> Filter -> Filter -> Map -> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) -> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (2/4) (f1a893fc5e2440331c8e6db1874ad220) switched from SCHEDULED to DEPLOYING.
2019-08-21 20:00:39,664 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying Source: Custom Source -> Flat Map -> Filter -> Filter -> Map -> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) -> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (2/4) (attempt #0) to container_1566383236573_0003_01_000002 @ host53 (dataPort=52456)
2019-08-21 20:00:39,665 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source: Custom Source -> Flat Map -> Filter -> Filter -> Map -> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) -> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (3/4) (4756a0450881325af5fb396a18dbb8ae) switched from SCHEDULED to DEPLOYING.
2019-08-21 20:00:39,666 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying Source: Custom Source -> Flat Map -> Filter -> Filter -> Map -> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) -> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (3/4) (attempt #0) to container_1566383236573_0003_01_000003 @ host52 (dataPort=39650)
2019-08-21 20:00:39,666 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source: Custom Source -> Flat Map -> Filter -> Filter -> Map -> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) -> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (4/4) (78f520582607e26b365fca483fc98d4c) switched from SCHEDULED to DEPLOYING.
2019-08-21 20:00:39,666 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying Source: Custom Source -> Flat Map -> Filter -> Filter -> Map -> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) -> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (4/4) (attempt #0) to container_1566383236573_0003_01_000003 @ host52 (dataPort=39650)
2019-08-21 20:00:39,666 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - window: (TumblingGroupWindow('w$, 'rowtime, 60000.millis)), select: (COUNT(*) AS pv, COUNT(DISTINCT curuserid) AS uv, start('w$) AS w$start, end('w$) AS w$end, rowtime('w$) AS w$rowtime, proctime('w$) AS w$proctime) -> select: (pv, uv, utc2local(w$end) AS EXPR$2) -> to: Row (1/1) (0071e1c7c4f70646914329f81dcbd349) switched from SCHEDULED to DEPLOYING.
2019-08-21 20:00:39,666 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying window: (TumblingGroupWindow('w$, 'rowtime, 60000.millis)), select: (COUNT(*) AS pv, COUNT(DISTINCT curuserid) AS uv, start('w$) AS w$start, end('w$) AS w$end, rowtime('w$) AS w$rowtime, proctime('w$) AS w$proctime) -> select: (pv, uv, utc2local(w$end) AS EXPR$2) -> to: Row (1/1) (attempt #0) to container_1566383236573_0003_01_000002 @ host53 (dataPort=52456)
2019-08-21 20:00:39,670 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print to Std. Out (1/4) (0a206444b10553cce42fa34ea0b15497) switched from SCHEDULED to DEPLOYING.
2019-08-21 20:00:39,670 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying Sink: Print to Std. Out (1/4) (attempt #0) to container_1566383236573_0003_01_000003 @ host52 (dataPort=39650)
2019-08-21 20:00:39,671 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print to Std. Out (2/4) (a5d1bb759ee6c8001313fe39c7982a8b) switched from SCHEDULED to DEPLOYING.
2019-08-21 20:00:39,672 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying Sink: Print to Std. Out (2/4) (attempt #0) to container_1566383236573_0003_01_000003 @ host52 (dataPort=39650)
2019-08-21 20:00:39,672 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print to Std. Out (3/4) (9543bbde6538ea9d407af3012fede5c8) switched from SCHEDULED to DEPLOYING.
2019-08-21 20:00:39,672 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying Sink: Print to Std. Out (3/4) (attempt #0) to container_1566383236573_0003_01_000002 @ host53 (dataPort=52456)
2019-08-21 20:00:39,672 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print to Std. Out (4/4) (0e8c4210bd81e581fbf354b86397ce41) switched from SCHEDULED to DEPLOYING.
2019-08-21 20:00:39,673 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying Sink: Print to Std. Out (4/4) (attempt #0) to container_1566383236573_0003_01_000002 @ host53 (dataPort=52456)
2019-08-21 20:00:39,673 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map -> Sink: Print to Std. Out (1/4) (39d437bceb033f5bd37747e9142c7f0e) switched from SCHEDULED to DEPLOYING.
2019-08-21 20:00:39,673 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying Map -> Sink: Print to Std. Out (1/4) (attempt #0) to container_1566383236573_0003_01_000003 @ host52 (dataPort=39650)
2019-08-21 20:00:39,675 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map -> Sink: Print to Std. Out (2/4) (aa4f64edc4f10da2e8bdbf6ebf0aeb83) switched from SCHEDULED to DEPLOYING.
2019-08-21 20:00:39,675 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying Map -> Sink: Print to Std. Out (2/4) (attempt #0) to container_1566383236573_0003_01_000003 @ host52 (dataPort=39650)
2019-08-21 20:00:39,675 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map -> Sink: Print to Std. Out (3/4) (f109918ce195d1c88c2c63a44934217f) switched from SCHEDULED to DEPLOYING.
2019-08-21 20:00:39,675 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying Map -> Sink: Print to Std. Out (3/4) (attempt #0) to container_1566383236573_0003_01_000002 @ host53 (dataPort=52456)
2019-08-21 20:00:39,676 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map -> Sink: Print to Std. Out (4/4) (9aa5bb28f3ba75afc1de47bd9a4e5d01) switched from SCHEDULED to DEPLOYING.
2019-08-21 20:00:39,676 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying Map -> Sink: Print to Std. Out (4/4) (attempt #0) to container_1566383236573_0003_01_000002 @ host53 (dataPort=52456)
2019-08-21 20:00:39,782 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print to Std. Out (4/4) (0e8c4210bd81e581fbf354b86397ce41) switched from DEPLOYING to RUNNING.
2019-08-21 20:00:39,783 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print to Std. Out (3/4) (9543bbde6538ea9d407af3012fede5c8) switched from DEPLOYING to RUNNING.
2019-08-21 20:00:39,784 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map -> Sink: Print to Std. Out (4/4) (9aa5bb28f3ba75afc1de47bd9a4e5d01) switched from DEPLOYING to RUNNING.
2019-08-21 20:00:39,787 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map -> Sink: Print to Std. Out (3/4) (f109918ce195d1c88c2c63a44934217f) switched from DEPLOYING to RUNNING.
2019-08-21 20:00:39,806 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source: Custom Source -> Flat Map -> Filter -> Filter -> Map -> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) -> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (2/4) (f1a893fc5e2440331c8e6db1874ad220) switched from DEPLOYING to RUNNING.
2019-08-21 20:00:39,809 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source: Custom Source -> Flat Map -> Filter -> Filter -> Map -> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) -> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (1/4) (ee39866835f7b7387f518e352787a532) switched from DEPLOYING to RUNNING.
2019-08-21 20:00:39,809 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - window: (TumblingGroupWindow('w$, 'rowtime, 60000.millis)), select: (COUNT(*) AS pv, COUNT(DISTINCT curuserid) AS uv, start('w$) AS w$start, end('w$) AS w$end, rowtime('w$) AS w$rowtime, proctime('w$) AS w$proctime) -> select: (pv, uv, utc2local(w$end) AS EXPR$2) -> to: Row (1/1) (0071e1c7c4f70646914329f81dcbd349) switched from DEPLOYING to RUNNING.
2019-08-21 20:00:39,991 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print to Std. Out (1/4) (0a206444b10553cce42fa34ea0b15497) switched from DEPLOYING to RUNNING.
2019-08-21 20:00:39,992 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print to Std. Out (2/4) (a5d1bb759ee6c8001313fe39c7982a8b) switched from DEPLOYING to RUNNING.
2019-08-21 20:00:39,996 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map -> Sink: Print to Std. Out (1/4) (39d437bceb033f5bd37747e9142c7f0e) switched from DEPLOYING to RUNNING.
2019-08-21 20:00:39,997 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map -> Sink: Print to Std. Out (2/4) (aa4f64edc4f10da2e8bdbf6ebf0aeb83) switched from DEPLOYING to RUNNING.
2019-08-21 20:00:40,005 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source: Custom Source -> Flat Map -> Filter -> Filter -> Map -> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) -> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (4/4) (78f520582607e26b365fca483fc98d4c) switched from DEPLOYING to RUNNING.
2019-08-21 20:00:40,006 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source: Custom Source -> Flat Map -> Filter -> Filter -> Map -> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) -> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (3/4) (4756a0450881325af5fb396a18dbb8ae) switched from DEPLOYING to RUNNING.
2019-08-21 20:00:41,605 INFO  org.apache.flink.runtime.checkpoint.CheckpointCoordinator     - Triggering checkpoint 1 @ 1566388841597 for job e5ea83238d721e932b4645589660406c.
2019-08-21 20:00:42,466 INFO  org.apache.flink.runtime.checkpoint.CheckpointCoordinator     - Completed checkpoint 1 for job e5ea83238d721e932b4645589660406c (7994 bytes in 821 ms).



------------------ Original Message ------------------
From: "Yun Tang"<[hidden email]>;
Sent: Thursday, August 22, 2019, 11:08
To: "user-zh"<[hidden email]>;

Subject: Re: Flink startup waits 10 minutes



Hi

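A Flink on YARN job can take a long time to start for many reasons, for example waiting because resources are insufficient, or a container exiting right after it was requested. The default slot request timeout is 5 minutes, so my guess is that your job hit one slot request timeout and then requested the slot again. It would be best to provide the JobManager log so we can analyze this further.

Best regards,
Yun Tang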
________________________________
From: 々守护々 <[hidden email]>
Sent: Thursday, August 22, 2019 11:04
To: user-zh <[hidden email]>
Subject: Flink startup waits 10 minutes

Hello, Flink community experts!
      I am using Flink SQL (Flink 1.8.1), and I adjusted the memory parameters in yarn-site.xml on the Hadoop cluster:
        <property>
                <name>yarn.nodemanager.resource.memory-mb</name>
                <value>16384</value>
        </property>
        <property>
                <name>yarn.scheduler.minimum-allocation-mb</name>
                <value>1024</value>
        </property>
        <property>
                <name>yarn.scheduler.maximum-allocation-mb</name>
                <value>8192</value>
        </property>



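Then I started a Flink on YARN job with these launch parameters: ./flink-1.8.1/bin/flink run -m yarn-cluster -p 4 -d -c streaming.StreamKafkaMain -yn 4 -ys 2 -yjm 2048 -ytm 8192  -ynm bigdata6 ./flink_bigdata6.jar
It does eventually start successfully, but startup takes a 10-minute wait, which is too long. Does anyone have a solution?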
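
As a rough sanity check on the launch parameters above, the sketch below works out how many TaskManager containers the job needs and whether one of them fits the YARN limits from yarn-site.xml. It assumes TaskManagers are requested on demand as ceil(parallelism / slots per TaskManager), which is consistent with the two container requests for <memory:8192, vCores:2> in the JobManager log. The function names are hypothetical helpers, not part of Flink or YARN.

```python
import math

# Values taken from the yarn-site.xml and the launch command quoted above.
NODE_MEMORY_MB = 16384        # yarn.nodemanager.resource.memory-mb
MAX_ALLOCATION_MB = 8192      # yarn.scheduler.maximum-allocation-mb
PARALLELISM = 4               # -p 4
SLOTS_PER_TM = 2              # -ys 2
TM_MEMORY_MB = 8192           # -ytm 8192


def task_managers_needed(parallelism: int, slots_per_tm: int) -> int:
    """Assumption: one container per TaskManager, allocated on demand."""
    return math.ceil(parallelism / slots_per_tm)


def containers_per_node(node_memory_mb: int, container_mb: int) -> int:
    """Upper bound on same-sized containers one NodeManager can host."""
    return node_memory_mb // container_mb


tm_count = task_managers_needed(PARALLELISM, SLOTS_PER_TM)
fits_limit = TM_MEMORY_MB <= MAX_ALLOCATION_MB
per_node = containers_per_node(NODE_MEMORY_MB, TM_MEMORY_MB)

print(f"TaskManagers needed: {tm_count}")
print(f"Request within maximum-allocation-mb: {fits_limit}")
print(f"8192 MB containers per node (upper bound): {per_node}")
```

With these numbers the memory settings alone do not look like a blocker, so the delay is more likely elsewhere.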

Re: Re: Flink startup waits 10 minutes

tangjunliang@huitongjy.com
Did your cluster have enough resources at that time?



[hidden email]
 
From: 々守护々
Sent: 2019-08-22 11:32
To: user-zh
Subject: Re: Re: Flink startup waits 10 minutes
Here is what happens: at startup the client log simply stops, and the YARN application is only created after a 10-minute wait.


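------------------ Original Message ------------------
From: "Yun Tang"<[hidden email]>;
Sent: Thursday, August 22, 2019, 11:23
To: "user-zh"<[hidden email]>;
Subject: Re: Re: Flink startup waits 10 minutes

The startup time is 20:00:25, and by 20:00:42 all tasks are RUNNING and even the first checkpoint has completed. That is only 17 seconds in total, so where does the 10-minute problem come from?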
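
The 17-second figure can be reproduced from the log timestamps. This is a minimal sketch that assumes every log line starts with a 23-character timestamp in the format shown in the JobManager log; `parse_ts` and `elapsed_seconds` are hypothetical helper names, and the two sample lines are shortened copies of the first ClusterEntrypoint line and the "Completed checkpoint 1" line.

```python
from datetime import datetime

# Timestamp layout used in the JobManager log, e.g. "2019-08-21 20:00:25,428".
LOG_TS_FORMAT = "%Y-%m-%d %H:%M:%S,%f"
TS_LENGTH = 23


def parse_ts(line: str) -> datetime:
    """Parse the leading timestamp of a log line."""
    return datetime.strptime(line[:TS_LENGTH], LOG_TS_FORMAT)


def elapsed_seconds(first_line: str, last_line: str) -> float:
    """Seconds between the timestamps of two log lines."""
    return (parse_ts(last_line) - parse_ts(first_line)).total_seconds()


start = "2019-08-21 20:00:25,428 INFO  ClusterEntrypoint - ..."
done = "2019-08-21 20:00:42,466 INFO  CheckpointCoordinator - Completed checkpoint 1 ..."

print(f"JobManager start to first completed checkpoint: {elapsed_seconds(start, done):.3f} s")
```

Since the JobManager log only begins once the YARN application master is running, any time spent before that point would have to be measured from the client-side log instead.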
________________________________
From: 々守护々 <[hidden email]>
Sent: Thursday, August 22, 2019 11:18
To: user-zh <[hidden email]>
Subject: Re: Flink startup waits 10 minutes

Hello, this is my JobManager startup log. Please take a look, thanks!


2019-08-21 20:00:25,428 INFO  org.apache.flink.runtime.entrypoint.ClusterEntrypoint         - --------------------------------------------------------------------------------
2019-08-21 20:00:25,430 INFO  org.apache.flink.runtime.entrypoint.ClusterEntrypoint         - Registered UNIX signal handlers for [TERM, HUP, INT]
2019-08-21 20:00:25,433 INFO  org.apache.flink.runtime.entrypoint.ClusterEntrypoint         - YARN daemon is running as: root Yarn client user obtainer: root
2019-08-21 20:00:25,437 INFO  org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: high-availability.cluster-id, application_1566383236573_0003
2019-08-21 20:00:25,437 INFO  org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: jobmanager.rpc.address, localhost
2019-08-21 20:00:25,437 INFO  org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: jobmanager.rpc.port, 6123
2019-08-21 20:00:25,437 INFO  org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: high-availability.zookeeper.path.root, /flink-yarn
2019-08-21 20:00:25,437 INFO  org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: high-availability.storageDir, hdfs://host51:9000/flink/ha-yarn
2019-08-21 20:00:25,438 INFO  org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: high-availability.zookeeper.quorum, host51:2181
2019-08-21 20:00:25,438 INFO  org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: internal.cluster.execution-mode, DETACHED
2019-08-21 20:00:25,438 INFO  org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: high-availability, zookeeper
2019-08-21 20:00:25,438 INFO  org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: parallelism.default, 1
2019-08-21 20:00:25,438 INFO  org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: yarn.application-attempts, 10
2019-08-21 20:00:25,438 INFO  org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: taskmanager.numberOfTaskSlots, 2
2019-08-21 20:00:25,439 INFO  org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: taskmanager.heap.size, 8192m
2019-08-21 20:00:25,439 INFO  org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: jobmanager.heap.size, 2048m
2019-08-21 20:00:25,465 WARN  org.apache.flink.configuration.Configuration                  - Config uses deprecated configuration key 'web.port' instead of proper key 'rest.bind-port'
2019-08-21 20:00:25,469 INFO  org.apache.flink.runtime.clusterframework.BootstrapTools      - Setting directories for temporary files to: /usr/hadoop/hadoop-2.7.7/tmp/nm-local-dir/usercache/root/appcache/application_1566383236573_0003
2019-08-21 20:00:25,485 INFO  org.apache.flink.runtime.entrypoint.ClusterEntrypoint         - Starting YarnJobClusterEntrypoint.
2019-08-21 20:00:25,485 INFO  org.apache.flink.runtime.entrypoint.ClusterEntrypoint         - Install default filesystem.
2019-08-21 20:00:25,561 INFO  org.apache.flink.runtime.security.modules.HadoopModule        - Hadoop user set to root (auth:SIMPLE)
2019-08-21 20:00:25,580 INFO  org.apache.flink.runtime.entrypoint.ClusterEntrypoint         - Initializing cluster services.
2019-08-21 20:00:25,757 INFO  org.apache.flink.runtime.rpc.akka.AkkaRpcServiceUtils         - Trying to start actor system at host53:0
2019-08-21 20:00:26,187 INFO  akka.event.slf4j.Slf4jLogger                                  - Slf4jLogger started
2019-08-21 20:00:26,251 INFO  akka.remote.Remoting                                          - Starting remoting
2019-08-21 20:00:26,439 INFO  akka.remote.Remoting                                          - Remoting started; listening on addresses :[akka.tcp://flink@host53:59609]
2019-08-21 20:00:26,447 INFO  org.apache.flink.runtime.rpc.akka.AkkaRpcServiceUtils         - Actor system started at akka.tcp://flink@host53:59609
2019-08-21 20:00:27,011 INFO  org.apache.flink.runtime.blob.FileSystemBlobStore             - Creating highly available BLOB storage directory at hdfs://host51:9000/flink/ha-yarn/application_1566383236573_0003/blob
2019-08-21 20:00:27,097 INFO  org.apache.flink.runtime.util.ZooKeeperUtils                  - Enforcing default ACL for ZK connections
2019-08-21 20:00:27,097 INFO  org.apache.flink.runtime.util.ZooKeeperUtils                  - Using '/flink-yarn/application_1566383236573_0003' as Zookeeper namespace.
2019-08-21 20:00:27,161 INFO  org.apache.flink.shaded.curator.org.apache.curator.framework.imps.CuratorFrameworkImpl  - Starting
2019-08-21 20:00:27,169 INFO  org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client environment:zookeeper.version=3.4.10-39d3a4f269333c922ed3db283be479f9deacaa0f, built on 03/23/2017 10:13 GMT
2019-08-21 20:00:27,169 INFO  org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client environment:host.name=host53
2019-08-21 20:00:27,169 INFO  org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client environment:java.version=1.8.0_77
2019-08-21 20:00:27,169 INFO  org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client environment:java.vendor=Oracle Corporation
2019-08-21 20:00:27,169 INFO  org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client
2019-08-21 20:00:27,170 INFO  org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client environment:java.library.path=:/usr/hadoop/hadoop-2.7.7/lib/native:/usr/hadoop/hadoop-2.7.7/lib/native:/usr/java/packages/lib/amd64:/usr/lib64:/lib64:/lib:/usr/lib
2019-08-21 20:00:27,170 INFO  org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client environment:java.io.tmpdir=/tmp
2019-08-21 20:00:27,170 INFO  org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client environment:java.compiler=<NA>
2019-08-21 20:00:27,170 INFO  org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client environment:os.name=Linux
2019-08-21 20:00:27,170 INFO  org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client environment:os.arch=amd64
2019-08-21 20:00:27,170 INFO  org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client environment:os.version=3.16.0-0.bpo.4-amd64
2019-08-21 20:00:27,170 INFO  org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client environment:user.name=root
2019-08-21 20:00:27,170 INFO  org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client environment:user.home=/root
2019-08-21 20:00:27,170 INFO  org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client environment:user.dir=/usr/hadoop/hadoop-2.7.7/tmp/nm-local-dir/usercache/root/appcache/application_1566383236573_0003/container_1566383236573_0003_01_000001
2019-08-21 20:00:27,171 INFO  org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Initiating client connection, connectString=host51:2181 sessionTimeout=60000 watcher=org.apache.flink.shaded.curator.org.apache.curator.ConnectionState@319854f0
2019-08-21 20:00:27,184 WARN  org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ClientCnxn  - SASL configuration failed: javax.security.auth.login.LoginException: No JAAS configuration section named 'Client' was found in specified JAAS configuration file: '/tmp/jaas-4274640356638794035.conf'. Will continue connection to Zookeeper server without SASL authentication, if Zookeeper server allows it.
2019-08-21 20:00:27,186 INFO  org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ClientCnxn  - Opening socket connection to server host51/10.10.10.51:2181
2019-08-21 20:00:27,187 ERROR org.apache.flink.shaded.curator.org.apache.curator.ConnectionState  - Authentication failed
2019-08-21 20:00:27,187 INFO  org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ClientCnxn  - Socket connection established to host51/10.10.10.51:2181, initiating session
2019-08-21 20:00:27,193 INFO  org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ClientCnxn  - Session establishment complete on server host51/10.10.10.51:2181, sessionid = 0x1001a86de810008, negotiated timeout = 40000
2019-08-21 20:00:27,193 INFO  org.apache.flink.runtime.blob.BlobServer                      - Created BLOB server storage directory /usr/hadoop/hadoop-2.7.7/tmp/nm-local-dir/usercache/root/appcache/application_1566383236573_0003/blobStore-eaa2bb2b-cddb-42e3-8ce5-bf2f83e8567a
2019-08-21 20:00:27,195 INFO  org.apache.flink.shaded.curator.org.apache.curator.framework.state.ConnectionStateManager  - State change: CONNECTED
2019-08-21 20:00:27,198 INFO  org.apache.flink.runtime.blob.BlobServer                      - Started BLOB server at 0.0.0.0:58591 - max concurrent requests: 50 - max backlog: 1000
2019-08-21 20:00:27,213 INFO  org.apache.flink.runtime.metrics.MetricRegistryImpl           - No metrics reporter configured, no metrics will be exposed/reported.
2019-08-21 20:00:27,215 INFO  org.apache.flink.runtime.entrypoint.ClusterEntrypoint         - Trying to start actor system at host53:0
2019-08-21 20:00:27,239 INFO  akka.event.slf4j.Slf4jLogger                                  - Slf4jLogger started
2019-08-21 20:00:27,247 INFO  akka.remote.Remoting                                          - Starting remoting
2019-08-21 20:00:27,255 INFO  akka.remote.Remoting                                          - Remoting started; listening on addresses :[akka.tcp://flink-metrics@host53:59941]
2019-08-21 20:00:27,257 INFO  org.apache.flink.runtime.entrypoint.ClusterEntrypoint         - Actor system started at akka.tcp://flink-metrics@host53:59941
2019-08-21 20:00:27,263 INFO  org.apache.flink.runtime.blob.TransientBlobCache              - Created BLOB cache storage directory /usr/hadoop/hadoop-2.7.7/tmp/nm-local-dir/usercache/root/appcache/application_1566383236573_0003/blobStore-405a4797-824f-48da-bb50-2ee2e230d249
2019-08-21 20:00:27,296 WARN  org.apache.flink.configuration.Configuration                  - Config uses deprecated configuration key 'web.port' instead of proper key 'rest.bind-port'
2019-08-21 20:00:27,298 WARN  org.apache.flink.runtime.jobmaster.MiniDispatcherRestEndpoint  - Upload directory /tmp/flink-web-57c9cddd-64af-4e3d-96a9-4f9e78d20b86/flink-web-upload does not exist, or has been deleted externally. Previously uploaded files are no longer available.
2019-08-21 20:00:27,298 INFO  org.apache.flink.runtime.jobmaster.MiniDispatcherRestEndpoint  - Created directory /tmp/flink-web-57c9cddd-64af-4e3d-96a9-4f9e78d20b86/flink-web-upload for file uploads.
2019-08-21 20:00:27,320 INFO  org.apache.flink.runtime.jobmaster.MiniDispatcherRestEndpoint  - Starting rest endpoint.
2019-08-21 20:00:27,616 INFO  org.apache.flink.runtime.webmonitor.WebMonitorUtils           - Determined location of main cluster component log file: /usr/hadoop/hadoop-2.7.7/logs/userlogs/application_1566383236573_0003/container_1566383236573_0003_01_000001/jobmanager.log
2019-08-21 20:00:27,616 INFO  org.apache.flink.runtime.webmonitor.WebMonitorUtils           - Determined location of main cluster component stdout file: /usr/hadoop/hadoop-2.7.7/logs/userlogs/application_1566383236573_0003/container_1566383236573_0003_01_000001/jobmanager.out
2019-08-21 20:00:27,771 INFO  org.apache.flink.runtime.jobmaster.MiniDispatcherRestEndpoint  - Rest endpoint listening at host53:46401
2019-08-21 20:00:27,772 INFO  org.apache.flink.runtime.leaderelection.ZooKeeperLeaderElectionService  - Starting ZooKeeperLeaderElectionService ZooKeeperLeaderElectionService{leaderPath='/leader/rest_server_lock'}.
2019-08-21 20:00:27,795 WARN  org.apache.flink.shaded.curator.org.apache.curator.utils.ZKPaths  - The version of ZooKeeper being used doesn't support Container nodes. CreateMode.PERSISTENT will be used instead.
2019-08-21 20:00:27,807 INFO  org.apache.flink.runtime.jobmaster.MiniDispatcherRestEndpoint  - Web frontend listening at http://host53:46401.
2019-08-21 20:00:27,836 INFO  org.apache.flink.runtime.jobmaster.MiniDispatcherRestEndpoint  - http://host53:46401 was granted leadership with leaderSessionID=8487757f-fd82-42b2-a1a6-adcc69e1eba1
2019-08-21 20:00:27,932 INFO  org.apache.flink.runtime.rpc.akka.AkkaRpcService              - Starting RPC endpoint for org.apache.flink.yarn.YarnResourceManager at akka://flink/user/resourcemanager .
2019-08-21 20:00:28,021 INFO  org.apache.flink.runtime.rpc.akka.AkkaRpcService              - Starting RPC endpoint for org.apache.flink.runtime.dispatcher.MiniDispatcher at akka://flink/user/dispatcher .
2019-08-21 20:00:28,033 INFO  org.apache.flink.runtime.leaderretrieval.ZooKeeperLeaderRetrievalService  - Starting ZooKeeperLeaderRetrievalService /leader/resource_manager_lock.
2019-08-21 20:00:28,033 INFO  org.apache.flink.runtime.leaderretrieval.ZooKeeperLeaderRetrievalService  - Starting ZooKeeperLeaderRetrievalService /leader/dispatcher_lock.
2019-08-21 20:00:28,033 INFO  org.apache.flink.runtime.leaderelection.ZooKeeperLeaderElectionService  - Starting ZooKeeperLeaderElectionService ZooKeeperLeaderElectionService{leaderPath='/leader/dispatcher_lock'}.
2019-08-21 20:00:28,045 INFO  org.apache.flink.runtime.dispatcher.MiniDispatcher            - Dispatcher akka.tcp://flink@host53:59609/user/dispatcher was granted leadership with fencing token 6216d17e-56b1-43bb-9bb3-f9fa5b8921be
2019-08-21 20:00:28,048 INFO  org.apache.flink.runtime.dispatcher.MiniDispatcher            - Recovering all persisted jobs.
2019-08-21 20:00:28,054 INFO  org.apache.hadoop.yarn.client.RMProxy                         - Connecting to ResourceManager at host51/10.10.10.51:8030
2019-08-21 20:00:28,078 INFO  org.apache.flink.runtime.rpc.akka.AkkaRpcService              - Starting RPC endpoint for org.apache.flink.runtime.jobmaster.JobMaster at akka://flink/user/jobmanager_0 .
2019-08-21 20:00:28,086 INFO  org.apache.flink.runtime.jobmaster.JobMaster                  - Initializing job StreamKafkaMain (e5ea83238d721e932b4645589660406c).
2019-08-21 20:00:28,099 INFO  org.apache.flink.runtime.jobmaster.JobMaster                  - Using restart strategy FixedDelayRestartStrategy(maxNumberRestartAttempts=2147483647, delayBetweenRestartAttempts=0) for StreamKafkaMain (e5ea83238d721e932b4645589660406c).
2019-08-21 20:00:28,122 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Job recovers via failover strategy: full graph restart
2019-08-21 20:00:28,153 INFO  org.apache.flink.runtime.jobmaster.JobMaster                  - Running initialization on master for job StreamKafkaMain (e5ea83238d721e932b4645589660406c).
2019-08-21 20:00:28,154 INFO  org.apache.flink.runtime.jobmaster.JobMaster                  - Successfully ran initialization on master in 0 ms.
2019-08-21 20:00:28,187 INFO  org.apache.flink.runtime.util.ZooKeeperUtils                  - Initialized ZooKeeperCompletedCheckpointStore in '/checkpoints/e5ea83238d721e932b4645589660406c'.
2019-08-21 20:00:28,196 INFO  org.apache.flink.yarn.YarnResourceManager                     - Recovered 0 containers from previous attempts ([]).
2019-08-21 20:00:28,199 INFO  org.apache.hadoop.yarn.client.api.impl.ContainerManagementProtocolProxy  - yarn.client.max-cached-nodemanagers-proxies : 0
2019-08-21 20:00:28,200 INFO  org.apache.flink.runtime.leaderelection.ZooKeeperLeaderElectionService  - Starting ZooKeeperLeaderElectionService ZooKeeperLeaderElectionService{leaderPath='/leader/resource_manager_lock'}.
2019-08-21 20:00:28,203 INFO  org.apache.flink.runtime.jobmaster.JobMaster                  - No state backend has been configured, using default (Memory / JobManager) MemoryStateBackend (data in heap memory / checkpoints to JobManager) (checkpoints: 'null', savepoints: 'null', asynchronous: TRUE, maxStateSize: 5242880)
2019-08-21 20:00:28,211 INFO  org.apache.flink.yarn.YarnResourceManager                     - ResourceManager akka.tcp://flink@host53:59609/user/resourcemanager was granted leadership with fencing token 97537c0d61bbf091e37cb6c45b784568
2019-08-21 20:00:28,214 INFO  org.apache.flink.runtime.resourcemanager.slotmanager.SlotManager  - Starting the SlotManager.
2019-08-21 20:00:28,225 INFO  org.apache.flink.runtime.checkpoint.ZooKeeperCompletedCheckpointStore  - Recovering checkpoints from ZooKeeper.
2019-08-21 20:00:28,229 INFO  org.apache.flink.runtime.checkpoint.ZooKeeperCompletedCheckpointStore  - Found 0 checkpoints in ZooKeeper.
2019-08-21 20:00:28,229 INFO  org.apache.flink.runtime.checkpoint.ZooKeeperCompletedCheckpointStore  - Trying to fetch 0 checkpoints from storage.
2019-08-21 20:00:28,233 INFO  org.apache.flink.runtime.leaderelection.ZooKeeperLeaderElectionService  - Starting ZooKeeperLeaderElectionService ZooKeeperLeaderElectionService{leaderPath='/leader/e5ea83238d721e932b4645589660406c/job_manager_lock'}.
2019-08-21 20:00:28,248 INFO  org.apache.flink.runtime.jobmaster.JobManagerRunner           - JobManager runner for job StreamKafkaMain (e5ea83238d721e932b4645589660406c) was granted leadership with session id 12c96c94-ebbd-4991-a457-6f0320d71bc1 at akka.tcp://flink@host53:59609/user/jobmanager_0.
2019-08-21 20:00:28,260 INFO  org.apache.flink.runtime.leaderretrieval.ZooKeeperLeaderRetrievalService  - Starting ZooKeeperLeaderRetrievalService /leader/resource_manager_lock.
2019-08-21 20:00:28,260 INFO  org.apache.flink.runtime.jobmaster.JobMaster                  - Starting execution of job StreamKafkaMain (e5ea83238d721e932b4645589660406c) under job master id a4576f0320d71bc112c96c94ebbd4991.
2019-08-21 20:00:28,261 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Job StreamKafkaMain (e5ea83238d721e932b4645589660406c) switched from state CREATED to RUNNING.
2019-08-21 20:00:28,266 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source: Custom Source -> Flat Map -> Filter -> Filter -> Map -> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) -> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (1/4) (ee39866835f7b7387f518e352787a532) switched from CREATED to SCHEDULED.
2019-08-21 20:00:28,280 INFO  org.apache.flink.runtime.jobmaster.slotpool.SlotPoolImpl      - Cannot serve slot request, no ResourceManager connected. Adding as pending request [SlotRequestId{f6123820ffb8b3e906b876e9dea4497b}]
2019-08-21 20:00:28,287 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source: Custom Source -> Flat Map -> Filter -> Filter -> Map -> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) -> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (2/4) (f1a893fc5e2440331c8e6db1874ad220) switched from CREATED to SCHEDULED.
2019-08-21 20:00:28,288 INFO  org.apache.flink.runtime.jobmaster.slotpool.SlotPoolImpl      - Cannot serve slot request, no ResourceManager connected. Adding as pending request [SlotRequestId{fdbb2e71c9bc795452687396abff7bf8}]
2019-08-21 20:00:28,288 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source: Custom Source -> Flat Map -> Filter -> Filter -> Map -> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) -> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (3/4) (4756a0450881325af5fb396a18dbb8ae) switched from CREATED to SCHEDULED.
2019-08-21 20:00:28,289 INFO  org.apache.flink.runtime.jobmaster.slotpool.SlotPoolImpl      - Cannot serve slot request, no ResourceManager connected. Adding as pending request [SlotRequestId{13970fab9312acf9d5063b1d1a33aa0e}]
2019-08-21 20:00:28,289 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source: Custom Source -> Flat Map -> Filter -> Filter -> Map -> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) -> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (4/4) (78f520582607e26b365fca483fc98d4c) switched from CREATED to SCHEDULED.
2019-08-21 20:00:28,289 INFO  org.apache.flink.runtime.jobmaster.slotpool.SlotPoolImpl      - Cannot serve slot request, no ResourceManager connected. Adding as pending request [SlotRequestId{0621470f796aa3ea4291d1b8b423e428}]
2019-08-21 20:00:28,289 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - window: (TumblingGroupWindow('w$, 'rowtime, 60000.millis)), select: (COUNT(*) AS pv, COUNT(DISTINCT curuserid) AS uv, start('w$) AS w$start, end('w$) AS w$end, rowtime('w$) AS w$rowtime, proctime('w$) AS w$proctime) -> select: (pv, uv, utc2local(w$end) AS EXPR$2) -> to: Row (1/1) (0071e1c7c4f70646914329f81dcbd349) switched from CREATED to SCHEDULED.
2019-08-21 20:00:28,291 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print to Std. Out (1/4) (0a206444b10553cce42fa34ea0b15497) switched from CREATED to SCHEDULED.
2019-08-21 20:00:28,291 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print to Std. Out (2/4) (a5d1bb759ee6c8001313fe39c7982a8b) switched from CREATED to SCHEDULED.
2019-08-21 20:00:28,291 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print to Std. Out (3/4) (9543bbde6538ea9d407af3012fede5c8) switched from CREATED to SCHEDULED.
2019-08-21 20:00:28,291 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print to Std. Out (4/4) (0e8c4210bd81e581fbf354b86397ce41) switched from CREATED to SCHEDULED.
2019-08-21 20:00:28,291 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map -> Sink: Print to Std. Out (1/4) (39d437bceb033f5bd37747e9142c7f0e) switched from CREATED to SCHEDULED.
2019-08-21 20:00:28,291 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map -> Sink: Print to Std. Out (2/4) (aa4f64edc4f10da2e8bdbf6ebf0aeb83) switched from CREATED to SCHEDULED.
2019-08-21 20:00:28,291 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map -> Sink: Print to Std. Out (3/4) (f109918ce195d1c88c2c63a44934217f) switched from CREATED to SCHEDULED.
2019-08-21 20:00:28,292 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map -> Sink: Print to Std. Out (4/4) (9aa5bb28f3ba75afc1de47bd9a4e5d01) switched from CREATED to SCHEDULED.
2019-08-21 20:00:28,295 INFO  org.apache.flink.runtime.jobmaster.JobMaster                  - Connecting to ResourceManager akka.tcp://flink@host53:59609/user/resourcemanager(97537c0d61bbf091e37cb6c45b784568)
2019-08-21 20:00:28,300 INFO  org.apache.flink.runtime.jobmaster.JobMaster                  - Resolved ResourceManager address, beginning registration
2019-08-21 20:00:28,300 INFO  org.apache.flink.runtime.jobmaster.JobMaster                  - Registration at ResourceManager attempt 1 (timeout=100ms)
2019-08-21 20:00:28,304 INFO  org.apache.flink.runtime.leaderretrieval.ZooKeeperLeaderRetrievalService  - Starting ZooKeeperLeaderRetrievalService /leader/e5ea83238d721e932b4645589660406c/job_manager_lock.
2019-08-21 20:00:28,304 INFO  org.apache.flink.yarn.YarnResourceManager                     - Registering job manager a4576f0320d71bc112c96c94ebbd4991@akka.tcp://flink@host53:59609/user/jobmanager_0 for job e5ea83238d721e932b4645589660406c.
2019-08-21 20:00:28,310 INFO  org.apache.flink.yarn.YarnResourceManager                     - Registered job manager a4576f0320d71bc112c96c94ebbd4991@akka.tcp://flink@host53:59609/user/jobmanager_0 for job e5ea83238d721e932b4645589660406c.
2019-08-21 20:00:28,313 INFO  org.apache.flink.runtime.jobmaster.JobMaster                  - JobManager successfully registered at ResourceManager, leader id: 97537c0d61bbf091e37cb6c45b784568.
2019-08-21 20:00:28,314 INFO  org.apache.flink.runtime.jobmaster.slotpool.SlotPoolImpl      - Requesting new slot [SlotRequestId{13970fab9312acf9d5063b1d1a33aa0e}] and profile ResourceProfile{cpuCores=-1.0, heapMemoryInMB=-1, directMemoryInMB=0, nativeMemoryInMB=0, networkMemoryInMB=0} from resource manager.
2019-08-21 20:00:28,315 INFO  org.apache.flink.yarn.YarnResourceManager                     - Request slot with profile ResourceProfile{cpuCores=-1.0, heapMemoryInMB=-1, directMemoryInMB=0, nativeMemoryInMB=0, networkMemoryInMB=0} for job e5ea83238d721e932b4645589660406c with allocation id faa6a521758e5be4b3dc1955ca0f51b4.
2019-08-21 20:00:28,316 INFO  org.apache.flink.runtime.jobmaster.slotpool.SlotPoolImpl      - Requesting new slot [SlotRequestId{fdbb2e71c9bc795452687396abff7bf8}] and profile ResourceProfile{cpuCores=-1.0, heapMemoryInMB=-1, directMemoryInMB=0, nativeMemoryInMB=0, networkMemoryInMB=0} from resource manager.
2019-08-21 20:00:28,316 INFO  org.apache.flink.runtime.jobmaster.slotpool.SlotPoolImpl      - Requesting new slot [SlotRequestId{0621470f796aa3ea4291d1b8b423e428}] and profile ResourceProfile{cpuCores=-1.0, heapMemoryInMB=-1, directMemoryInMB=0, nativeMemoryInMB=0, networkMemoryInMB=0} from resource manager.
2019-08-21 20:00:28,316 INFO  org.apache.flink.runtime.jobmaster.slotpool.SlotPoolImpl      - Requesting new slot [SlotRequestId{f6123820ffb8b3e906b876e9dea4497b}] and profile ResourceProfile{cpuCores=-1.0, heapMemoryInMB=-1, directMemoryInMB=0, nativeMemoryInMB=0, networkMemoryInMB=0} from resource manager.
2019-08-21 20:00:28,321 INFO  org.apache.flink.yarn.YarnResourceManager                     - Requesting new TaskExecutor container with resources <memory:8192, vCores:2>. Number pending requests 1.
2019-08-21 20:00:28,322 INFO  org.apache.flink.yarn.YarnResourceManager                     - Request slot with profile ResourceProfile{cpuCores=-1.0, heapMemoryInMB=-1, directMemoryInMB=0, nativeMemoryInMB=0, networkMemoryInMB=0} for job e5ea83238d721e932b4645589660406c with allocation id a2b419aea4a373686eeed6c46163a528.
2019-08-21 20:00:28,323 INFO  org.apache.flink.yarn.YarnResourceManager                     - Request slot with profile ResourceProfile{cpuCores=-1.0, heapMemoryInMB=-1, directMemoryInMB=0, nativeMemoryInMB=0, networkMemoryInMB=0} for job e5ea83238d721e932b4645589660406c with allocation id cbc11737ce6729cfe45f24c43180b5c8.
2019-08-21 20:00:28,323 INFO  org.apache.flink.yarn.YarnResourceManager                     - Requesting new TaskExecutor container with resources <memory:8192, vCores:2>. Number pending requests 2.
2019-08-21 20:00:28,323 INFO  org.apache.flink.yarn.YarnResourceManager                     - Request slot with profile ResourceProfile{cpuCores=-1.0, heapMemoryInMB=-1, directMemoryInMB=0, nativeMemoryInMB=0, networkMemoryInMB=0} for job e5ea83238d721e932b4645589660406c with allocation id f6566567c173a55b2bc52b72bcb24ff8.
2019-08-21 20:00:31,597 INFO  org.apache.flink.runtime.checkpoint.CheckpointCoordinator     - Checkpoint triggering task Source: Custom Source -> Flat Map -> Filter -> Filter -> Map -> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) -> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (1/4) of job e5ea83238d721e932b4645589660406c is not in state RUNNING but SCHEDULED instead. Aborting checkpoint.
2019-08-21 20:00:33,749 INFO  org.apache.hadoop.yarn.client.api.impl.AMRMClientImpl         - Received new token for : host53:59938
2019-08-21 20:00:33,749 INFO  org.apache.hadoop.yarn.client.api.impl.AMRMClientImpl         - Received new token for : host52:33246
2019-08-21 20:00:33,765 INFO  org.apache.flink.yarn.YarnResourceManager                     - Received new container: container_1566383236573_0003_01_000002 - Remaining pending container requests: 2
2019-08-21 20:00:33,765 INFO  org.apache.flink.yarn.YarnResourceManager                     - Removing container request Capability[<memory:8192, vCores:2>]Priority[1]. Pending container requests 1.
2019-08-21 20:00:34,009 INFO  org.apache.flink.yarn.YarnResourceManager                     - Creating container launch context for TaskManagers
2019-08-21 20:00:34,010 INFO  org.apache.flink.yarn.YarnResourceManager                     - Starting TaskManagers
2019-08-21 20:00:34,021 INFO  org.apache.hadoop.yarn.client.api.impl.ContainerManagementProtocolProxy  - Opening proxy : host53:59938
2019-08-21 20:00:34,066 INFO  org.apache.flink.yarn.YarnResourceManager                     - Received new container: container_1566383236573_0003_01_000003 - Remaining pending container requests: 1
2019-08-21 20:00:34,066 INFO  org.apache.flink.yarn.YarnResourceManager                     - Removing container request Capability[<memory:8192, vCores:2>]Priority[1]. Pending container requests 0.
2019-08-21 20:00:34,088 INFO  org.apache.flink.yarn.YarnResourceManager                     - Creating container launch context for TaskManagers
2019-08-21 20:00:34,089 INFO  org.apache.flink.yarn.YarnResourceManager                     - Starting TaskManagers
2019-08-21 20:00:34,090 INFO  org.apache.hadoop.yarn.client.api.impl.ContainerManagementProtocolProxy  - Opening proxy : host52:33246
2019-08-21 20:00:36,597 INFO  org.apache.flink.runtime.checkpoint.CheckpointCoordinator     - Checkpoint triggering task Source: Custom Source -> Flat Map -> Filter -> Filter -> Map -> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) -> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (1/4) of job e5ea83238d721e932b4645589660406c is not in state RUNNING but SCHEDULED instead. Aborting checkpoint.
2019-08-21 20:00:37,856 INFO  org.apache.flink.yarn.YarnResourceManager                     - Registering TaskManager with ResourceID container_1566383236573_0003_01_000002 (akka.tcp://flink@host53:34608/user/taskmanager_0) at ResourceManager
2019-08-21 20:00:39,573 INFO  org.apache.flink.yarn.YarnResourceManager                     - Registering TaskManager with ResourceID container_1566383236573_0003_01_000003 (akka.tcp://flink@host52:32931/user/taskmanager_0) at ResourceManager
2019-08-21 20:00:39,659 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source: Custom Source -> Flat Map -> Filter -> Filter -> Map -> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) -> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (1/4) (ee39866835f7b7387f518e352787a532) switched from SCHEDULED to DEPLOYING.
2019-08-21 20:00:39,659 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying Source: Custom Source -> Flat Map -> Filter -> Filter -> Map -> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) -> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (1/4) (attempt #0) to container_1566383236573_0003_01_000002 @ host53 (dataPort=52456)
2019-08-21 20:00:39,664 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source: Custom Source -> Flat Map -> Filter -> Filter -> Map -> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) -> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (2/4) (f1a893fc5e2440331c8e6db1874ad220) switched from SCHEDULED to DEPLOYING.
2019-08-21 20:00:39,664 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying Source: Custom Source -> Flat Map -> Filter -> Filter -> Map -> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) -> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (2/4) (attempt #0) to container_1566383236573_0003_01_000002 @ host53 (dataPort=52456)
2019-08-21 20:00:39,665 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source: Custom Source -> Flat Map -> Filter -> Filter -> Map -> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) -> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (3/4) (4756a0450881325af5fb396a18dbb8ae) switched from SCHEDULED to DEPLOYING.
2019-08-21 20:00:39,666 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying Source: Custom Source -> Flat Map -> Filter -> Filter -> Map -> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) -> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (3/4) (attempt #0) to container_1566383236573_0003_01_000003 @ host52 (dataPort=39650)
2019-08-21 20:00:39,666 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source: Custom Source -> Flat Map -> Filter -> Filter -> Map -> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) -> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (4/4) (78f520582607e26b365fca483fc98d4c) switched from SCHEDULED to DEPLOYING.
2019-08-21 20:00:39,666 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying Source: Custom Source -> Flat Map -> Filter -> Filter -> Map -> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) -> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (4/4) (attempt #0) to container_1566383236573_0003_01_000003 @ host52 (dataPort=39650)
2019-08-21 20:00:39,666 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - window: (TumblingGroupWindow('w$, 'rowtime, 60000.millis)), select: (COUNT(*) AS pv, COUNT(DISTINCT curuserid) AS uv, start('w$) AS w$start, end('w$) AS w$end, rowtime('w$) AS w$rowtime, proctime('w$) AS w$proctime) -> select: (pv, uv, utc2local(w$end) AS EXPR$2) -> to: Row (1/1) (0071e1c7c4f70646914329f81dcbd349) switched from SCHEDULED to DEPLOYING.
2019-08-21 20:00:39,666 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying window: (TumblingGroupWindow('w$, 'rowtime, 60000.millis)), select: (COUNT(*) AS pv, COUNT(DISTINCT curuserid) AS uv, start('w$) AS w$start, end('w$) AS w$end, rowtime('w$) AS w$rowtime, proctime('w$) AS w$proctime) -> select: (pv, uv, utc2local(w$end) AS EXPR$2) -> to: Row (1/1) (attempt #0) to container_1566383236573_0003_01_000002 @ host53 (dataPort=52456)
2019-08-21 20:00:39,670 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print to Std. Out (1/4) (0a206444b10553cce42fa34ea0b15497) switched from SCHEDULED to DEPLOYING.
2019-08-21 20:00:39,670 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying Sink: Print to Std. Out (1/4) (attempt #0) to container_1566383236573_0003_01_000003 @ host52 (dataPort=39650)
2019-08-21 20:00:39,671 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print to Std. Out (2/4) (a5d1bb759ee6c8001313fe39c7982a8b) switched from SCHEDULED to DEPLOYING.
2019-08-21 20:00:39,672 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying Sink: Print to Std. Out (2/4) (attempt #0) to container_1566383236573_0003_01_000003 @ host52 (dataPort=39650)
2019-08-21 20:00:39,672 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print to Std. Out (3/4) (9543bbde6538ea9d407af3012fede5c8) switched from SCHEDULED to DEPLOYING.
2019-08-21 20:00:39,672 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying Sink: Print to Std. Out (3/4) (attempt #0) to container_1566383236573_0003_01_000002 @ host53 (dataPort=52456)
2019-08-21 20:00:39,672 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print to Std. Out (4/4) (0e8c4210bd81e581fbf354b86397ce41) switched from SCHEDULED to DEPLOYING.
2019-08-21 20:00:39,673 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying Sink: Print to Std. Out (4/4) (attempt #0) to container_1566383236573_0003_01_000002 @ host53 (dataPort=52456)
2019-08-21 20:00:39,673 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map -> Sink: Print to Std. Out (1/4) (39d437bceb033f5bd37747e9142c7f0e) switched from SCHEDULED to DEPLOYING.
2019-08-21 20:00:39,673 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying Map -> Sink: Print to Std. Out (1/4) (attempt #0) to container_1566383236573_0003_01_000003 @ host52 (dataPort=39650)
2019-08-21 20:00:39,675 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map -> Sink: Print to Std. Out (2/4) (aa4f64edc4f10da2e8bdbf6ebf0aeb83) switched from SCHEDULED to DEPLOYING.
2019-08-21 20:00:39,675 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying Map -> Sink: Print to Std. Out (2/4) (attempt #0) to container_1566383236573_0003_01_000003 @ host52 (dataPort=39650)
2019-08-21 20:00:39,675 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map -> Sink: Print to Std. Out (3/4) (f109918ce195d1c88c2c63a44934217f) switched from SCHEDULED to DEPLOYING.
2019-08-21 20:00:39,675 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying Map -> Sink: Print to Std. Out (3/4) (attempt #0) to container_1566383236573_0003_01_000002 @ host53 (dataPort=52456)
2019-08-21 20:00:39,676 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map -> Sink: Print to Std. Out (4/4) (9aa5bb28f3ba75afc1de47bd9a4e5d01) switched from SCHEDULED to DEPLOYING.
2019-08-21 20:00:39,676 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying Map -> Sink: Print to Std. Out (4/4) (attempt #0) to container_1566383236573_0003_01_000002 @ host53 (dataPort=52456)
2019-08-21 20:00:39,782 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print to Std. Out (4/4) (0e8c4210bd81e581fbf354b86397ce41) switched from DEPLOYING to RUNNING.
2019-08-21 20:00:39,783 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print to Std. Out (3/4) (9543bbde6538ea9d407af3012fede5c8) switched from DEPLOYING to RUNNING.
2019-08-21 20:00:39,784 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map -> Sink: Print to Std. Out (4/4) (9aa5bb28f3ba75afc1de47bd9a4e5d01) switched from DEPLOYING to RUNNING.
2019-08-21 20:00:39,787 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map -> Sink: Print to Std. Out (3/4) (f109918ce195d1c88c2c63a44934217f) switched from DEPLOYING to RUNNING.
2019-08-21 20:00:39,806 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source: Custom Source -> Flat Map -> Filter -> Filter -> Map -> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) -> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (2/4) (f1a893fc5e2440331c8e6db1874ad220) switched from DEPLOYING to RUNNING.
2019-08-21 20:00:39,809 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source: Custom Source -> Flat Map -> Filter -> Filter -> Map -> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) -> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (1/4) (ee39866835f7b7387f518e352787a532) switched from DEPLOYING to RUNNING.
2019-08-21 20:00:39,809 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - window: (TumblingGroupWindow('w$, 'rowtime, 60000.millis)), select: (COUNT(*) AS pv, COUNT(DISTINCT curuserid) AS uv, start('w$) AS w$start, end('w$) AS w$end, rowtime('w$) AS w$rowtime, proctime('w$) AS w$proctime) -> select: (pv, uv, utc2local(w$end) AS EXPR$2) -> to: Row (1/1) (0071e1c7c4f70646914329f81dcbd349) switched from DEPLOYING to RUNNING.
2019-08-21 20:00:39,991 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print to Std. Out (1/4) (0a206444b10553cce42fa34ea0b15497) switched from DEPLOYING to RUNNING.
2019-08-21 20:00:39,992 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print to Std. Out (2/4) (a5d1bb759ee6c8001313fe39c7982a8b) switched from DEPLOYING to RUNNING.
2019-08-21 20:00:39,996 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map -> Sink: Print to Std. Out (1/4) (39d437bceb033f5bd37747e9142c7f0e) switched from DEPLOYING to RUNNING.
2019-08-21 20:00:39,997 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map -> Sink: Print to Std. Out (2/4) (aa4f64edc4f10da2e8bdbf6ebf0aeb83) switched from DEPLOYING to RUNNING.
2019-08-21 20:00:40,005 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source: Custom Source -> Flat Map -> Filter -> Filter -> Map -> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) -> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (4/4) (78f520582607e26b365fca483fc98d4c) switched from DEPLOYING to RUNNING.
2019-08-21 20:00:40,006 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source: Custom Source -> Flat Map -> Filter -> Filter -> Map -> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) -> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (3/4) (4756a0450881325af5fb396a18dbb8ae) switched from DEPLOYING to RUNNING.
2019-08-21 20:00:41,605 INFO  org.apache.flink.runtime.checkpoint.CheckpointCoordinator     - Triggering checkpoint 1 @ 1566388841597 for job e5ea83238d721e932b4645589660406c.
2019-08-21 20:00:42,466 INFO  org.apache.flink.runtime.checkpoint.CheckpointCoordinator     - Completed checkpoint 1 for job e5ea83238d721e932b4645589660406c (7994 bytes in 821 ms).
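
To see how much time the JobManager side of the startup actually took, the timestamps in the log above can be subtracted directly. The following is only a minimal sketch: the helper name `elapsed_seconds` and the variable names are hypothetical, and the four timestamps are copied from the log lines above (job switched to RUNNING, first TaskExecutor container requested, first container received, checkpoint 1 completed).

```python
from datetime import datetime

# Timestamp layout used by the log lines above, e.g. "2019-08-21 20:00:28,261".
LOG_TIME_FORMAT = "%Y-%m-%d %H:%M:%S,%f"


def elapsed_seconds(start: str, end: str) -> float:
    """Return the number of seconds between two log timestamps (hypothetical helper)."""
    started = datetime.strptime(start, LOG_TIME_FORMAT)
    finished = datetime.strptime(end, LOG_TIME_FORMAT)
    return (finished - started).total_seconds()


# Timestamps copied from the JobManager log above.
job_running = "2019-08-21 20:00:28,261"          # job switched from CREATED to RUNNING
container_requested = "2019-08-21 20:00:28,321"  # first TaskExecutor container requested
container_received = "2019-08-21 20:00:33,765"   # first container received from YARN
checkpoint_done = "2019-08-21 20:00:42,466"      # checkpoint 1 completed

print(f"waiting for first container: {elapsed_seconds(container_requested, container_received):.3f} s")
print(f"job RUNNING to first checkpoint: {elapsed_seconds(job_running, checkpoint_done):.3f} s")
```

If both figures come out as a few seconds, the long wait is unlikely to be inside the JobManager, and the client-side log from before the YARN application was created is probably the more useful thing to look at.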



------------------ Original Message ------------------
From: "Yun Tang"<[hidden email]>;
Sent: Thursday, August 22, 2019, 11:08 AM
To: "user-zh"<[hidden email]>;

Subject: Re: Flink waits 10 minutes at startup



Hi

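A Flink on YARN job can be slow to start for many reasons, for example waiting because resources are insufficient, or a container exiting while it is being requested. The default slot request timeout is 5 minutes, so my guess is that your job hit a slot request timeout and then requested the slots again. Please share the JobManager log so we can analyze this further.

Best regards,
唐云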
________________________________
From: 々守护々 <[hidden email]>
Sent: Thursday, August 22, 2019 11:04
To: user-zh <[hidden email]>
Subject: Flink waits 10 minutes at startup

Hello, Flink community!
      I am using Flink SQL (Flink 1.8.1) and adjusted the memory parameters in yarn-site.xml on my Hadoop cluster:
        <property>
                <name>yarn.nodemanager.resource.memory-mb</name>
                <value>16384</value>
        </property>
        <property>
                <name>yarn.scheduler.minimum-allocation-mb</name>
                <value>1024</value>
        </property>
        <property>
                <name>yarn.scheduler.maximum-allocation-mb</name>
                <value>8192</value>
        </property>



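I then launched the Flink on YARN job with this command: ./flink-1.8.1/bin/flink run -m yarn-cluster -p 4 -d -c streaming.StreamKafkaMain -yn 4 -ys 2 -yjm 2048 -ytm 8192  -ynm bigdata6 ./flink_bigdata6.jar
The job does eventually start successfully, but only after a 10-minute wait, which is far too long. Is there any way to fix this?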
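
As a rough sanity check on the numbers in the launch command and yarn-site.xml quoted above, the sketch below estimates how many TaskManager containers the job needs and how many fit on one NodeManager. It assumes default slot sharing, so that the number of slots required equals the job parallelism, and it ignores vcores and YARN's rounding of memory requests. The function names are hypothetical.

```python
import math


def task_managers_needed(parallelism: int, slots_per_tm: int) -> int:
    """TaskManagers required, assuming one slot per parallel subtask (default slot sharing)."""
    return math.ceil(parallelism / slots_per_tm)


def containers_per_node(node_mb: int, container_mb: int, reserved_mb: int = 0) -> int:
    """How many containers of container_mb fit on a node after reserving reserved_mb."""
    return (node_mb - reserved_mb) // container_mb


# Values taken from the launch command (-p 4 -ys 2 -yjm 2048 -ytm 8192)
# and from yarn.nodemanager.resource.memory-mb (16384).
tms = task_managers_needed(parallelism=4, slots_per_tm=2)
per_empty_node = containers_per_node(node_mb=16384, container_mb=8192)
per_jm_node = containers_per_node(node_mb=16384, container_mb=8192, reserved_mb=2048)

print(f"TaskManagers needed: {tms}")
print(f"TaskManagers per empty node: {per_empty_node}")
print(f"TaskManagers on the node that also hosts the JobManager: {per_jm_node}")
```

Under these assumptions the job needs two 8192 MB TaskManager containers, which agrees with the two pending container requests in the JobManager log, so the memory settings alone do not obviously explain a 10-minute wait.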

Re: Reply: Flink waits 10 minutes at startup

tison
In reply to this post by 守护
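The user-zh list does not support inline images. Please upload the image to a third-party host and paste the link, or, if I remember correctly, you can send it as an email attachment.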

Best,
tison.


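々守护々 <[hidden email]> wrote on Thursday, August 22, 2019 at 11:33 AM:

> Here is what happens: at startup the client log simply stalls, and the YARN application is only created after a 10-minute wait.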
> <http://hadoop.changba.com/cluster/app/application_1566383236573_0003>
>
>
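> ------------------ Original Message ------------------
> *From:* "Yun Tang"<[hidden email]>;
> *Sent:* Thursday, August 22, 2019, 11:23 AM
> *To:* "user-zh"<[hidden email]>;
> *Subject:* Re: Reply: Flink waits 10 minutes at startup
>
> The startup time is 20:00:25, and by 20:00:42 all tasks were RUNNING and
> the first checkpoint had even completed. That is only 17 seconds in total, so where does the 10-minute problem come from?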
> ________________________________
> From: 々守护々 <[hidden email]>
> Sent: Thursday, August 22, 2019 11:18
> To: user-zh <[hidden email]>
> Subject: Reply: Flink waits 10 minutes at startup
>
> Hello, this is my JobManager startup log. Please take a look, thanks!
>
>
> 2019-08-21 20:00:25,428 INFO
> org.apache.flink.runtime.entrypoint.ClusterEntrypoint         -
> --------------------------------------------------------------------------------
> 2019-08-21 20:00:25,430 INFO
> org.apache.flink.runtime.entrypoint.ClusterEntrypoint         - Registered
> UNIX signal handlers for [TERM, HUP, INT]
> 2019-08-21 20:00:25,433 INFO
> org.apache.flink.runtime.entrypoint.ClusterEntrypoint         - YARN daemon
> is running as: root Yarn client user obtainer: root
> 2019-08-21 20:00:25,437 INFO
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: high-availability.cluster-id,
> application_1566383236573_0003
> 2019-08-21 20:00:25,437 INFO
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: jobmanager.rpc.address, localhost
> 2019-08-21 20:00:25,437 INFO
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: jobmanager.rpc.port, 6123
> 2019-08-21 20:00:25,437 INFO
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: high-availability.zookeeper.path.root, /flink-yarn
> 2019-08-21 20:00:25,437 INFO
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: high-availability.storageDir,
> hdfs://host51:9000/flink/ha-yarn
> 2019-08-21 20:00:25,438 INFO
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: high-availability.zookeeper.quorum, host51:2181
> 2019-08-21 20:00:25,438 INFO
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: internal.cluster.execution-mode, DETACHED
> 2019-08-21 20:00:25,438 INFO
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: high-availability, zookeeper
> 2019-08-21 20:00:25,438 INFO
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: parallelism.default, 1
> 2019-08-21 20:00:25,438 INFO
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: yarn.application-attempts, 10
> 2019-08-21 20:00:25,438 INFO
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: taskmanager.numberOfTaskSlots, 2
> 2019-08-21 20:00:25,439 INFO
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: taskmanager.heap.size, 8192m
> 2019-08-21 20:00:25,439 INFO
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: jobmanager.heap.size, 2048m
> 2019-08-21 20:00:25,465 WARN
> org.apache.flink.configuration.Configuration                  - Config uses
> deprecated configuration key 'web.port' instead of proper key
> 'rest.bind-port'
> 2019-08-21 20:00:25,469 INFO
> org.apache.flink.runtime.clusterframework.BootstrapTools      - Setting
> directories for temporary files to:
> /usr/hadoop/hadoop-2.7.7/tmp/nm-local-dir/usercache/root/appcache/application_1566383236573_0003
> 2019-08-21 20:00:25,485 INFO
> org.apache.flink.runtime.entrypoint.ClusterEntrypoint         - Starting
> YarnJobClusterEntrypoint.
> 2019-08-21 20:00:25,485 INFO
> org.apache.flink.runtime.entrypoint.ClusterEntrypoint         - Install
> default filesystem.
> 2019-08-21 20:00:25,561 INFO
> org.apache.flink.runtime.security.modules.HadoopModule        - Hadoop user
> set to root (auth:SIMPLE)
> 2019-08-21 20:00:25,580 INFO
> org.apache.flink.runtime.entrypoint.ClusterEntrypoint         -
> Initializing cluster services.
> 2019-08-21 20:00:25,757 INFO
> org.apache.flink.runtime.rpc.akka.AkkaRpcServiceUtils         - Trying to
> start actor system at host53:0
> 2019-08-21 20:00:26,187 INFO
> akka.event.slf4j.Slf4jLogger                                  - Slf4jLogger
> started
> 2019-08-21 20:00:26,251 INFO
> akka.remote.Remoting                                          - Starting
> remoting
> 2019-08-21 20:00:26,439 INFO
> akka.remote.Remoting                                          - Remoting
> started; listening on addresses :[akka.tcp://flink@host53:59609]
> 2019-08-21 20:00:26,447 INFO
> org.apache.flink.runtime.rpc.akka.AkkaRpcServiceUtils         - Actor
> system started at akka.tcp://flink@host53:59609
> 2019-08-21 20:00:27,011 INFO
> org.apache.flink.runtime.blob.FileSystemBlobStore             - Creating
> highly available BLOB storage directory at
> hdfs://host51:9000/flink/ha-yarn/application_1566383236573_0003/blob
> 2019-08-21 20:00:27,097 INFO
> org.apache.flink.runtime.util.ZooKeeperUtils                  - Enforcing
> default ACL for ZK connections
> 2019-08-21 20:00:27,097 INFO
> org.apache.flink.runtime.util.ZooKeeperUtils                  - Using
> '/flink-yarn/application_1566383236573_0003' as Zookeeper namespace.
> 2019-08-21 20:00:27,161 INFO
> org.apache.flink.shaded.curator.org.apache.curator.framework.imps.CuratorFrameworkImpl
> - Starting
> 2019-08-21 20:00:27,169 INFO
> org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client
> environment:zookeeper.version=3.4.10-39d3a4f269333c922ed3db283be479f9deacaa0f,
> built on 03/23/2017 10:13 GMT
> 2019-08-21 20:00:27,169 INFO
> org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client
> environment:host.name=host53
> 2019-08-21 20:00:27,169 INFO
> org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client
> environment:java.version=1.8.0_77
> 2019-08-21 20:00:27,169 INFO
> org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client
> environment:java.vendor=Oracle Corporation
> 2019-08-21 20:00:27,169 INFO
> org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client
> 2019-08-21 20:00:27,170 INFO
> org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client
> environment:java.library.path=:/usr/hadoop/hadoop-2.7.7/lib/native:/usr/hadoop/hadoop-2.7.7/lib/native:/usr/java/packages/lib/amd64:/usr/lib64:/lib64:/lib:/usr/lib
> 2019-08-21 20:00:27,170 INFO
> org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client
> environment:java.io.tmpdir=/tmp
> 2019-08-21 20:00:27,170 INFO
> org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client
> environment:java.compiler=<NA>
> 2019-08-21 20:00:27,170 INFO
> org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client
> environment:os.name=Linux
> 2019-08-21 20:00:27,170 INFO
> org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client
> environment:os.arch=amd64
> 2019-08-21 20:00:27,170 INFO
> org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client
> environment:os.version=3.16.0-0.bpo.4-amd64
> 2019-08-21 20:00:27,170 INFO
> org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client
> environment:user.name=root
> 2019-08-21 20:00:27,170 INFO
> org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client
> environment:user.home=/root
> 2019-08-21 20:00:27,170 INFO
> org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client
> environment:user.dir=/usr/hadoop/hadoop-2.7.7/tmp/nm-local-dir/usercache/root/appcache/application_1566383236573_0003/container_1566383236573_0003_01_000001
> 2019-08-21 20:00:27,171 INFO
> org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  -
> Initiating client connection, connectString=host51:2181
> sessionTimeout=60000
> watcher=org.apache.flink.shaded.curator.org.apache.curator.ConnectionState@319854f0
> 2019-08-21 20:00:27,184 WARN
> org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ClientCnxn  - SASL
> configuration failed: javax.security.auth.login.LoginException: No JAAS
> configuration section named 'Client' was found in specified JAAS
> configuration file: '/tmp/jaas-4274640356638794035.conf'. Will continue
> connection to Zookeeper server without SASL authentication, if Zookeeper
> server allows it.
> 2019-08-21 20:00:27,186 INFO
> org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ClientCnxn  -
> Opening socket connection to server host51/10.10.10.51:2181
> 2019-08-21 20:00:27,187 ERROR
> org.apache.flink.shaded.curator.org.apache.curator.ConnectionState  -
> Authentication failed
> 2019-08-21 20:00:27,187 INFO
> org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ClientCnxn  - Socket
> connection established to host51/10.10.10.51:2181, initiating session
> 2019-08-21 20:00:27,193 INFO
> org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ClientCnxn  -
> Session establishment complete on server host51/10.10.10.51:2181,
> sessionid = 0x1001a86de810008, negotiated timeout = 40000
> 2019-08-21 20:00:27,193 INFO
> org.apache.flink.runtime.blob.BlobServer                      - Created
> BLOB server storage directory
> /usr/hadoop/hadoop-2.7.7/tmp/nm-local-dir/usercache/root/appcache/application_1566383236573_0003/blobStore-eaa2bb2b-cddb-42e3-8ce5-bf2f83e8567a
> 2019-08-21 20:00:27,195 INFO
> org.apache.flink.shaded.curator.org.apache.curator.framework.state.ConnectionStateManager
> - State change: CONNECTED
> 2019-08-21 20:00:27,198 INFO
> org.apache.flink.runtime.blob.BlobServer                      - Started
> BLOB server at 0.0.0.0:58591 - max concurrent requests: 50 - max backlog:
> 1000
> 2019-08-21 20:00:27,213 INFO
> org.apache.flink.runtime.metrics.MetricRegistryImpl           - No metrics
> reporter configured, no metrics will be exposed/reported.
> 2019-08-21 20:00:27,215 INFO
> org.apache.flink.runtime.entrypoint.ClusterEntrypoint         - Trying to
> start actor system at host53:0
> 2019-08-21 20:00:27,239 INFO
> akka.event.slf4j.Slf4jLogger                                  - Slf4jLogger
> started
> 2019-08-21 20:00:27,247 INFO
> akka.remote.Remoting                                          - Starting
> remoting
> 2019-08-21 20:00:27,255 INFO
> akka.remote.Remoting                                          - Remoting
> started; listening on addresses :[akka.tcp://flink-metrics@host53:59941]
> 2019-08-21 20:00:27,257 INFO
> org.apache.flink.runtime.entrypoint.ClusterEntrypoint         - Actor
> system started at akka.tcp://flink-metrics@host53:59941
> 2019-08-21 20:00:27,263 INFO
> org.apache.flink.runtime.blob.TransientBlobCache              - Created
> BLOB cache storage directory
> /usr/hadoop/hadoop-2.7.7/tmp/nm-local-dir/usercache/root/appcache/application_1566383236573_0003/blobStore-405a4797-824f-48da-bb50-2ee2e230d249
> 2019-08-21 20:00:27,296 WARN
> org.apache.flink.configuration.Configuration                  - Config uses
> deprecated configuration key 'web.port' instead of proper key
> 'rest.bind-port'
> 2019-08-21 20:00:27,298 WARN
> org.apache.flink.runtime.jobmaster.MiniDispatcherRestEndpoint  - Upload
> directory
> /tmp/flink-web-57c9cddd-64af-4e3d-96a9-4f9e78d20b86/flink-web-upload does
> not exist, or has been deleted externally. Previously uploaded files are no
> longer available.
> 2019-08-21 20:00:27,298 INFO
> org.apache.flink.runtime.jobmaster.MiniDispatcherRestEndpoint  - Created
> directory
> /tmp/flink-web-57c9cddd-64af-4e3d-96a9-4f9e78d20b86/flink-web-upload for
> file uploads.
> 2019-08-21 20:00:27,320 INFO
> org.apache.flink.runtime.jobmaster.MiniDispatcherRestEndpoint  - Starting
> rest endpoint.
> 2019-08-21 20:00:27,616 INFO
> org.apache.flink.runtime.webmonitor.WebMonitorUtils           - Determined
> location of main cluster component log file:
> /usr/hadoop/hadoop-2.7.7/logs/userlogs/application_1566383236573_0003/container_1566383236573_0003_01_000001/jobmanager.log
> 2019-08-21 20:00:27,616 INFO
> org.apache.flink.runtime.webmonitor.WebMonitorUtils           - Determined
> location of main cluster component stdout file:
> /usr/hadoop/hadoop-2.7.7/logs/userlogs/application_1566383236573_0003/container_1566383236573_0003_01_000001/jobmanager.out
> 2019-08-21 20:00:27,771 INFO
> org.apache.flink.runtime.jobmaster.MiniDispatcherRestEndpoint  - Rest
> endpoint listening at host53:46401
> 2019-08-21 20:00:27,772 INFO
> org.apache.flink.runtime.leaderelection.ZooKeeperLeaderElectionService  -
> Starting ZooKeeperLeaderElectionService
> ZooKeeperLeaderElectionService{leaderPath='/leader/rest_server_lock'}.
> 2019-08-21 20:00:27,795 WARN
> org.apache.flink.shaded.curator.org.apache.curator.utils.ZKPaths  - The
> version of ZooKeeper being used doesn't support Container nodes.
> CreateMode.PERSISTENT will be used instead.
> 2019-08-21 20:00:27,807 INFO
> org.apache.flink.runtime.jobmaster.MiniDispatcherRestEndpoint  - Web
> frontend listening at http://host53:46401.
> 2019-08-21 20:00:27,836 INFO
> org.apache.flink.runtime.jobmaster.MiniDispatcherRestEndpoint  -
> http://host53:46401 was granted leadership with
> leaderSessionID=8487757f-fd82-42b2-a1a6-adcc69e1eba1
> 2019-08-21 20:00:27,932 INFO
> org.apache.flink.runtime.rpc.akka.AkkaRpcService              - Starting
> RPC endpoint for org.apache.flink.yarn.YarnResourceManager at
> akka://flink/user/resourcemanager .
> 2019-08-21 20:00:28,021 INFO
> org.apache.flink.runtime.rpc.akka.AkkaRpcService              - Starting
> RPC endpoint for org.apache.flink.runtime.dispatcher.MiniDispatcher at
> akka://flink/user/dispatcher .
> 2019-08-21 20:00:28,033 INFO
> org.apache.flink.runtime.leaderretrieval.ZooKeeperLeaderRetrievalService  -
> Starting ZooKeeperLeaderRetrievalService /leader/resource_manager_lock.
> 2019-08-21 20:00:28,033 INFO
> org.apache.flink.runtime.leaderretrieval.ZooKeeperLeaderRetrievalService  -
> Starting ZooKeeperLeaderRetrievalService /leader/dispatcher_lock.
> 2019-08-21 20:00:28,033 INFO
> org.apache.flink.runtime.leaderelection.ZooKeeperLeaderElectionService  -
> Starting ZooKeeperLeaderElectionService
> ZooKeeperLeaderElectionService{leaderPath='/leader/dispatcher_lock'}.
> 2019-08-21 20:00:28,045 INFO
> org.apache.flink.runtime.dispatcher.MiniDispatcher            - Dispatcher
> akka.tcp://flink@host53:59609/user/dispatcher was granted leadership with
> fencing token 6216d17e-56b1-43bb-9bb3-f9fa5b8921be
> 2019-08-21 20:00:28,048 INFO
> org.apache.flink.runtime.dispatcher.MiniDispatcher            - Recovering
> all persisted jobs.
> 2019-08-21 20:00:28,054 INFO
> org.apache.hadoop.yarn.client.RMProxy                         - Connecting
> to ResourceManager at host51/10.10.10.51:8030
> 2019-08-21 20:00:28,078 INFO
> org.apache.flink.runtime.rpc.akka.AkkaRpcService              - Starting
> RPC endpoint for org.apache.flink.runtime.jobmaster.JobMaster at
> akka://flink/user/jobmanager_0 .
> 2019-08-21 20:00:28,086 INFO
> org.apache.flink.runtime.jobmaster.JobMaster                  -
> Initializing job StreamKafkaMain (e5ea83238d721e932b4645589660406c).
> 2019-08-21 20:00:28,099 INFO
> org.apache.flink.runtime.jobmaster.JobMaster                  - Using
> restart strategy
> FixedDelayRestartStrategy(maxNumberRestartAttempts=2147483647,
> delayBetweenRestartAttempts=0) for StreamKafkaMain
> (e5ea83238d721e932b4645589660406c).
> 2019-08-21 20:00:28,122 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Job
> recovers via failover strategy: full graph restart
> 2019-08-21 20:00:28,153 INFO
> org.apache.flink.runtime.jobmaster.JobMaster                  - Running
> initialization on master for job StreamKafkaMain
> (e5ea83238d721e932b4645589660406c).
> 2019-08-21 20:00:28,154 INFO
> org.apache.flink.runtime.jobmaster.JobMaster                  -
> Successfully ran initialization on master in 0 ms.
> 2019-08-21 20:00:28,187 INFO
> org.apache.flink.runtime.util.ZooKeeperUtils                  - Initialized
> ZooKeeperCompletedCheckpointStore in
> '/checkpoints/e5ea83238d721e932b4645589660406c'.
> 2019-08-21 20:00:28,196 INFO
> org.apache.flink.yarn.YarnResourceManager                     - Recovered 0
> containers from previous attempts ([]).
> 2019-08-21 20:00:28,199 INFO
> org.apache.hadoop.yarn.client.api.impl.ContainerManagementProtocolProxy  -
> yarn.client.max-cached-nodemanagers-proxies : 0
> 2019-08-21 20:00:28,200 INFO
> org.apache.flink.runtime.leaderelection.ZooKeeperLeaderElectionService  -
> Starting ZooKeeperLeaderElectionService
> ZooKeeperLeaderElectionService{leaderPath='/leader/resource_manager_lock'}.
> 2019-08-21 20:00:28,203 INFO
> org.apache.flink.runtime.jobmaster.JobMaster                  - No state
> backend has been configured, using default (Memory / JobManager)
> MemoryStateBackend (data in heap memory / checkpoints to JobManager)
> (checkpoints: 'null', savepoints: 'null', asynchronous: TRUE, maxStateSize:
> 5242880)
> 2019-08-21 20:00:28,211 INFO
> org.apache.flink.yarn.YarnResourceManager                     -
> ResourceManager akka.tcp://flink@host53:59609/user/resourcemanager was
> granted leadership with fencing token 97537c0d61bbf091e37cb6c45b784568
> 2019-08-21 20:00:28,214 INFO
> org.apache.flink.runtime.resourcemanager.slotmanager.SlotManager  -
> Starting the SlotManager.
> 2019-08-21 20:00:28,225 INFO
> org.apache.flink.runtime.checkpoint.ZooKeeperCompletedCheckpointStore  -
> Recovering checkpoints from ZooKeeper.
> 2019-08-21 20:00:28,229 INFO
> org.apache.flink.runtime.checkpoint.ZooKeeperCompletedCheckpointStore  -
> Found 0 checkpoints in ZooKeeper.
> 2019-08-21 20:00:28,229 INFO
> org.apache.flink.runtime.checkpoint.ZooKeeperCompletedCheckpointStore  -
> Trying to fetch 0 checkpoints from storage.
> 2019-08-21 20:00:28,233 INFO
> org.apache.flink.runtime.leaderelection.ZooKeeperLeaderElectionService  -
> Starting ZooKeeperLeaderElectionService
> ZooKeeperLeaderElectionService{leaderPath='/leader/e5ea83238d721e932b4645589660406c/job_manager_lock'}.
> 2019-08-21 20:00:28,248 INFO
> org.apache.flink.runtime.jobmaster.JobManagerRunner           - JobManager
> runner for job StreamKafkaMain (e5ea83238d721e932b4645589660406c) was
> granted leadership with session id 12c96c94-ebbd-4991-a457-6f0320d71bc1 at
> akka.tcp://flink@host53:59609/user/jobmanager_0.
> 2019-08-21 20:00:28,260 INFO
> org.apache.flink.runtime.leaderretrieval.ZooKeeperLeaderRetrievalService  -
> Starting ZooKeeperLeaderRetrievalService /leader/resource_manager_lock.
> 2019-08-21 20:00:28,260 INFO
> org.apache.flink.runtime.jobmaster.JobMaster                  - Starting
> execution of job StreamKafkaMain (e5ea83238d721e932b4645589660406c) under
> job master id a4576f0320d71bc112c96c94ebbd4991.
> 2019-08-21 20:00:28,261 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Job
> StreamKafkaMain (e5ea83238d721e932b4645589660406c) switched from state
> CREATED to RUNNING.
> 2019-08-21 20:00:28,266 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source:
> Custom Source -> Flat Map -> Filter -> Filter -> Map ->
> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) ->
> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (1/4)
> (ee39866835f7b7387f518e352787a532) switched from CREATED to SCHEDULED.
> 2019-08-21 20:00:28,280 INFO
> org.apache.flink.runtime.jobmaster.slotpool.SlotPoolImpl      - Cannot
> serve slot request, no ResourceManager connected. Adding as pending request
> [SlotRequestId{f6123820ffb8b3e906b876e9dea4497b}]
> 2019-08-21 20:00:28,287 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source:
> Custom Source -> Flat Map -> Filter -> Filter -> Map ->
> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) ->
> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (2/4)
> (f1a893fc5e2440331c8e6db1874ad220) switched from CREATED to SCHEDULED.
> 2019-08-21 20:00:28,288 INFO
> org.apache.flink.runtime.jobmaster.slotpool.SlotPoolImpl      - Cannot
> serve slot request, no ResourceManager connected. Adding as pending request
> [SlotRequestId{fdbb2e71c9bc795452687396abff7bf8}]
> 2019-08-21 20:00:28,288 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source:
> Custom Source -> Flat Map -> Filter -> Filter -> Map ->
> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) ->
> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (3/4)
> (4756a0450881325af5fb396a18dbb8ae) switched from CREATED to SCHEDULED.
> 2019-08-21 20:00:28,289 INFO
> org.apache.flink.runtime.jobmaster.slotpool.SlotPoolImpl      - Cannot
> serve slot request, no ResourceManager connected. Adding as pending request
> [SlotRequestId{13970fab9312acf9d5063b1d1a33aa0e}]
> 2019-08-21 20:00:28,289 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source:
> Custom Source -> Flat Map -> Filter -> Filter -> Map ->
> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) ->
> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (4/4)
> (78f520582607e26b365fca483fc98d4c) switched from CREATED to SCHEDULED.
> 2019-08-21 20:00:28,289 INFO
> org.apache.flink.runtime.jobmaster.slotpool.SlotPoolImpl      - Cannot
> serve slot request, no ResourceManager connected. Adding as pending request
> [SlotRequestId{0621470f796aa3ea4291d1b8b423e428}]
> 2019-08-21 20:00:28,289 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - window:
> (TumblingGroupWindow('w$, 'rowtime, 60000.millis)), select: (COUNT(*) AS
> pv, COUNT(DISTINCT curuserid) AS uv, start('w$) AS w$start, end('w$) AS
> w$end, rowtime('w$) AS w$rowtime, proctime('w$) AS w$proctime) -> select:
> (pv, uv, utc2local(w$end) AS EXPR$2) -> to: Row (1/1)
> (0071e1c7c4f70646914329f81dcbd349) switched from CREATED to SCHEDULED.
> 2019-08-21 20:00:28,291 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print
> to Std. Out (1/4) (0a206444b10553cce42fa34ea0b15497) switched from CREATED
> to SCHEDULED.
> 2019-08-21 20:00:28,291 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print
> to Std. Out (2/4) (a5d1bb759ee6c8001313fe39c7982a8b) switched from CREATED
> to SCHEDULED.
> 2019-08-21 20:00:28,291 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print
> to Std. Out (3/4) (9543bbde6538ea9d407af3012fede5c8) switched from CREATED
> to SCHEDULED.
> 2019-08-21 20:00:28,291 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print
> to Std. Out (4/4) (0e8c4210bd81e581fbf354b86397ce41) switched from CREATED
> to SCHEDULED.
> 2019-08-21 20:00:28,291 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map ->
> Sink: Print to Std. Out (1/4) (39d437bceb033f5bd37747e9142c7f0e) switched
> from CREATED to SCHEDULED.
> 2019-08-21 20:00:28,291 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map ->
> Sink: Print to Std. Out (2/4) (aa4f64edc4f10da2e8bdbf6ebf0aeb83) switched
> from CREATED to SCHEDULED.
> 2019-08-21 20:00:28,291 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map ->
> Sink: Print to Std. Out (3/4) (f109918ce195d1c88c2c63a44934217f) switched
> from CREATED to SCHEDULED.
> 2019-08-21 20:00:28,292 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map ->
> Sink: Print to Std. Out (4/4) (9aa5bb28f3ba75afc1de47bd9a4e5d01) switched
> from CREATED to SCHEDULED.
> 2019-08-21 20:00:28,295 INFO
> org.apache.flink.runtime.jobmaster.JobMaster                  - Connecting
> to ResourceManager
> akka.tcp://flink@host53:59609/user/resourcemanager(97537c0d61bbf091e37cb6c45b784568)
> 2019-08-21 20:00:28,300 INFO
> org.apache.flink.runtime.jobmaster.JobMaster                  - Resolved
> ResourceManager address, beginning registration
> 2019-08-21 20:00:28,300 INFO
> org.apache.flink.runtime.jobmaster.JobMaster                  -
> Registration at ResourceManager attempt 1 (timeout=100ms)
> 2019-08-21 20:00:28,304 INFO
> org.apache.flink.runtime.leaderretrieval.ZooKeeperLeaderRetrievalService  -
> Starting ZooKeeperLeaderRetrievalService
> /leader/e5ea83238d721e932b4645589660406c/job_manager_lock.
> 2019-08-21 20:00:28,304 INFO
> org.apache.flink.yarn.YarnResourceManager                     - Registering
> job manager a4576f0320d71bc112c96c94ebbd4991@akka.tcp://flink@host53:59609/user/jobmanager_0
> for job e5ea83238d721e932b4645589660406c.
> 2019-08-21 20:00:28,310 INFO
> org.apache.flink.yarn.YarnResourceManager                     - Registered
> job manager a4576f0320d71bc112c96c94ebbd4991@akka.tcp://flink@host53:59609/user/jobmanager_0
> for job e5ea83238d721e932b4645589660406c.
> 2019-08-21 20:00:28,313 INFO
> org.apache.flink.runtime.jobmaster.JobMaster                  - JobManager
> successfully registered at ResourceManager, leader id:
> 97537c0d61bbf091e37cb6c45b784568.
> 2019-08-21 20:00:28,314 INFO
> org.apache.flink.runtime.jobmaster.slotpool.SlotPoolImpl      - Requesting
> new slot [SlotRequestId{13970fab9312acf9d5063b1d1a33aa0e}] and profile
> ResourceProfile{cpuCores=-1.0, heapMemoryInMB=-1, directMemoryInMB=0,
> nativeMemoryInMB=0, networkMemoryInMB=0} from resource manager.
> 2019-08-21 20:00:28,315 INFO
> org.apache.flink.yarn.YarnResourceManager                     - Request
> slot with profile ResourceProfile{cpuCores=-1.0, heapMemoryInMB=-1,
> directMemoryInMB=0, nativeMemoryInMB=0, networkMemoryInMB=0} for job
> e5ea83238d721e932b4645589660406c with allocation id
> faa6a521758e5be4b3dc1955ca0f51b4.
> 2019-08-21 20:00:28,316 INFO
> org.apache.flink.runtime.jobmaster.slotpool.SlotPoolImpl      - Requesting
> new slot [SlotRequestId{fdbb2e71c9bc795452687396abff7bf8}] and profile
> ResourceProfile{cpuCores=-1.0, heapMemoryInMB=-1, directMemoryInMB=0,
> nativeMemoryInMB=0, networkMemoryInMB=0} from resource manager.
> 2019-08-21 20:00:28,316 INFO
> org.apache.flink.runtime.jobmaster.slotpool.SlotPoolImpl      - Requesting
> new slot [SlotRequestId{0621470f796aa3ea4291d1b8b423e428}] and profile
> ResourceProfile{cpuCores=-1.0, heapMemoryInMB=-1, directMemoryInMB=0,
> nativeMemoryInMB=0, networkMemoryInMB=0} from resource manager.
> 2019-08-21 20:00:28,316 INFO
> org.apache.flink.runtime.jobmaster.slotpool.SlotPoolImpl      - Requesting
> new slot [SlotRequestId{f6123820ffb8b3e906b876e9dea4497b}] and profile
> ResourceProfile{cpuCores=-1.0, heapMemoryInMB=-1, directMemoryInMB=0,
> nativeMemoryInMB=0, networkMemoryInMB=0} from resource manager.
> 2019-08-21 20:00:28,321 INFO
> org.apache.flink.yarn.YarnResourceManager                     - Requesting
> new TaskExecutor container with resources <memory:8192, vCores:2>. Number
> pending requests 1.
> 2019-08-21 20:00:28,322 INFO
> org.apache.flink.yarn.YarnResourceManager                     - Request
> slot with profile ResourceProfile{cpuCores=-1.0, heapMemoryInMB=-1,
> directMemoryInMB=0, nativeMemoryInMB=0, networkMemoryInMB=0} for job
> e5ea83238d721e932b4645589660406c with allocation id
> a2b419aea4a373686eeed6c46163a528.
> 2019-08-21 20:00:28,323 INFO
> org.apache.flink.yarn.YarnResourceManager                     - Request
> slot with profile ResourceProfile{cpuCores=-1.0, heapMemoryInMB=-1,
> directMemoryInMB=0, nativeMemoryInMB=0, networkMemoryInMB=0} for job
> e5ea83238d721e932b4645589660406c with allocation id
> cbc11737ce6729cfe45f24c43180b5c8.
> 2019-08-21 20:00:28,323 INFO
> org.apache.flink.yarn.YarnResourceManager                     - Requesting
> new TaskExecutor container with resources <memory:8192, vCores:2>. Number
> pending requests 2.
> 2019-08-21 20:00:28,323 INFO
> org.apache.flink.yarn.YarnResourceManager                     - Request
> slot with profile ResourceProfile{cpuCores=-1.0, heapMemoryInMB=-1,
> directMemoryInMB=0, nativeMemoryInMB=0, networkMemoryInMB=0} for job
> e5ea83238d721e932b4645589660406c with allocation id
> f6566567c173a55b2bc52b72bcb24ff8.
> 2019-08-21 20:00:31,597 INFO
> org.apache.flink.runtime.checkpoint.CheckpointCoordinator     - Checkpoint
> triggering task Source: Custom Source -> Flat Map -> Filter -> Filter ->
> Map -> Timestamps/Watermarks -> from: (request, curuserid, timelong,
> rowtime) -> select: (rowtime, 0 AS $f1, curuserid) -> time attribute:
> (rowtime) (1/4) of job e5ea83238d721e932b4645589660406c is not in state
> RUNNING but SCHEDULED instead. Aborting checkpoint.
> 2019-08-21 20:00:33,749 INFO
> org.apache.hadoop.yarn.client.api.impl.AMRMClientImpl         - Received
> new token for : host53:59938
> 2019-08-21 20:00:33,749 INFO
> org.apache.hadoop.yarn.client.api.impl.AMRMClientImpl         - Received
> new token for : host52:33246
> 2019-08-21 20:00:33,765 INFO
> org.apache.flink.yarn.YarnResourceManager                     - Received
> new container: container_1566383236573_0003_01_000002 - Remaining pending
> container requests: 2
> 2019-08-21 20:00:33,765 INFO
> org.apache.flink.yarn.YarnResourceManager                     - Removing
> container request Capability[<memory:8192, vCores:2>]Priority[1]. Pending
> container requests 1.
> 2019-08-21 20:00:34,009 INFO
> org.apache.flink.yarn.YarnResourceManager                     - Creating
> container launch context for TaskManagers
> 2019-08-21 20:00:34,010 INFO
> org.apache.flink.yarn.YarnResourceManager                     - Starting
> TaskManagers
> 2019-08-21 20:00:34,021 INFO
> org.apache.hadoop.yarn.client.api.impl.ContainerManagementProtocolProxy  -
> Opening proxy : host53:59938
> 2019-08-21 20:00:34,066 INFO
> org.apache.flink.yarn.YarnResourceManager                     - Received
> new container: container_1566383236573_0003_01_000003 - Remaining pending
> container requests: 1
> 2019-08-21 20:00:34,066 INFO
> org.apache.flink.yarn.YarnResourceManager                     - Removing
> container request Capability[<memory:8192, vCores:2>]Priority[1]. Pending
> container requests 0.
> 2019-08-21 20:00:34,088 INFO
> org.apache.flink.yarn.YarnResourceManager                     - Creating
> container launch context for TaskManagers
> 2019-08-21 20:00:34,089 INFO
> org.apache.flink.yarn.YarnResourceManager                     - Starting
> TaskManagers
> 2019-08-21 20:00:34,090 INFO
> org.apache.hadoop.yarn.client.api.impl.ContainerManagementProtocolProxy  -
> Opening proxy : host52:33246
> 2019-08-21 20:00:36,597 INFO
> org.apache.flink.runtime.checkpoint.CheckpointCoordinator     - Checkpoint
> triggering task Source: Custom Source -> Flat Map -> Filter -> Filter ->
> Map -> Timestamps/Watermarks -> from: (request, curuserid, timelong,
> rowtime) -> select: (rowtime, 0 AS $f1, curuserid) -> time attribute:
> (rowtime) (1/4) of job e5ea83238d721e932b4645589660406c is not in state
> RUNNING but SCHEDULED instead. Aborting checkpoint.
> 2019-08-21 20:00:37,856 INFO
> org.apache.flink.yarn.YarnResourceManager                     - Registering
> TaskManager with ResourceID container_1566383236573_0003_01_000002
> (akka.tcp://flink@host53:34608/user/taskmanager_0) at ResourceManager
> 2019-08-21 20:00:39,573 INFO
> org.apache.flink.yarn.YarnResourceManager                     - Registering
> TaskManager with ResourceID container_1566383236573_0003_01_000003
> (akka.tcp://flink@host52:32931/user/taskmanager_0) at ResourceManager
> 2019-08-21 20:00:39,659 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source:
> Custom Source -> Flat Map -> Filter -> Filter -> Map ->
> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) ->
> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (1/4)
> (ee39866835f7b7387f518e352787a532) switched from SCHEDULED to DEPLOYING.
> 2019-08-21 20:00:39,659 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying
> Source: Custom Source -> Flat Map -> Filter -> Filter -> Map ->
> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) ->
> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (1/4)
> (attempt #0) to container_1566383236573_0003_01_000002 @ host53
> (dataPort=52456)
> 2019-08-21 20:00:39,664 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source:
> Custom Source -> Flat Map -> Filter -> Filter -> Map ->
> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) ->
> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (2/4)
> (f1a893fc5e2440331c8e6db1874ad220) switched from SCHEDULED to DEPLOYING.
> 2019-08-21 20:00:39,664 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying
> Source: Custom Source -> Flat Map -> Filter -> Filter -> Map ->
> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) ->
> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (2/4)
> (attempt #0) to container_1566383236573_0003_01_000002 @ host53
> (dataPort=52456)
> 2019-08-21 20:00:39,665 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source:
> Custom Source -> Flat Map -> Filter -> Filter -> Map ->
> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) ->
> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (3/4)
> (4756a0450881325af5fb396a18dbb8ae) switched from SCHEDULED to DEPLOYING.
> 2019-08-21 20:00:39,666 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying
> Source: Custom Source -> Flat Map -> Filter -> Filter -> Map ->
> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) ->
> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (3/4)
> (attempt #0) to container_1566383236573_0003_01_000003 @ host52
> (dataPort=39650)
> 2019-08-21 20:00:39,666 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source:
> Custom Source -> Flat Map -> Filter -> Filter -> Map ->
> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) ->
> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (4/4)
> (78f520582607e26b365fca483fc98d4c) switched from SCHEDULED to DEPLOYING.
> 2019-08-21 20:00:39,666 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying
> Source: Custom Source -> Flat Map -> Filter -> Filter -> Map ->
> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) ->
> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (4/4)
> (attempt #0) to container_1566383236573_0003_01_000003 @ host52
> (dataPort=39650)
> 2019-08-21 20:00:39,666 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - window:
> (TumblingGroupWindow('w$, 'rowtime, 60000.millis)), select: (COUNT(*) AS
> pv, COUNT(DISTINCT curuserid) AS uv, start('w$) AS w$start, end('w$) AS
> w$end, rowtime('w$) AS w$rowtime, proctime('w$) AS w$proctime) -> select:
> (pv, uv, utc2local(w$end) AS EXPR$2) -> to: Row (1/1)
> (0071e1c7c4f70646914329f81dcbd349) switched from SCHEDULED to DEPLOYING.
> 2019-08-21 20:00:39,666 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying
> window: (TumblingGroupWindow('w$, 'rowtime, 60000.millis)), select:
> (COUNT(*) AS pv, COUNT(DISTINCT curuserid) AS uv, start('w$) AS w$start,
> end('w$) AS w$end, rowtime('w$) AS w$rowtime, proctime('w$) AS w$proctime)
> -> select: (pv, uv, utc2local(w$end) AS EXPR$2) -> to: Row (1/1) (attempt
> #0) to container_1566383236573_0003_01_000002 @ host53 (dataPort=52456)
> 2019-08-21 20:00:39,670 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print
> to Std. Out (1/4) (0a206444b10553cce42fa34ea0b15497) switched from
> SCHEDULED to DEPLOYING.
> 2019-08-21 20:00:39,670 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying
> Sink: Print to Std. Out (1/4) (attempt #0) to
> container_1566383236573_0003_01_000003 @ host52 (dataPort=39650)
> 2019-08-21 20:00:39,671 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print
> to Std. Out (2/4) (a5d1bb759ee6c8001313fe39c7982a8b) switched from
> SCHEDULED to DEPLOYING.
> 2019-08-21 20:00:39,672 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying
> Sink: Print to Std. Out (2/4) (attempt #0) to
> container_1566383236573_0003_01_000003 @ host52 (dataPort=39650)
> 2019-08-21 20:00:39,672 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print
> to Std. Out (3/4) (9543bbde6538ea9d407af3012fede5c8) switched from
> SCHEDULED to DEPLOYING.
> 2019-08-21 20:00:39,672 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying
> Sink: Print to Std. Out (3/4) (attempt #0) to
> container_1566383236573_0003_01_000002 @ host53 (dataPort=52456)
> 2019-08-21 20:00:39,672 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print
> to Std. Out (4/4) (0e8c4210bd81e581fbf354b86397ce41) switched from
> SCHEDULED to DEPLOYING.
> 2019-08-21 20:00:39,673 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying
> Sink: Print to Std. Out (4/4) (attempt #0) to
> container_1566383236573_0003_01_000002 @ host53 (dataPort=52456)
> 2019-08-21 20:00:39,673 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map ->
> Sink: Print to Std. Out (1/4) (39d437bceb033f5bd37747e9142c7f0e) switched
> from SCHEDULED to DEPLOYING.
> 2019-08-21 20:00:39,673 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying
> Map -> Sink: Print to Std. Out (1/4) (attempt #0) to
> container_1566383236573_0003_01_000003 @ host52 (dataPort=39650)
> 2019-08-21 20:00:39,675 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map ->
> Sink: Print to Std. Out (2/4) (aa4f64edc4f10da2e8bdbf6ebf0aeb83) switched
> from SCHEDULED to DEPLOYING.
> 2019-08-21 20:00:39,675 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying
> Map -> Sink: Print to Std. Out (2/4) (attempt #0) to
> container_1566383236573_0003_01_000003 @ host52 (dataPort=39650)
> 2019-08-21 20:00:39,675 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map ->
> Sink: Print to Std. Out (3/4) (f109918ce195d1c88c2c63a44934217f) switched
> from SCHEDULED to DEPLOYING.
> 2019-08-21 20:00:39,675 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying
> Map -> Sink: Print to Std. Out (3/4) (attempt #0) to
> container_1566383236573_0003_01_000002 @ host53 (dataPort=52456)
> 2019-08-21 20:00:39,676 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map ->
> Sink: Print to Std. Out (4/4) (9aa5bb28f3ba75afc1de47bd9a4e5d01) switched
> from SCHEDULED to DEPLOYING.
> 2019-08-21 20:00:39,676 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying
> Map -> Sink: Print to Std. Out (4/4) (attempt #0) to
> container_1566383236573_0003_01_000002 @ host53 (dataPort=52456)
> 2019-08-21 20:00:39,782 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print
> to Std. Out (4/4) (0e8c4210bd81e581fbf354b86397ce41) switched from
> DEPLOYING to RUNNING.
> 2019-08-21 20:00:39,783 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print
> to Std. Out (3/4) (9543bbde6538ea9d407af3012fede5c8) switched from
> DEPLOYING to RUNNING.
> 2019-08-21 20:00:39,784 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map ->
> Sink: Print to Std. Out (4/4) (9aa5bb28f3ba75afc1de47bd9a4e5d01) switched
> from DEPLOYING to RUNNING.
> 2019-08-21 20:00:39,787 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map ->
> Sink: Print to Std. Out (3/4) (f109918ce195d1c88c2c63a44934217f) switched
> from DEPLOYING to RUNNING.
> 2019-08-21 20:00:39,806 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source:
> Custom Source -> Flat Map -> Filter -> Filter -> Map ->
> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) ->
> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (2/4)
> (f1a893fc5e2440331c8e6db1874ad220) switched from DEPLOYING to RUNNING.
> 2019-08-21 20:00:39,809 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source:
> Custom Source -> Flat Map -> Filter -> Filter -> Map ->
> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) ->
> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (1/4)
> (ee39866835f7b7387f518e352787a532) switched from DEPLOYING to RUNNING.
> 2019-08-21 20:00:39,809 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - window:
> (TumblingGroupWindow('w$, 'rowtime, 60000.millis)), select: (COUNT(*) AS
> pv, COUNT(DISTINCT curuserid) AS uv, start('w$) AS w$start, end('w$) AS
> w$end, rowtime('w$) AS w$rowtime, proctime('w$) AS w$proctime) -> select:
> (pv, uv, utc2local(w$end) AS EXPR$2) -> to: Row (1/1)
> (0071e1c7c4f70646914329f81dcbd349) switched from DEPLOYING to RUNNING.
> 2019-08-21 20:00:39,991 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print
> to Std. Out (1/4) (0a206444b10553cce42fa34ea0b15497) switched from
> DEPLOYING to RUNNING.
> 2019-08-21 20:00:39,992 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print
> to Std. Out (2/4) (a5d1bb759ee6c8001313fe39c7982a8b) switched from
> DEPLOYING to RUNNING.
> 2019-08-21 20:00:39,996 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map ->
> Sink: Print to Std. Out (1/4) (39d437bceb033f5bd37747e9142c7f0e) switched
> from DEPLOYING to RUNNING.
> 2019-08-21 20:00:39,997 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map ->
> Sink: Print to Std. Out (2/4) (aa4f64edc4f10da2e8bdbf6ebf0aeb83) switched
> from DEPLOYING to RUNNING.
> 2019-08-21 20:00:40,005 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source:
> Custom Source -> Flat Map -> Filter -> Filter -> Map ->
> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) ->
> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (4/4)
> (78f520582607e26b365fca483fc98d4c) switched from DEPLOYING to RUNNING.
> 2019-08-21 20:00:40,006 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source:
> Custom Source -> Flat Map -> Filter -> Filter -> Map ->
> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) ->
> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (3/4)
> (4756a0450881325af5fb396a18dbb8ae) switched from DEPLOYING to RUNNING.
> 2019-08-21 20:00:41,605 INFO
> org.apache.flink.runtime.checkpoint.CheckpointCoordinator     - Triggering
> checkpoint 1 @ 1566388841597 for job e5ea83238d721e932b4645589660406c.
> 2019-08-21 20:00:42,466 INFO
> org.apache.flink.runtime.checkpoint.CheckpointCoordinator     - Completed
> checkpoint 1 for job e5ea83238d721e932b4645589660406c (7994 bytes in 821
> ms).
>
>
>
> ------------------ Original Message ------------------
> From: "Yun Tang"<[hidden email]>;
> Sent: Thursday, August 22, 2019, 11:08 AM
> To: "user-zh"<[hidden email]>;
>
> Subject: Re: Flink job takes 10 minutes to start
>
>
>
> Hi
>
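> A Flink on YARN job can be slow to start for many reasons, for example
> waiting because resources are insufficient, or containers exiting right
> after they are requested. The default slot request timeout is 5 minutes,
> so my guess is that your job hit one slot request timeout and then
> requested the slots again. Please share the JobManager log so that we can
> analyze this further.
>
> Best regards,
> Yun Tang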
> ________________________________
> From: 々守护々 <[hidden email]>
> Sent: Thursday, August 22, 2019 11:04
> To: user-zh <[hidden email]>
> Subject: Flink job takes 10 minutes to start
>
> Hello Flink community,
> I am using Flink SQL (Flink 1.8.1). On our Hadoop cluster I adjusted the memory parameters in yarn-site.xml:
>         <property>
>                 <name>yarn.nodemanager.resource.memory-mb</name>
>                 <value>16384</value>
>         </property>
>         <property>
>                 <name>yarn.scheduler.minimum-allocation-mb</name>
>                 <value>1024</value>
>         </property>
>         <property>
>                 <name>yarn.scheduler.maximum-allocation-mb</name>
>                 <value>8192</value>
>         </property>
>
>
>
> I then started a Flink on YARN job with this command:
> ./flink-1.8.1/bin/flink run -m yarn-cluster -p 4 -d -c streaming.StreamKafkaMain -yn 4 -ys 2 -yjm 2048 -ytm 8192 -ynm bigdata6 ./flink_bigdata6.jar
> The job does start successfully in the end, but startup takes 10 minutes, which is far too long. Is there any way to fix this?
>

Re: Re: Flink job takes 10 minutes to start

tison
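When you say it "stopped there", do you mean that the terminal output of flink run stopped moving? Check in that terminal output when YARN accepted your application. I suspect the
YARN cluster was busy and did not respond for 10 minutes.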

Best,
tison.
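
One way to check where the time goes is to compare log timestamps directly. The sketch below is only an illustration and is not part of Flink or YARN: the helper names `parse_log_timestamp` and `elapsed_seconds` are made up here, and it assumes every log line starts with a `yyyy-MM-dd HH:mm:ss,SSS` timestamp, as in the JobManager log quoted in this thread. The two sample lines are copied from that log.

```python
from datetime import datetime

# Assumed layout of the leading timestamp, e.g. "2019-08-21 20:00:25,428".
LOG_TS_FORMAT = "%Y-%m-%d %H:%M:%S,%f"
LOG_TS_LENGTH = 23  # number of characters in "2019-08-21 20:00:25,428"


def parse_log_timestamp(line):
    """Parse the timestamp at the start of a log line (hypothetical helper)."""
    return datetime.strptime(line[:LOG_TS_LENGTH], LOG_TS_FORMAT)


def elapsed_seconds(first_line, last_line):
    """Return the seconds between the timestamps of two log lines."""
    delta = parse_log_timestamp(last_line) - parse_log_timestamp(first_line)
    return delta.total_seconds()


# First JobManager log line and the "Completed checkpoint 1" line from the thread.
first_line = "2019-08-21 20:00:25,428 INFO org.apache.flink.runtime.entrypoint.ClusterEntrypoint"
last_line = "2019-08-21 20:00:42,466 INFO org.apache.flink.runtime.checkpoint.CheckpointCoordinator"

print(elapsed_seconds(first_line, last_line))  # prints 17.038
```

The JobManager itself needed about 17 seconds from its first log line to the first completed checkpoint, so the remaining wait has to lie before the JobManager container started. If the `flink run` client output uses the same timestamp layout (an assumption, since the client log is not shown in this thread), the same helper could compare the client's submission line with the first JobManager line.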


On Thu, Aug 22, 2019 at 11:35 AM, Zili Chen <[hidden email]> wrote:

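> user-zh does not support inline images. Please upload it to a third-party host and post a link, or, as far as I remember, you can send it as an email attachment.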
>
> Best,
> tison.
>
>
> On Thu, Aug 22, 2019 at 11:33 AM, 々守护々 <[hidden email]> wrote:
>
>> Here is what happens: during startup the client log just stops there, and the YARN application is only created after a 10-minute wait
>> <http://hadoop.changba.com/cluster/app/application_1566383236573_0003>
>>
>>
>> ------------------ Original Message ------------------
>> *From:* "Yun Tang"<[hidden email]>;
>> *Sent:* Thursday, August 22, 2019, 11:23 AM
>> *To:* "user-zh"<[hidden email]>;
>> *Subject:* Re: Re: Flink job takes 10 minutes to start
>>
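>> The start time is 20:00:25, and by 20:00:42 all tasks are RUNNING and even the first checkpoint
>> has completed. That is only 17 seconds in total, so where does the 10 minutes come from?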
>> ________________________________
>> From: 々守护々 <[hidden email]>
>> Sent: Thursday, August 22, 2019 11:18
>> To: user-zh <[hidden email]>
>> Subject: Re: Flink job takes 10 minutes to start
>>
>> Hello, this is my JobManager startup log. Please take a look, thanks!
>>
>>
>> 2019-08-21 20:00:25,428 INFO
>> org.apache.flink.runtime.entrypoint.ClusterEntrypoint         -
>> --------------------------------------------------------------------------------
>> 2019-08-21 20:00:25,430 INFO
>> org.apache.flink.runtime.entrypoint.ClusterEntrypoint         - Registered
>> UNIX signal handlers for [TERM, HUP, INT]
>> 2019-08-21 20:00:25,433 INFO
>> org.apache.flink.runtime.entrypoint.ClusterEntrypoint         - YARN daemon
>> is running as: root Yarn client user obtainer: root
>> 2019-08-21 20:00:25,437 INFO
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: high-availability.cluster-id,
>> application_1566383236573_0003
>> 2019-08-21 20:00:25,437 INFO
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: jobmanager.rpc.address, localhost
>> 2019-08-21 20:00:25,437 INFO
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: jobmanager.rpc.port, 6123
>> 2019-08-21 20:00:25,437 INFO
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: high-availability.zookeeper.path.root, /flink-yarn
>> 2019-08-21 20:00:25,437 INFO
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: high-availability.storageDir,
>> hdfs://host51:9000/flink/ha-yarn
>> 2019-08-21 20:00:25,438 INFO
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: high-availability.zookeeper.quorum, host51:2181
>> 2019-08-21 20:00:25,438 INFO
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: internal.cluster.execution-mode, DETACHED
>> 2019-08-21 20:00:25,438 INFO
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: high-availability, zookeeper
>> 2019-08-21 20:00:25,438 INFO
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: parallelism.default, 1
>> 2019-08-21 20:00:25,438 INFO
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: yarn.application-attempts, 10
>> 2019-08-21 20:00:25,438 INFO
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: taskmanager.numberOfTaskSlots, 2
>> 2019-08-21 20:00:25,439 INFO
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: taskmanager.heap.size, 8192m
>> 2019-08-21 20:00:25,439 INFO
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: jobmanager.heap.size, 2048m
>> 2019-08-21 20:00:25,465 WARN
>> org.apache.flink.configuration.Configuration                  - Config uses
>> deprecated configuration key 'web.port' instead of proper key
>> 'rest.bind-port'
>> 2019-08-21 20:00:25,469 INFO
>> org.apache.flink.runtime.clusterframework.BootstrapTools      - Setting
>> directories for temporary files to:
>> /usr/hadoop/hadoop-2.7.7/tmp/nm-local-dir/usercache/root/appcache/application_1566383236573_0003
>> 2019-08-21 20:00:25,485 INFO
>> org.apache.flink.runtime.entrypoint.ClusterEntrypoint         - Starting
>> YarnJobClusterEntrypoint.
>> 2019-08-21 20:00:25,485 INFO
>> org.apache.flink.runtime.entrypoint.ClusterEntrypoint         - Install
>> default filesystem.
>> 2019-08-21 20:00:25,561 INFO
>> org.apache.flink.runtime.security.modules.HadoopModule        - Hadoop user
>> set to root (auth:SIMPLE)
>> 2019-08-21 20:00:25,580 INFO
>> org.apache.flink.runtime.entrypoint.ClusterEntrypoint         -
>> Initializing cluster services.
>> 2019-08-21 20:00:25,757 INFO
>> org.apache.flink.runtime.rpc.akka.AkkaRpcServiceUtils         - Trying to
>> start actor system at host53:0
>> 2019-08-21 20:00:26,187 INFO
>> akka.event.slf4j.Slf4jLogger                                  - Slf4jLogger
>> started
>> 2019-08-21 20:00:26,251 INFO
>> akka.remote.Remoting                                          - Starting
>> remoting
>> 2019-08-21 20:00:26,439 INFO
>> akka.remote.Remoting                                          - Remoting
>> started; listening on addresses :[akka.tcp://flink@host53:59609]
>> 2019-08-21 20:00:26,447 INFO
>> org.apache.flink.runtime.rpc.akka.AkkaRpcServiceUtils         - Actor
>> system started at akka.tcp://flink@host53:59609
>> 2019-08-21 20:00:27,011 INFO
>> org.apache.flink.runtime.blob.FileSystemBlobStore             - Creating
>> highly available BLOB storage directory at
>> hdfs://host51:9000/flink/ha-yarn/application_1566383236573_0003/blob
>> 2019-08-21 20:00:27,097 INFO
>> org.apache.flink.runtime.util.ZooKeeperUtils                  - Enforcing
>> default ACL for ZK connections
>> 2019-08-21 20:00:27,097 INFO
>> org.apache.flink.runtime.util.ZooKeeperUtils                  - Using
>> '/flink-yarn/application_1566383236573_0003' as Zookeeper namespace.
>> 2019-08-21 20:00:27,161 INFO
>> org.apache.flink.shaded.curator.org.apache.curator.framework.imps.CuratorFrameworkImpl
>> - Starting
>> 2019-08-21 20:00:27,169 INFO
>> org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client
>> environment:zookeeper.version=3.4.10-39d3a4f269333c922ed3db283be479f9deacaa0f,
>> built on 03/23/2017 10:13 GMT
>> 2019-08-21 20:00:27,169 INFO
>> org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client
>> environment:host.name=host53
>> 2019-08-21 20:00:27,169 INFO
>> org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client
>> environment:java.version=1.8.0_77
>> 2019-08-21 20:00:27,169 INFO
>> org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client
>> environment:java.vendor=Oracle Corporation
>> 2019-08-21 20:00:27,169 INFO
>> org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client
>> 2019-08-21 20:00:27,170 INFO
>> org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client
>> environment:java.library.path=:/usr/hadoop/hadoop-2.7.7/lib/native:/usr/hadoop/hadoop-2.7.7/lib/native:/usr/java/packages/lib/amd64:/usr/lib64:/lib64:/lib:/usr/lib
>> 2019-08-21 20:00:27,170 INFO
>> org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client
>> environment:java.io.tmpdir=/tmp
>> 2019-08-21 20:00:27,170 INFO
>> org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client
>> environment:java.compiler=<NA>
>> 2019-08-21 20:00:27,170 INFO
>> org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client
>> environment:os.name=Linux
>> 2019-08-21 20:00:27,170 INFO
>> org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client
>> environment:os.arch=amd64
>> 2019-08-21 20:00:27,170 INFO
>> org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client
>> environment:os.version=3.16.0-0.bpo.4-amd64
>> 2019-08-21 20:00:27,170 INFO
>> org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client
>> environment:user.name=root
>> 2019-08-21 20:00:27,170 INFO
>> org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client
>> environment:user.home=/root
>> 2019-08-21 20:00:27,170 INFO
>> org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client
>> environment:user.dir=/usr/hadoop/hadoop-2.7.7/tmp/nm-local-dir/usercache/root/appcache/application_1566383236573_0003/container_1566383236573_0003_01_000001
>> 2019-08-21 20:00:27,171 INFO
>> org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  -
>> Initiating client connection, connectString=host51:2181
>> sessionTimeout=60000
>> watcher=org.apache.flink.shaded.curator.org.apache.curator.ConnectionState@319854f0
>> 2019-08-21 20:00:27,184 WARN
>> org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ClientCnxn  - SASL
>> configuration failed: javax.security.auth.login.LoginException: No JAAS
>> configuration section named 'Client' was found in specified JAAS
>> configuration file: '/tmp/jaas-4274640356638794035.conf'. Will continue
>> connection to Zookeeper server without SASL authentication, if Zookeeper
>> server allows it.
>> 2019-08-21 20:00:27,186 INFO
>> org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ClientCnxn  -
>> Opening socket connection to server host51/10.10.10.51:2181
>> 2019-08-21 20:00:27,187 ERROR
>> org.apache.flink.shaded.curator.org.apache.curator.ConnectionState  -
>> Authentication failed
>> 2019-08-21 20:00:27,187 INFO
>> org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ClientCnxn  - Socket
>> connection established to host51/10.10.10.51:2181, initiating session
>> 2019-08-21 20:00:27,193 INFO
>> org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ClientCnxn  -
>> Session establishment complete on server host51/10.10.10.51:2181,
>> sessionid = 0x1001a86de810008, negotiated timeout = 40000
>> 2019-08-21 20:00:27,193 INFO
>> org.apache.flink.runtime.blob.BlobServer                      - Created
>> BLOB server storage directory
>> /usr/hadoop/hadoop-2.7.7/tmp/nm-local-dir/usercache/root/appcache/application_1566383236573_0003/blobStore-eaa2bb2b-cddb-42e3-8ce5-bf2f83e8567a
>> 2019-08-21 20:00:27,195 INFO
>> org.apache.flink.shaded.curator.org.apache.curator.framework.state.ConnectionStateManager
>> - State change: CONNECTED
>> 2019-08-21 20:00:27,198 INFO
>> org.apache.flink.runtime.blob.BlobServer                      - Started
>> BLOB server at 0.0.0.0:58591 - max concurrent requests: 50 - max
>> backlog: 1000
>> 2019-08-21 20:00:27,213 INFO
>> org.apache.flink.runtime.metrics.MetricRegistryImpl           - No metrics
>> reporter configured, no metrics will be exposed/reported.
>> 2019-08-21 20:00:27,215 INFO
>> org.apache.flink.runtime.entrypoint.ClusterEntrypoint         - Trying to
>> start actor system at host53:0
>> 2019-08-21 20:00:27,239 INFO
>> akka.event.slf4j.Slf4jLogger                                  - Slf4jLogger
>> started
>> 2019-08-21 20:00:27,247 INFO
>> akka.remote.Remoting                                          - Starting
>> remoting
>> 2019-08-21 20:00:27,255 INFO
>> akka.remote.Remoting                                          - Remoting
>> started; listening on addresses :[akka.tcp://flink-metrics@host53:59941]
>> 2019-08-21 20:00:27,257 INFO
>> org.apache.flink.runtime.entrypoint.ClusterEntrypoint         - Actor
>> system started at akka.tcp://flink-metrics@host53:59941
>> 2019-08-21 20:00:27,263 INFO
>> org.apache.flink.runtime.blob.TransientBlobCache              - Created
>> BLOB cache storage directory
>> /usr/hadoop/hadoop-2.7.7/tmp/nm-local-dir/usercache/root/appcache/application_1566383236573_0003/blobStore-405a4797-824f-48da-bb50-2ee2e230d249
>> 2019-08-21 20:00:27,296 WARN
>> org.apache.flink.configuration.Configuration                  - Config uses
>> deprecated configuration key 'web.port' instead of proper key
>> 'rest.bind-port'
>> 2019-08-21 20:00:27,298 WARN
>> org.apache.flink.runtime.jobmaster.MiniDispatcherRestEndpoint  - Upload
>> directory
>> /tmp/flink-web-57c9cddd-64af-4e3d-96a9-4f9e78d20b86/flink-web-upload does
>> not exist, or has been deleted externally. Previously uploaded files are no
>> longer available.
>> 2019-08-21 20:00:27,298 INFO
>> org.apache.flink.runtime.jobmaster.MiniDispatcherRestEndpoint  - Created
>> directory
>> /tmp/flink-web-57c9cddd-64af-4e3d-96a9-4f9e78d20b86/flink-web-upload for
>> file uploads.
>> 2019-08-21 20:00:27,320 INFO
>> org.apache.flink.runtime.jobmaster.MiniDispatcherRestEndpoint  - Starting
>> rest endpoint.
>> 2019-08-21 20:00:27,616 INFO
>> org.apache.flink.runtime.webmonitor.WebMonitorUtils           - Determined
>> location of main cluster component log file:
>> /usr/hadoop/hadoop-2.7.7/logs/userlogs/application_1566383236573_0003/container_1566383236573_0003_01_000001/jobmanager.log
>> 2019-08-21 20:00:27,616 INFO
>> org.apache.flink.runtime.webmonitor.WebMonitorUtils           - Determined
>> location of main cluster component stdout file:
>> /usr/hadoop/hadoop-2.7.7/logs/userlogs/application_1566383236573_0003/container_1566383236573_0003_01_000001/jobmanager.out
>> 2019-08-21 20:00:27,771 INFO
>> org.apache.flink.runtime.jobmaster.MiniDispatcherRestEndpoint  - Rest
>> endpoint listening at host53:46401
>> 2019-08-21 20:00:27,772 INFO
>> org.apache.flink.runtime.leaderelection.ZooKeeperLeaderElectionService  -
>> Starting ZooKeeperLeaderElectionService
>> ZooKeeperLeaderElectionService{leaderPath='/leader/rest_server_lock'}.
>> 2019-08-21 20:00:27,795 WARN
>> org.apache.flink.shaded.curator.org.apache.curator.utils.ZKPaths  - The
>> version of ZooKeeper being used doesn't support Container nodes.
>> CreateMode.PERSISTENT will be used instead.
>> 2019-08-21 20:00:27,807 INFO
>> org.apache.flink.runtime.jobmaster.MiniDispatcherRestEndpoint  - Web
>> frontend listening at http://host53:46401.
>> 2019-08-21 20:00:27,836 INFO
>> org.apache.flink.runtime.jobmaster.MiniDispatcherRestEndpoint  -
>> http://host53:46401 was granted leadership with
>> leaderSessionID=8487757f-fd82-42b2-a1a6-adcc69e1eba1
>> 2019-08-21 20:00:27,932 INFO
>> org.apache.flink.runtime.rpc.akka.AkkaRpcService              - Starting
>> RPC endpoint for org.apache.flink.yarn.YarnResourceManager at
>> akka://flink/user/resourcemanager .
>> 2019-08-21 20:00:28,021 INFO
>> org.apache.flink.runtime.rpc.akka.AkkaRpcService              - Starting
>> RPC endpoint for org.apache.flink.runtime.dispatcher.MiniDispatcher at
>> akka://flink/user/dispatcher .
>> 2019-08-21 20:00:28,033 INFO
>> org.apache.flink.runtime.leaderretrieval.ZooKeeperLeaderRetrievalService  -
>> Starting ZooKeeperLeaderRetrievalService /leader/resource_manager_lock.
>> 2019-08-21 20:00:28,033 INFO
>> org.apache.flink.runtime.leaderretrieval.ZooKeeperLeaderRetrievalService  -
>> Starting ZooKeeperLeaderRetrievalService /leader/dispatcher_lock.
>> 2019-08-21 20:00:28,033 INFO
>> org.apache.flink.runtime.leaderelection.ZooKeeperLeaderElectionService  -
>> Starting ZooKeeperLeaderElectionService
>> ZooKeeperLeaderElectionService{leaderPath='/leader/dispatcher_lock'}.
>> 2019-08-21 20:00:28,045 INFO
>> org.apache.flink.runtime.dispatcher.MiniDispatcher            - Dispatcher
>> akka.tcp://flink@host53:59609/user/dispatcher was granted leadership
>> with fencing token 6216d17e-56b1-43bb-9bb3-f9fa5b8921be
>> 2019-08-21 20:00:28,048 INFO
>> org.apache.flink.runtime.dispatcher.MiniDispatcher            - Recovering
>> all persisted jobs.
>> 2019-08-21 20:00:28,054 INFO
>> org.apache.hadoop.yarn.client.RMProxy                         - Connecting
>> to ResourceManager at host51/10.10.10.51:8030
>> 2019-08-21 20:00:28,078 INFO
>> org.apache.flink.runtime.rpc.akka.AkkaRpcService              - Starting
>> RPC endpoint for org.apache.flink.runtime.jobmaster.JobMaster at
>> akka://flink/user/jobmanager_0 .
>> 2019-08-21 20:00:28,086 INFO
>> org.apache.flink.runtime.jobmaster.JobMaster                  -
>> Initializing job StreamKafkaMain (e5ea83238d721e932b4645589660406c).
>> 2019-08-21 20:00:28,099 INFO
>> org.apache.flink.runtime.jobmaster.JobMaster                  - Using
>> restart strategy
>> FixedDelayRestartStrategy(maxNumberRestartAttempts=2147483647,
>> delayBetweenRestartAttempts=0) for StreamKafkaMain
>> (e5ea83238d721e932b4645589660406c).
>> 2019-08-21 20:00:28,122 INFO
>> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Job
>> recovers via failover strategy: full graph restart
>> 2019-08-21 20:00:28,153 INFO
>> org.apache.flink.runtime.jobmaster.JobMaster                  - Running
>> initialization on master for job StreamKafkaMain
>> (e5ea83238d721e932b4645589660406c).
>> 2019-08-21 20:00:28,154 INFO
>> org.apache.flink.runtime.jobmaster.JobMaster                  -
>> Successfully ran initialization on master in 0 ms.
>> 2019-08-21 20:00:28,187 INFO
>> org.apache.flink.runtime.util.ZooKeeperUtils                  - Initialized
>> ZooKeeperCompletedCheckpointStore in
>> '/checkpoints/e5ea83238d721e932b4645589660406c'.
>> 2019-08-21 20:00:28,196 INFO
>> org.apache.flink.yarn.YarnResourceManager                     - Recovered 0
>> containers from previous attempts ([]).
>> 2019-08-21 20:00:28,199 INFO
>> org.apache.hadoop.yarn.client.api.impl.ContainerManagementProtocolProxy  -
>> yarn.client.max-cached-nodemanagers-proxies : 0
>> 2019-08-21 20:00:28,200 INFO
>> org.apache.flink.runtime.leaderelection.ZooKeeperLeaderElectionService  -
>> Starting ZooKeeperLeaderElectionService
>> ZooKeeperLeaderElectionService{leaderPath='/leader/resource_manager_lock'}.
>> 2019-08-21 20:00:28,203 INFO
>> org.apache.flink.runtime.jobmaster.JobMaster                  - No state
>> backend has been configured, using default (Memory / JobManager)
>> MemoryStateBackend (data in heap memory / checkpoints to JobManager)
>> (checkpoints: 'null', savepoints: 'null', asynchronous: TRUE, maxStateSize:
>> 5242880)
>> 2019-08-21 20:00:28,211 INFO
>> org.apache.flink.yarn.YarnResourceManager                     -
>> ResourceManager akka.tcp://flink@host53:59609/user/resourcemanager was
>> granted leadership with fencing token 97537c0d61bbf091e37cb6c45b784568
>> 2019-08-21 20:00:28,214 INFO
>> org.apache.flink.runtime.resourcemanager.slotmanager.SlotManager  -
>> Starting the SlotManager.
>> 2019-08-21 20:00:28,225 INFO
>> org.apache.flink.runtime.checkpoint.ZooKeeperCompletedCheckpointStore  -
>> Recovering checkpoints from ZooKeeper.
>> 2019-08-21 20:00:28,229 INFO
>> org.apache.flink.runtime.checkpoint.ZooKeeperCompletedCheckpointStore  -
>> Found 0 checkpoints in ZooKeeper.
>> 2019-08-21 20:00:28,229 INFO
>> org.apache.flink.runtime.checkpoint.ZooKeeperCompletedCheckpointStore  -
>> Trying to fetch 0 checkpoints from storage.
>> 2019-08-21 20:00:28,233 INFO
>> org.apache.flink.runtime.leaderelection.ZooKeeperLeaderElectionService  -
>> Starting ZooKeeperLeaderElectionService
>> ZooKeeperLeaderElectionService{leaderPath='/leader/e5ea83238d721e932b4645589660406c/job_manager_lock'}.
>> 2019-08-21 20:00:28,248 INFO
>> org.apache.flink.runtime.jobmaster.JobManagerRunner           - JobManager
>> runner for job StreamKafkaMain (e5ea83238d721e932b4645589660406c) was
>> granted leadership with session id 12c96c94-ebbd-4991-a457-6f0320d71bc1 at
>> akka.tcp://flink@host53:59609/user/jobmanager_0.
>> 2019-08-21 20:00:28,260 INFO
>> org.apache.flink.runtime.leaderretrieval.ZooKeeperLeaderRetrievalService  -
>> Starting ZooKeeperLeaderRetrievalService /leader/resource_manager_lock.
>> 2019-08-21 20:00:28,260 INFO
>> org.apache.flink.runtime.jobmaster.JobMaster                  - Starting
>> execution of job StreamKafkaMain (e5ea83238d721e932b4645589660406c) under
>> job master id a4576f0320d71bc112c96c94ebbd4991.
>> 2019-08-21 20:00:28,261 INFO
>> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Job
>> StreamKafkaMain (e5ea83238d721e932b4645589660406c) switched from state
>> CREATED to RUNNING.
>> 2019-08-21 20:00:28,266 INFO
>> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source:
>> Custom Source -> Flat Map -> Filter -> Filter -> Map ->
>> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) ->
>> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (1/4)
>> (ee39866835f7b7387f518e352787a532) switched from CREATED to SCHEDULED.
>> 2019-08-21 20:00:28,280 INFO
>> org.apache.flink.runtime.jobmaster.slotpool.SlotPoolImpl      - Cannot
>> serve slot request, no ResourceManager connected. Adding as pending request
>> [SlotRequestId{f6123820ffb8b3e906b876e9dea4497b}]
>> 2019-08-21 20:00:28,287 INFO
>> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source:
>> Custom Source -> Flat Map -> Filter -> Filter -> Map ->
>> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) ->
>> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (2/4)
>> (f1a893fc5e2440331c8e6db1874ad220) switched from CREATED to SCHEDULED.
>> 2019-08-21 20:00:28,288 INFO
>> org.apache.flink.runtime.jobmaster.slotpool.SlotPoolImpl      - Cannot
>> serve slot request, no ResourceManager connected. Adding as pending request
>> [SlotRequestId{fdbb2e71c9bc795452687396abff7bf8}]
>> 2019-08-21 20:00:28,288 INFO
>> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source:
>> Custom Source -> Flat Map -> Filter -> Filter -> Map ->
>> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) ->
>> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (3/4)
>> (4756a0450881325af5fb396a18dbb8ae) switched from CREATED to SCHEDULED.
>> 2019-08-21 20:00:28,289 INFO
>> org.apache.flink.runtime.jobmaster.slotpool.SlotPoolImpl      - Cannot
>> serve slot request, no ResourceManager connected. Adding as pending request
>> [SlotRequestId{13970fab9312acf9d5063b1d1a33aa0e}]
>> 2019-08-21 20:00:28,289 INFO
>> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source:
>> Custom Source -> Flat Map -> Filter -> Filter -> Map ->
>> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) ->
>> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (4/4)
>> (78f520582607e26b365fca483fc98d4c) switched from CREATED to SCHEDULED.
>> 2019-08-21 20:00:28,289 INFO
>> org.apache.flink.runtime.jobmaster.slotpool.SlotPoolImpl      - Cannot
>> serve slot request, no ResourceManager connected. Adding as pending request
>> [SlotRequestId{0621470f796aa3ea4291d1b8b423e428}]
>> 2019-08-21 20:00:28,289 INFO
>> org.apache.flink.runtime.executiongraph.ExecutionGraph        - window:
>> (TumblingGroupWindow('w$, 'rowtime, 60000.millis)), select: (COUNT(*) AS
>> pv, COUNT(DISTINCT curuserid) AS uv, start('w$) AS w$start, end('w$) AS
>> w$end, rowtime('w$) AS w$rowtime, proctime('w$) AS w$proctime) -> select:
>> (pv, uv, utc2local(w$end) AS EXPR$2) -> to: Row (1/1)
>> (0071e1c7c4f70646914329f81dcbd349) switched from CREATED to SCHEDULED.
>> 2019-08-21 20:00:28,291 INFO
>> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print
>> to Std. Out (1/4) (0a206444b10553cce42fa34ea0b15497) switched from CREATED
>> to SCHEDULED.
>> 2019-08-21 20:00:28,291 INFO
>> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print
>> to Std. Out (2/4) (a5d1bb759ee6c8001313fe39c7982a8b) switched from CREATED
>> to SCHEDULED.
>> 2019-08-21 20:00:28,291 INFO
>> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print
>> to Std. Out (3/4) (9543bbde6538ea9d407af3012fede5c8) switched from CREATED
>> to SCHEDULED.
>> 2019-08-21 20:00:28,291 INFO
>> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print
>> to Std. Out (4/4) (0e8c4210bd81e581fbf354b86397ce41) switched from CREATED
>> to SCHEDULED.
>> 2019-08-21 20:00:28,291 INFO
>> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map ->
>> Sink: Print to Std. Out (1/4) (39d437bceb033f5bd37747e9142c7f0e) switched
>> from CREATED to SCHEDULED.
>> 2019-08-21 20:00:28,291 INFO
>> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map ->
>> Sink: Print to Std. Out (2/4) (aa4f64edc4f10da2e8bdbf6ebf0aeb83) switched
>> from CREATED to SCHEDULED.
>> 2019-08-21 20:00:28,291 INFO
>> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map ->
>> Sink: Print to Std. Out (3/4) (f109918ce195d1c88c2c63a44934217f) switched
>> from CREATED to SCHEDULED.
>> 2019-08-21 20:00:28,292 INFO
>> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map ->
>> Sink: Print to Std. Out (4/4) (9aa5bb28f3ba75afc1de47bd9a4e5d01) switched
>> from CREATED to SCHEDULED.
>> 2019-08-21 20:00:28,295 INFO
>> org.apache.flink.runtime.jobmaster.JobMaster                  - Connecting
>> to ResourceManager akka.tcp://flink@host53:59609/user/resourcemanager(97537c0d61bbf091e37cb6c45b784568)
>> 2019-08-21 20:00:28,300 INFO
>> org.apache.flink.runtime.jobmaster.JobMaster                  - Resolved
>> ResourceManager address, beginning registration
>> 2019-08-21 20:00:28,300 INFO
>> org.apache.flink.runtime.jobmaster.JobMaster                  -
>> Registration at ResourceManager attempt 1 (timeout=100ms)
>> 2019-08-21 20:00:28,304 INFO
>> org.apache.flink.runtime.leaderretrieval.ZooKeeperLeaderRetrievalService  -
>> Starting ZooKeeperLeaderRetrievalService
>> /leader/e5ea83238d721e932b4645589660406c/job_manager_lock.
>> 2019-08-21 20:00:28,304 INFO
>> org.apache.flink.yarn.YarnResourceManager                     - Registering
>> job manager [hidden email]://flink@host53:59609/user/jobmanager_0
>> for job e5ea83238d721e932b4645589660406c.
>> 2019-08-21 20:00:28,310 INFO
>> org.apache.flink.yarn.YarnResourceManager                     - Registered
>> job manager [hidden email]://flink@host53:59609/user/jobmanager_0
>> for job e5ea83238d721e932b4645589660406c.
>> 2019-08-21 20:00:28,313 INFO
>> org.apache.flink.runtime.jobmaster.JobMaster                  - JobManager
>> successfully registered at ResourceManager, leader id:
>> 97537c0d61bbf091e37cb6c45b784568.
>> 2019-08-21 20:00:28,314 INFO
>> org.apache.flink.runtime.jobmaster.slotpool.SlotPoolImpl      - Requesting
>> new slot [SlotRequestId{13970fab9312acf9d5063b1d1a33aa0e}] and profile
>> ResourceProfile{cpuCores=-1.0, heapMemoryInMB=-1, directMemoryInMB=0,
>> nativeMemoryInMB=0, networkMemoryInMB=0} from resource manager.
>> 2019-08-21 20:00:28,315 INFO
>> org.apache.flink.yarn.YarnResourceManager                     - Request
>> slot with profile ResourceProfile{cpuCores=-1.0, heapMemoryInMB=-1,
>> directMemoryInMB=0, nativeMemoryInMB=0, networkMemoryInMB=0} for job
>> e5ea83238d721e932b4645589660406c with allocation id
>> faa6a521758e5be4b3dc1955ca0f51b4.
>> 2019-08-21 20:00:28,316 INFO
>> org.apache.flink.runtime.jobmaster.slotpool.SlotPoolImpl      - Requesting
>> new slot [SlotRequestId{fdbb2e71c9bc795452687396abff7bf8}] and profile
>> ResourceProfile{cpuCores=-1.0, heapMemoryInMB=-1, directMemoryInMB=0,
>> nativeMemoryInMB=0, networkMemoryInMB=0} from resource manager.
>> 2019-08-21 20:00:28,316 INFO
>> org.apache.flink.runtime.jobmaster.slotpool.SlotPoolImpl      - Requesting
>> new slot [SlotRequestId{0621470f796aa3ea4291d1b8b423e428}] and profile
>> ResourceProfile{cpuCores=-1.0, heapMemoryInMB=-1, directMemoryInMB=0,
>> nativeMemoryInMB=0, networkMemoryInMB=0} from resource manager.
>> 2019-08-21 20:00:28,316 INFO
>> org.apache.flink.runtime.jobmaster.slotpool.SlotPoolImpl      - Requesting
>> new slot [SlotRequestId{f6123820ffb8b3e906b876e9dea4497b}] and profile
>> ResourceProfile{cpuCores=-1.0, heapMemoryInMB=-1, directMemoryInMB=0,
>> nativeMemoryInMB=0, networkMemoryInMB=0} from resource manager.
>> 2019-08-21 20:00:28,321 INFO
>> org.apache.flink.yarn.YarnResourceManager                     - Requesting
>> new TaskExecutor container with resources <memory:8192, vCores:2>. Number
>> pending requests 1.
>> 2019-08-21 20:00:28,322 INFO
>> org.apache.flink.yarn.YarnResourceManager                     - Request
>> slot with profile ResourceProfile{cpuCores=-1.0, heapMemoryInMB=-1,
>> directMemoryInMB=0, nativeMemoryInMB=0, networkMemoryInMB=0} for job
>> e5ea83238d721e932b4645589660406c with allocation id
>> a2b419aea4a373686eeed6c46163a528.
>> 2019-08-21 20:00:28,323 INFO
>> org.apache.flink.yarn.YarnResourceManager                     - Request
>> slot with profile ResourceProfile{cpuCores=-1.0, heapMemoryInMB=-1,
>> directMemoryInMB=0, nativeMemoryInMB=0, networkMemoryInMB=0} for job
>> e5ea83238d721e932b4645589660406c with allocation id
>> cbc11737ce6729cfe45f24c43180b5c8.
>> 2019-08-21 20:00:28,323 INFO
>> org.apache.flink.yarn.YarnResourceManager                     - Requesting
>> new TaskExecutor container with resources <memory:8192, vCores:2>. Number
>> pending requests 2.
>> 2019-08-21 20:00:28,323 INFO
>> org.apache.flink.yarn.YarnResourceManager                     - Request
>> slot with profile ResourceProfile{cpuCores=-1.0, heapMemoryInMB=-1,
>> directMemoryInMB=0, nativeMemoryInMB=0, networkMemoryInMB=0} for job
>> e5ea83238d721e932b4645589660406c with allocation id
>> f6566567c173a55b2bc52b72bcb24ff8.
>> 2019-08-21 20:00:31,597 INFO
>> org.apache.flink.runtime.checkpoint.CheckpointCoordinator     - Checkpoint
>> triggering task Source: Custom Source -> Flat Map -> Filter -> Filter ->
>> Map -> Timestamps/Watermarks -> from: (request, curuserid, timelong,
>> rowtime) -> select: (rowtime, 0 AS $f1, curuserid) -> time attribute:
>> (rowtime) (1/4) of job e5ea83238d721e932b4645589660406c is not in state
>> RUNNING but SCHEDULED instead. Aborting checkpoint.
>> 2019-08-21 20:00:33,749 INFO
>> org.apache.hadoop.yarn.client.api.impl.AMRMClientImpl         - Received
>> new token for : host53:59938
>> 2019-08-21 20:00:33,749 INFO
>> org.apache.hadoop.yarn.client.api.impl.AMRMClientImpl         - Received
>> new token for : host52:33246
>> 2019-08-21 20:00:33,765 INFO
>> org.apache.flink.yarn.YarnResourceManager                     - Received
>> new container: container_1566383236573_0003_01_000002 - Remaining pending
>> container requests: 2
>> 2019-08-21 20:00:33,765 INFO
>> org.apache.flink.yarn.YarnResourceManager                     - Removing
>> container request Capability[<memory:8192, vCores:2>]Priority[1]. Pending
>> container requests 1.
>> 2019-08-21 20:00:34,009 INFO
>> org.apache.flink.yarn.YarnResourceManager                     - Creating
>> container launch context for TaskManagers
>> 2019-08-21 20:00:34,010 INFO
>> org.apache.flink.yarn.YarnResourceManager                     - Starting
>> TaskManagers
>> 2019-08-21 20:00:34,021 INFO
>> org.apache.hadoop.yarn.client.api.impl.ContainerManagementProtocolProxy  -
>> Opening proxy : host53:59938
>> 2019-08-21 20:00:34,066 INFO
>> org.apache.flink.yarn.YarnResourceManager                     - Received
>> new container: container_1566383236573_0003_01_000003 - Remaining pending
>> container requests: 1
>> 2019-08-21 20:00:34,066 INFO
>> org.apache.flink.yarn.YarnResourceManager                     - Removing
>> container request Capability[<memory:8192, vCores:2>]Priority[1]. Pending
>> container requests 0.
>> 2019-08-21 20:00:34,088 INFO
>> org.apache.flink.yarn.YarnResourceManager                     - Creating
>> container launch context for TaskManagers
>> 2019-08-21 20:00:34,089 INFO
>> org.apache.flink.yarn.YarnResourceManager                     - Starting
>> TaskManagers
>> 2019-08-21 20:00:34,090 INFO
>> org.apache.hadoop.yarn.client.api.impl.ContainerManagementProtocolProxy  -
>> Opening proxy : host52:33246
>> 2019-08-21 20:00:36,597 INFO
>> org.apache.flink.runtime.checkpoint.CheckpointCoordinator     - Checkpoint
>> triggering task Source: Custom Source -> Flat Map -> Filter -> Filter ->
>> Map -> Timestamps/Watermarks -> from: (request, curuserid, timelong,
>> rowtime) -> select: (rowtime, 0 AS $f1, curuserid) -> time attribute:
>> (rowtime) (1/4) of job e5ea83238d721e932b4645589660406c is not in state
>> RUNNING but SCHEDULED instead. Aborting checkpoint.
>> 2019-08-21 20:00:37,856 INFO
>> org.apache.flink.yarn.YarnResourceManager                     - Registering
>> TaskManager with ResourceID container_1566383236573_0003_01_000002
>> (akka.tcp://flink@host53:34608/user/taskmanager_0) at ResourceManager
>> 2019-08-21 20:00:39,573 INFO
>> org.apache.flink.yarn.YarnResourceManager                     - Registering
>> TaskManager with ResourceID container_1566383236573_0003_01_000003
>> (akka.tcp://flink@host52:32931/user/taskmanager_0) at ResourceManager
>> 2019-08-21 20:00:39,659 INFO
>> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source:
>> Custom Source -> Flat Map -> Filter -> Filter -> Map ->
>> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) ->
>> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (1/4)
>> (ee39866835f7b7387f518e352787a532) switched from SCHEDULED to DEPLOYING.
>> 2019-08-21 20:00:39,659 INFO
>> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying
>> Source: Custom Source -> Flat Map -> Filter -> Filter -> Map ->
>> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) ->
>> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (1/4)
>> (attempt #0) to container_1566383236573_0003_01_000002 @ host53
>> (dataPort=52456)
>> 2019-08-21 20:00:39,664 INFO
>> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source:
>> Custom Source -> Flat Map -> Filter -> Filter -> Map ->
>> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) ->
>> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (2/4)
>> (f1a893fc5e2440331c8e6db1874ad220) switched from SCHEDULED to DEPLOYING.
>> 2019-08-21 20:00:39,664 INFO
>> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying
>> Source: Custom Source -> Flat Map -> Filter -> Filter -> Map ->
>> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) ->
>> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (2/4)
>> (attempt #0) to container_1566383236573_0003_01_000002 @ host53
>> (dataPort=52456)
>> 2019-08-21 20:00:39,665 INFO
>> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source:
>> Custom Source -> Flat Map -> Filter -> Filter -> Map ->
>> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) ->
>> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (3/4)
>> (4756a0450881325af5fb396a18dbb8ae) switched from SCHEDULED to DEPLOYING.
>> 2019-08-21 20:00:39,666 INFO
>> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying
>> Source: Custom Source -> Flat Map -> Filter -> Filter -> Map ->
>> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) ->
>> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (3/4)
>> (attempt #0) to container_1566383236573_0003_01_000003 @ host52
>> (dataPort=39650)
>> 2019-08-21 20:00:39,666 INFO
>> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source:
>> Custom Source -> Flat Map -> Filter -> Filter -> Map ->
>> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) ->
>> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (4/4)
>> (78f520582607e26b365fca483fc98d4c) switched from SCHEDULED to DEPLOYING.
>> 2019-08-21 20:00:39,666 INFO
>> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying
>> Source: Custom Source -> Flat Map -> Filter -> Filter -> Map ->
>> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) ->
>> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (4/4)
>> (attempt #0) to container_1566383236573_0003_01_000003 @ host52
>> (dataPort=39650)
>> 2019-08-21 20:00:39,666 INFO
>> org.apache.flink.runtime.executiongraph.ExecutionGraph        - window:
>> (TumblingGroupWindow('w$, 'rowtime, 60000.millis)), select: (COUNT(*) AS
>> pv, COUNT(DISTINCT curuserid) AS uv, start('w$) AS w$start, end('w$) AS
>> w$end, rowtime('w$) AS w$rowtime, proctime('w$) AS w$proctime) -> select:
>> (pv, uv, utc2local(w$end) AS EXPR$2) -> to: Row (1/1)
>> (0071e1c7c4f70646914329f81dcbd349) switched from SCHEDULED to DEPLOYING.
>> 2019-08-21 20:00:39,666 INFO
>> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying
>> window: (TumblingGroupWindow('w$, 'rowtime, 60000.millis)), select:
>> (COUNT(*) AS pv, COUNT(DISTINCT curuserid) AS uv, start('w$) AS w$start,
>> end('w$) AS w$end, rowtime('w$) AS w$rowtime, proctime('w$) AS w$proctime)
>> -> select: (pv, uv, utc2local(w$end) AS EXPR$2) -> to: Row (1/1) (attempt
>> #0) to container_1566383236573_0003_01_000002 @ host53 (dataPort=52456)
>> 2019-08-21 20:00:39,670 INFO
>> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print
>> to Std. Out (1/4) (0a206444b10553cce42fa34ea0b15497) switched from
>> SCHEDULED to DEPLOYING.
>> 2019-08-21 20:00:39,670 INFO
>> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying
>> Sink: Print to Std. Out (1/4) (attempt #0) to
>> container_1566383236573_0003_01_000003 @ host52 (dataPort=39650)
>> 2019-08-21 20:00:39,671 INFO
>> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print
>> to Std. Out (2/4) (a5d1bb759ee6c8001313fe39c7982a8b) switched from
>> SCHEDULED to DEPLOYING.
>> 2019-08-21 20:00:39,672 INFO
>> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying
>> Sink: Print to Std. Out (2/4) (attempt #0) to
>> container_1566383236573_0003_01_000003 @ host52 (dataPort=39650)
>> 2019-08-21 20:00:39,672 INFO
>> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print
>> to Std. Out (3/4) (9543bbde6538ea9d407af3012fede5c8) switched from
>> SCHEDULED to DEPLOYING.
>> 2019-08-21 20:00:39,672 INFO
>> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying
>> Sink: Print to Std. Out (3/4) (attempt #0) to
>> container_1566383236573_0003_01_000002 @ host53 (dataPort=52456)
>> 2019-08-21 20:00:39,672 INFO
>> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print
>> to Std. Out (4/4) (0e8c4210bd81e581fbf354b86397ce41) switched from
>> SCHEDULED to DEPLOYING.
>> 2019-08-21 20:00:39,673 INFO
>> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying
>> Sink: Print to Std. Out (4/4) (attempt #0) to
>> container_1566383236573_0003_01_000002 @ host53 (dataPort=52456)
>> 2019-08-21 20:00:39,673 INFO
>> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map ->
>> Sink: Print to Std. Out (1/4) (39d437bceb033f5bd37747e9142c7f0e) switched
>> from SCHEDULED to DEPLOYING.
>> 2019-08-21 20:00:39,673 INFO
>> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying
>> Map -> Sink: Print to Std. Out (1/4) (attempt #0) to
>> container_1566383236573_0003_01_000003 @ host52 (dataPort=39650)
>> 2019-08-21 20:00:39,675 INFO
>> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map ->
>> Sink: Print to Std. Out (2/4) (aa4f64edc4f10da2e8bdbf6ebf0aeb83) switched
>> from SCHEDULED to DEPLOYING.
>> 2019-08-21 20:00:39,675 INFO
>> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying
>> Map -> Sink: Print to Std. Out (2/4) (attempt #0) to
>> container_1566383236573_0003_01_000003 @ host52 (dataPort=39650)
>> 2019-08-21 20:00:39,675 INFO
>> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map ->
>> Sink: Print to Std. Out (3/4) (f109918ce195d1c88c2c63a44934217f) switched
>> from SCHEDULED to DEPLOYING.
>> 2019-08-21 20:00:39,675 INFO
>> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying
>> Map -> Sink: Print to Std. Out (3/4) (attempt #0) to
>> container_1566383236573_0003_01_000002 @ host53 (dataPort=52456)
>> 2019-08-21 20:00:39,676 INFO
>> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map ->
>> Sink: Print to Std. Out (4/4) (9aa5bb28f3ba75afc1de47bd9a4e5d01) switched
>> from SCHEDULED to DEPLOYING.
>> 2019-08-21 20:00:39,676 INFO
>> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying
>> Map -> Sink: Print to Std. Out (4/4) (attempt #0) to
>> container_1566383236573_0003_01_000002 @ host53 (dataPort=52456)
>> 2019-08-21 20:00:39,782 INFO
>> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print
>> to Std. Out (4/4) (0e8c4210bd81e581fbf354b86397ce41) switched from
>> DEPLOYING to RUNNING.
>> 2019-08-21 20:00:39,783 INFO
>> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print
>> to Std. Out (3/4) (9543bbde6538ea9d407af3012fede5c8) switched from
>> DEPLOYING to RUNNING.
>> 2019-08-21 20:00:39,784 INFO
>> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map ->
>> Sink: Print to Std. Out (4/4) (9aa5bb28f3ba75afc1de47bd9a4e5d01) switched
>> from DEPLOYING to RUNNING.
>> 2019-08-21 20:00:39,787 INFO
>> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map ->
>> Sink: Print to Std. Out (3/4) (f109918ce195d1c88c2c63a44934217f) switched
>> from DEPLOYING to RUNNING.
>> 2019-08-21 20:00:39,806 INFO
>> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source:
>> Custom Source -> Flat Map -> Filter -> Filter -> Map ->
>> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) ->
>> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (2/4)
>> (f1a893fc5e2440331c8e6db1874ad220) switched from DEPLOYING to RUNNING.
>> 2019-08-21 20:00:39,809 INFO
>> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source:
>> Custom Source -> Flat Map -> Filter -> Filter -> Map ->
>> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) ->
>> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (1/4)
>> (ee39866835f7b7387f518e352787a532) switched from DEPLOYING to RUNNING.
>> 2019-08-21 20:00:39,809 INFO
>> org.apache.flink.runtime.executiongraph.ExecutionGraph        - window:
>> (TumblingGroupWindow('w$, 'rowtime, 60000.millis)), select: (COUNT(*) AS
>> pv, COUNT(DISTINCT curuserid) AS uv, start('w$) AS w$start, end('w$) AS
>> w$end, rowtime('w$) AS w$rowtime, proctime('w$) AS w$proctime) -> select:
>> (pv, uv, utc2local(w$end) AS EXPR$2) -> to: Row (1/1)
>> (0071e1c7c4f70646914329f81dcbd349) switched from DEPLOYING to RUNNING.
>> 2019-08-21 20:00:39,991 INFO
>> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print
>> to Std. Out (1/4) (0a206444b10553cce42fa34ea0b15497) switched from
>> DEPLOYING to RUNNING.
>> 2019-08-21 20:00:39,992 INFO
>> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print
>> to Std. Out (2/4) (a5d1bb759ee6c8001313fe39c7982a8b) switched from
>> DEPLOYING to RUNNING.
>> 2019-08-21 20:00:39,996 INFO
>> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map ->
>> Sink: Print to Std. Out (1/4) (39d437bceb033f5bd37747e9142c7f0e) switched
>> from DEPLOYING to RUNNING.
>> 2019-08-21 20:00:39,997 INFO
>> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map ->
>> Sink: Print to Std. Out (2/4) (aa4f64edc4f10da2e8bdbf6ebf0aeb83) switched
>> from DEPLOYING to RUNNING.
>> 2019-08-21 20:00:40,005 INFO
>> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source:
>> Custom Source -> Flat Map -> Filter -> Filter -> Map ->
>> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) ->
>> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (4/4)
>> (78f520582607e26b365fca483fc98d4c) switched from DEPLOYING to RUNNING.
>> 2019-08-21 20:00:40,006 INFO
>> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source:
>> Custom Source -> Flat Map -> Filter -> Filter -> Map ->
>> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) ->
>> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (3/4)
>> (4756a0450881325af5fb396a18dbb8ae) switched from DEPLOYING to RUNNING.
>> 2019-08-21 20:00:41,605 INFO
>> org.apache.flink.runtime.checkpoint.CheckpointCoordinator     - Triggering
>> checkpoint 1 @ 1566388841597 for job e5ea83238d721e932b4645589660406c.
>> 2019-08-21 20:00:42,466 INFO
>> org.apache.flink.runtime.checkpoint.CheckpointCoordinator     - Completed
>> checkpoint 1 for job e5ea83238d721e932b4645589660406c (7994 bytes in 821
>> ms).
>>
>>
>>
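>> ------------------ Original Message ------------------
>> From: "Yun Tang"<[hidden email]>;
>> Sent: Thursday, August 22, 2019, 11:08 AM
>> To: "user-zh"<[hidden email]>;
>>
>> Subject: Re: Flink startup waits 10 minutes
>>
>>
>>
>> Hi
>>
>> A Flink on YARN job can be slow to start for many reasons, for example
>> waiting because resources are insufficient, or containers exiting right
>> after they are requested. The default slot request timeout is 5 minutes,
>> so I suspect your job hit a slot request timeout and then requested the
>> slots again. Please share the JobManager log so we can analyze further.
>>
>> Best regards
>> Yun Tang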
>> ________________________________
>> From: 々守护々 <[hidden email]>
>> Sent: Thursday, August 22, 2019 11:04
>> To: user-zh <[hidden email]>
>> Subject: Flink startup waits 10 minutes
>>
>> Hello Flink community experts!
>>       I am using Flink SQL (Flink 1.8.1) and adjusted the memory parameters in yarn-site.xml on the Hadoop cluster:
>>         <property>
>>                 <name>yarn.nodemanager.resource.memory-mb</name>
>>                 <value>16384</value>
>>         </property>
>>         <property>
>>                 <name>yarn.scheduler.minimum-allocation-mb</name>
>>                 <value>1024</value>
>>         </property>
>>         <property>
>>                 <name>yarn.scheduler.maximum-allocation-mb</name>
>>                 <value>8192</value>
>>         </property>
>>
>>
>>
>> Then I start the Flink on YARN job with these parameters: ./flink-1.8.1/bin/flink run -m yarn-cluster -p
>> 4 -d -c streaming.StreamKafkaMain -yn 4 -ys 2 -yjm 2048 -ytm 8192  -ynm
>> bigdata6 ./flink_bigdata6.jar
>> The job does start successfully in the end, but startup takes 10 minutes, which is too long. Does anyone know how to fix this?
>>
>

Re: Re: Flink startup waits 10 minutes

守护
In reply to this post by tangjunliang@huitongjy.com
Yes, the cluster has plenty of resources; a screenshot is attached. The startup just stops there and does not go any further.


------------------ Original Message ------------------
Sent: Thursday, August 22, 2019, 11:34 AM
To: "user-zh"<[hidden email]>;
Subject: Re: Re: Flink startup waits 10 minutes

Did your cluster have enough resources at that time?



[hidden email]
 
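From: 々守护々
Sent: 2019-08-22 11:32
To: user-zh
Subject: Re: Re: Flink startup waits 10 minutes
Here is what happens: at startup the client log simply stops there, and the YARN application is only created after a 10-minute wait.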


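------------------ Original Message ------------------
From: "Yun Tang"<[hidden email]>;
Sent: Thursday, August 22, 2019, 11:23 AM
To: "user-zh"<[hidden email]>;
Subject: Re: Re: Flink startup waits 10 minutes

The JobManager started at 20:00:25, and by 20:00:42 every task was RUNNING and the first checkpoint had even completed. That is only 17 seconds in total, so where does the 10-minute wait come from?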
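
The 17-second figure can be checked directly from the log timestamps. The following is a minimal sketch, not part of Flink: the helper name `log_span_seconds` is hypothetical, and it assumes every relevant line begins with a `yyyy-MM-dd HH:mm:ss,SSS` timestamp as in the JobManager log in this thread. The two sample lines are copied from that log.

```python
from datetime import datetime

# Timestamp layout used at the start of each JobManager log line (assumed).
TIMESTAMP_FORMAT = "%Y-%m-%d %H:%M:%S,%f"
TIMESTAMP_LENGTH = 23  # len("2019-08-21 20:00:25,430")


def log_span_seconds(lines):
    """Return the seconds between the first and last timestamped lines."""
    stamps = []
    for line in lines:
        try:
            stamps.append(
                datetime.strptime(line[:TIMESTAMP_LENGTH], TIMESTAMP_FORMAT)
            )
        except ValueError:
            # Continuation lines (wrapped messages, stack traces) carry no timestamp.
            continue
    if not stamps:
        raise ValueError("no timestamped lines found")
    return (stamps[-1] - stamps[0]).total_seconds()


sample = [
    "2019-08-21 20:00:25,430 INFO  org.apache.flink.runtime.entrypoint.ClusterEntrypoint"
    "         - Registered UNIX signal handlers for [TERM, HUP, INT]",
    "2019-08-21 20:00:42,466 INFO  org.apache.flink.runtime.checkpoint.CheckpointCoordinator"
    "     - Completed checkpoint 1 for job e5ea83238d721e932b4645589660406c"
    " (7994 bytes in 821 ms).",
]

print(log_span_seconds(sample))  # about 17 seconds
```

Running the same helper over the client-side log, rather than the JobManager log, would show whether the 10 minutes are spent before the YARN application is even submitted.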
________________________________
From: 々守护々 <[hidden email]>
Sent: Thursday, August 22, 2019 11:18
To: user-zh <[hidden email]>
Subject: Re: Flink startup waits 10 minutes

Hello, this is my JobManager startup log. Please take a look, thanks!


2019-08-21 20:00:25,428 INFO  org.apache.flink.runtime.entrypoint.ClusterEntrypoint         - --------------------------------------------------------------------------------
2019-08-21 20:00:25,430 INFO  org.apache.flink.runtime.entrypoint.ClusterEntrypoint         - Registered UNIX signal handlers for [TERM, HUP, INT]
2019-08-21 20:00:25,433 INFO  org.apache.flink.runtime.entrypoint.ClusterEntrypoint         - YARN daemon is running as: root Yarn client user obtainer: root
2019-08-21 20:00:25,437 INFO  org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: high-availability.cluster-id, application_1566383236573_0003
2019-08-21 20:00:25,437 INFO  org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: jobmanager.rpc.address, localhost
2019-08-21 20:00:25,437 INFO  org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: jobmanager.rpc.port, 6123
2019-08-21 20:00:25,437 INFO  org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: high-availability.zookeeper.path.root, /flink-yarn
2019-08-21 20:00:25,437 INFO  org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: high-availability.storageDir, hdfs://host51:9000/flink/ha-yarn
2019-08-21 20:00:25,438 INFO  org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: high-availability.zookeeper.quorum, host51:2181
2019-08-21 20:00:25,438 INFO  org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: internal.cluster.execution-mode, DETACHED
2019-08-21 20:00:25,438 INFO  org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: high-availability, zookeeper
2019-08-21 20:00:25,438 INFO  org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: parallelism.default, 1
2019-08-21 20:00:25,438 INFO  org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: yarn.application-attempts, 10
2019-08-21 20:00:25,438 INFO  org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: taskmanager.numberOfTaskSlots, 2
2019-08-21 20:00:25,439 INFO  org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: taskmanager.heap.size, 8192m
2019-08-21 20:00:25,439 INFO  org.apache.flink.configuration.GlobalConfiguration            - Loading configuration property: jobmanager.heap.size, 2048m
2019-08-21 20:00:25,465 WARN  org.apache.flink.configuration.Configuration                  - Config uses deprecated configuration key 'web.port' instead of proper key 'rest.bind-port'
2019-08-21 20:00:25,469 INFO  org.apache.flink.runtime.clusterframework.BootstrapTools      - Setting directories for temporary files to: /usr/hadoop/hadoop-2.7.7/tmp/nm-local-dir/usercache/root/appcache/application_1566383236573_0003
2019-08-21 20:00:25,485 INFO  org.apache.flink.runtime.entrypoint.ClusterEntrypoint         - Starting YarnJobClusterEntrypoint.
2019-08-21 20:00:25,485 INFO  org.apache.flink.runtime.entrypoint.ClusterEntrypoint         - Install default filesystem.
2019-08-21 20:00:25,561 INFO  org.apache.flink.runtime.security.modules.HadoopModule        - Hadoop user set to root (auth:SIMPLE)
2019-08-21 20:00:25,580 INFO  org.apache.flink.runtime.entrypoint.ClusterEntrypoint         - Initializing cluster services.
2019-08-21 20:00:25,757 INFO  org.apache.flink.runtime.rpc.akka.AkkaRpcServiceUtils         - Trying to start actor system at host53:0
2019-08-21 20:00:26,187 INFO  akka.event.slf4j.Slf4jLogger                                  - Slf4jLogger started
2019-08-21 20:00:26,251 INFO  akka.remote.Remoting                                          - Starting remoting
2019-08-21 20:00:26,439 INFO  akka.remote.Remoting                                          - Remoting started; listening on addresses :[akka.tcp://flink@host53:59609]
2019-08-21 20:00:26,447 INFO  org.apache.flink.runtime.rpc.akka.AkkaRpcServiceUtils         - Actor system started at akka.tcp://flink@host53:59609
2019-08-21 20:00:27,011 INFO  org.apache.flink.runtime.blob.FileSystemBlobStore             - Creating highly available BLOB storage directory at hdfs://host51:9000/flink/ha-yarn/application_1566383236573_0003/blob
2019-08-21 20:00:27,097 INFO  org.apache.flink.runtime.util.ZooKeeperUtils                  - Enforcing default ACL for ZK connections
2019-08-21 20:00:27,097 INFO  org.apache.flink.runtime.util.ZooKeeperUtils                  - Using '/flink-yarn/application_1566383236573_0003' as Zookeeper namespace.
2019-08-21 20:00:27,161 INFO  org.apache.flink.shaded.curator.org.apache.curator.framework.imps.CuratorFrameworkImpl  - Starting
2019-08-21 20:00:27,169 INFO  org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client environment:zookeeper.version=3.4.10-39d3a4f269333c922ed3db283be479f9deacaa0f, built on 03/23/2017 10:13 GMT
2019-08-21 20:00:27,169 INFO  org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client environment:host.name=host53
2019-08-21 20:00:27,169 INFO  org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client environment:java.version=1.8.0_77
2019-08-21 20:00:27,169 INFO  org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client environment:java.vendor=Oracle Corporation
2019-08-21 20:00:27,169 INFO  org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client
2019-08-21 20:00:27,170 INFO  org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client environment:java.library.path=:/usr/hadoop/hadoop-2.7.7/lib/native:/usr/hadoop/hadoop-2.7.7/lib/native:/usr/java/packages/lib/amd64:/usr/lib64:/lib64:/lib:/usr/lib
2019-08-21 20:00:27,170 INFO  org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client environment:java.io.tmpdir=/tmp
2019-08-21 20:00:27,170 INFO  org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client environment:java.compiler=<NA>
2019-08-21 20:00:27,170 INFO  org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client environment:os.name=Linux
2019-08-21 20:00:27,170 INFO  org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client environment:os.arch=amd64
2019-08-21 20:00:27,170 INFO  org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client environment:os.version=3.16.0-0.bpo.4-amd64
2019-08-21 20:00:27,170 INFO  org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client environment:user.name=root
2019-08-21 20:00:27,170 INFO  org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client environment:user.home=/root
2019-08-21 20:00:27,170 INFO  org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client environment:user.dir=/usr/hadoop/hadoop-2.7.7/tmp/nm-local-dir/usercache/root/appcache/application_1566383236573_0003/container_1566383236573_0003_01_000001
2019-08-21 20:00:27,171 INFO  org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Initiating client connection, connectString=host51:2181 sessionTimeout=60000 watcher=org.apache.flink.shaded.curator.org.apache.curator.ConnectionState@319854f0
2019-08-21 20:00:27,184 WARN  org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ClientCnxn  - SASL configuration failed: javax.security.auth.login.LoginException: No JAAS configuration section named 'Client' was found in specified JAAS configuration file: '/tmp/jaas-4274640356638794035.conf'. Will continue connection to Zookeeper server without SASL authentication, if Zookeeper server allows it.
2019-08-21 20:00:27,186 INFO  org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ClientCnxn  - Opening socket connection to server host51/10.10.10.51:2181
2019-08-21 20:00:27,187 ERROR org.apache.flink.shaded.curator.org.apache.curator.ConnectionState  - Authentication failed
2019-08-21 20:00:27,187 INFO  org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ClientCnxn  - Socket connection established to host51/10.10.10.51:2181, initiating session
2019-08-21 20:00:27,193 INFO  org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ClientCnxn  - Session establishment complete on server host51/10.10.10.51:2181, sessionid = 0x1001a86de810008, negotiated timeout = 40000
2019-08-21 20:00:27,193 INFO  org.apache.flink.runtime.blob.BlobServer                      - Created BLOB server storage directory /usr/hadoop/hadoop-2.7.7/tmp/nm-local-dir/usercache/root/appcache/application_1566383236573_0003/blobStore-eaa2bb2b-cddb-42e3-8ce5-bf2f83e8567a
2019-08-21 20:00:27,195 INFO  org.apache.flink.shaded.curator.org.apache.curator.framework.state.ConnectionStateManager  - State change: CONNECTED
2019-08-21 20:00:27,198 INFO  org.apache.flink.runtime.blob.BlobServer                      - Started BLOB server at 0.0.0.0:58591 - max concurrent requests: 50 - max backlog: 1000
2019-08-21 20:00:27,213 INFO  org.apache.flink.runtime.metrics.MetricRegistryImpl           - No metrics reporter configured, no metrics will be exposed/reported.
2019-08-21 20:00:27,215 INFO  org.apache.flink.runtime.entrypoint.ClusterEntrypoint         - Trying to start actor system at host53:0
2019-08-21 20:00:27,239 INFO  akka.event.slf4j.Slf4jLogger                                  - Slf4jLogger started
2019-08-21 20:00:27,247 INFO  akka.remote.Remoting                                          - Starting remoting
2019-08-21 20:00:27,255 INFO  akka.remote.Remoting                                          - Remoting started; listening on addresses :[akka.tcp://flink-metrics@host53:59941]
2019-08-21 20:00:27,257 INFO  org.apache.flink.runtime.entrypoint.ClusterEntrypoint         - Actor system started at akka.tcp://flink-metrics@host53:59941
2019-08-21 20:00:27,263 INFO  org.apache.flink.runtime.blob.TransientBlobCache              - Created BLOB cache storage directory /usr/hadoop/hadoop-2.7.7/tmp/nm-local-dir/usercache/root/appcache/application_1566383236573_0003/blobStore-405a4797-824f-48da-bb50-2ee2e230d249
2019-08-21 20:00:27,296 WARN  org.apache.flink.configuration.Configuration                  - Config uses deprecated configuration key 'web.port' instead of proper key 'rest.bind-port'
2019-08-21 20:00:27,298 WARN  org.apache.flink.runtime.jobmaster.MiniDispatcherRestEndpoint  - Upload directory /tmp/flink-web-57c9cddd-64af-4e3d-96a9-4f9e78d20b86/flink-web-upload does not exist, or has been deleted externally. Previously uploaded files are no longer available.
2019-08-21 20:00:27,298 INFO  org.apache.flink.runtime.jobmaster.MiniDispatcherRestEndpoint  - Created directory /tmp/flink-web-57c9cddd-64af-4e3d-96a9-4f9e78d20b86/flink-web-upload for file uploads.
2019-08-21 20:00:27,320 INFO  org.apache.flink.runtime.jobmaster.MiniDispatcherRestEndpoint  - Starting rest endpoint.
2019-08-21 20:00:27,616 INFO  org.apache.flink.runtime.webmonitor.WebMonitorUtils           - Determined location of main cluster component log file: /usr/hadoop/hadoop-2.7.7/logs/userlogs/application_1566383236573_0003/container_1566383236573_0003_01_000001/jobmanager.log
2019-08-21 20:00:27,616 INFO  org.apache.flink.runtime.webmonitor.WebMonitorUtils           - Determined location of main cluster component stdout file: /usr/hadoop/hadoop-2.7.7/logs/userlogs/application_1566383236573_0003/container_1566383236573_0003_01_000001/jobmanager.out
2019-08-21 20:00:27,771 INFO  org.apache.flink.runtime.jobmaster.MiniDispatcherRestEndpoint  - Rest endpoint listening at host53:46401
2019-08-21 20:00:27,772 INFO  org.apache.flink.runtime.leaderelection.ZooKeeperLeaderElectionService  - Starting ZooKeeperLeaderElectionService ZooKeeperLeaderElectionService{leaderPath='/leader/rest_server_lock'}.
2019-08-21 20:00:27,795 WARN  org.apache.flink.shaded.curator.org.apache.curator.utils.ZKPaths  - The version of ZooKeeper being used doesn't support Container nodes. CreateMode.PERSISTENT will be used instead.
2019-08-21 20:00:27,807 INFO  org.apache.flink.runtime.jobmaster.MiniDispatcherRestEndpoint  - Web frontend listening at http://host53:46401.
2019-08-21 20:00:27,836 INFO  org.apache.flink.runtime.jobmaster.MiniDispatcherRestEndpoint  - http://host53:46401 was granted leadership with leaderSessionID=8487757f-fd82-42b2-a1a6-adcc69e1eba1
2019-08-21 20:00:27,932 INFO  org.apache.flink.runtime.rpc.akka.AkkaRpcService              - Starting RPC endpoint for org.apache.flink.yarn.YarnResourceManager at akka://flink/user/resourcemanager .
2019-08-21 20:00:28,021 INFO  org.apache.flink.runtime.rpc.akka.AkkaRpcService              - Starting RPC endpoint for org.apache.flink.runtime.dispatcher.MiniDispatcher at akka://flink/user/dispatcher .
2019-08-21 20:00:28,033 INFO  org.apache.flink.runtime.leaderretrieval.ZooKeeperLeaderRetrievalService  - Starting ZooKeeperLeaderRetrievalService /leader/resource_manager_lock.
2019-08-21 20:00:28,033 INFO  org.apache.flink.runtime.leaderretrieval.ZooKeeperLeaderRetrievalService  - Starting ZooKeeperLeaderRetrievalService /leader/dispatcher_lock.
2019-08-21 20:00:28,033 INFO  org.apache.flink.runtime.leaderelection.ZooKeeperLeaderElectionService  - Starting ZooKeeperLeaderElectionService ZooKeeperLeaderElectionService{leaderPath='/leader/dispatcher_lock'}.
2019-08-21 20:00:28,045 INFO  org.apache.flink.runtime.dispatcher.MiniDispatcher            - Dispatcher akka.tcp://flink@host53:59609/user/dispatcher was granted leadership with fencing token 6216d17e-56b1-43bb-9bb3-f9fa5b8921be
2019-08-21 20:00:28,048 INFO  org.apache.flink.runtime.dispatcher.MiniDispatcher            - Recovering all persisted jobs.
2019-08-21 20:00:28,054 INFO  org.apache.hadoop.yarn.client.RMProxy                         - Connecting to ResourceManager at host51/10.10.10.51:8030
2019-08-21 20:00:28,078 INFO  org.apache.flink.runtime.rpc.akka.AkkaRpcService              - Starting RPC endpoint for org.apache.flink.runtime.jobmaster.JobMaster at akka://flink/user/jobmanager_0 .
2019-08-21 20:00:28,086 INFO  org.apache.flink.runtime.jobmaster.JobMaster                  - Initializing job StreamKafkaMain (e5ea83238d721e932b4645589660406c).
2019-08-21 20:00:28,099 INFO  org.apache.flink.runtime.jobmaster.JobMaster                  - Using restart strategy FixedDelayRestartStrategy(maxNumberRestartAttempts=2147483647, delayBetweenRestartAttempts=0) for StreamKafkaMain (e5ea83238d721e932b4645589660406c).
2019-08-21 20:00:28,122 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Job recovers via failover strategy: full graph restart
2019-08-21 20:00:28,153 INFO  org.apache.flink.runtime.jobmaster.JobMaster                  - Running initialization on master for job StreamKafkaMain (e5ea83238d721e932b4645589660406c).
2019-08-21 20:00:28,154 INFO  org.apache.flink.runtime.jobmaster.JobMaster                  - Successfully ran initialization on master in 0 ms.
2019-08-21 20:00:28,187 INFO  org.apache.flink.runtime.util.ZooKeeperUtils                  - Initialized ZooKeeperCompletedCheckpointStore in '/checkpoints/e5ea83238d721e932b4645589660406c'.
2019-08-21 20:00:28,196 INFO  org.apache.flink.yarn.YarnResourceManager                     - Recovered 0 containers from previous attempts ([]).
2019-08-21 20:00:28,199 INFO  org.apache.hadoop.yarn.client.api.impl.ContainerManagementProtocolProxy  - yarn.client.max-cached-nodemanagers-proxies : 0
2019-08-21 20:00:28,200 INFO  org.apache.flink.runtime.leaderelection.ZooKeeperLeaderElectionService  - Starting ZooKeeperLeaderElectionService ZooKeeperLeaderElectionService{leaderPath='/leader/resource_manager_lock'}.
2019-08-21 20:00:28,203 INFO  org.apache.flink.runtime.jobmaster.JobMaster                  - No state backend has been configured, using default (Memory / JobManager) MemoryStateBackend (data in heap memory / checkpoints to JobManager) (checkpoints: 'null', savepoints: 'null', asynchronous: TRUE, maxStateSize: 5242880)
2019-08-21 20:00:28,211 INFO  org.apache.flink.yarn.YarnResourceManager                     - ResourceManager akka.tcp://flink@host53:59609/user/resourcemanager was granted leadership with fencing token 97537c0d61bbf091e37cb6c45b784568
2019-08-21 20:00:28,214 INFO  org.apache.flink.runtime.resourcemanager.slotmanager.SlotManager  - Starting the SlotManager.
2019-08-21 20:00:28,225 INFO  org.apache.flink.runtime.checkpoint.ZooKeeperCompletedCheckpointStore  - Recovering checkpoints from ZooKeeper.
2019-08-21 20:00:28,229 INFO  org.apache.flink.runtime.checkpoint.ZooKeeperCompletedCheckpointStore  - Found 0 checkpoints in ZooKeeper.
2019-08-21 20:00:28,229 INFO  org.apache.flink.runtime.checkpoint.ZooKeeperCompletedCheckpointStore  - Trying to fetch 0 checkpoints from storage.
2019-08-21 20:00:28,233 INFO  org.apache.flink.runtime.leaderelection.ZooKeeperLeaderElectionService  - Starting ZooKeeperLeaderElectionService ZooKeeperLeaderElectionService{leaderPath='/leader/e5ea83238d721e932b4645589660406c/job_manager_lock'}.
2019-08-21 20:00:28,248 INFO  org.apache.flink.runtime.jobmaster.JobManagerRunner           - JobManager runner for job StreamKafkaMain (e5ea83238d721e932b4645589660406c) was granted leadership with session id 12c96c94-ebbd-4991-a457-6f0320d71bc1 at akka.tcp://flink@host53:59609/user/jobmanager_0.
2019-08-21 20:00:28,260 INFO  org.apache.flink.runtime.leaderretrieval.ZooKeeperLeaderRetrievalService  - Starting ZooKeeperLeaderRetrievalService /leader/resource_manager_lock.
2019-08-21 20:00:28,260 INFO  org.apache.flink.runtime.jobmaster.JobMaster                  - Starting execution of job StreamKafkaMain (e5ea83238d721e932b4645589660406c) under job master id a4576f0320d71bc112c96c94ebbd4991.
2019-08-21 20:00:28,261 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Job StreamKafkaMain (e5ea83238d721e932b4645589660406c) switched from state CREATED to RUNNING.
2019-08-21 20:00:28,266 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source: Custom Source -> Flat Map -> Filter -> Filter -> Map -> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) -> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (1/4) (ee39866835f7b7387f518e352787a532) switched from CREATED to SCHEDULED.
2019-08-21 20:00:28,280 INFO  org.apache.flink.runtime.jobmaster.slotpool.SlotPoolImpl      - Cannot serve slot request, no ResourceManager connected. Adding as pending request [SlotRequestId{f6123820ffb8b3e906b876e9dea4497b}]
2019-08-21 20:00:28,287 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source: Custom Source -> Flat Map -> Filter -> Filter -> Map -> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) -> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (2/4) (f1a893fc5e2440331c8e6db1874ad220) switched from CREATED to SCHEDULED.
2019-08-21 20:00:28,288 INFO  org.apache.flink.runtime.jobmaster.slotpool.SlotPoolImpl      - Cannot serve slot request, no ResourceManager connected. Adding as pending request [SlotRequestId{fdbb2e71c9bc795452687396abff7bf8}]
2019-08-21 20:00:28,288 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source: Custom Source -> Flat Map -> Filter -> Filter -> Map -> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) -> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (3/4) (4756a0450881325af5fb396a18dbb8ae) switched from CREATED to SCHEDULED.
2019-08-21 20:00:28,289 INFO  org.apache.flink.runtime.jobmaster.slotpool.SlotPoolImpl      - Cannot serve slot request, no ResourceManager connected. Adding as pending request [SlotRequestId{13970fab9312acf9d5063b1d1a33aa0e}]
2019-08-21 20:00:28,289 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source: Custom Source -> Flat Map -> Filter -> Filter -> Map -> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) -> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (4/4) (78f520582607e26b365fca483fc98d4c) switched from CREATED to SCHEDULED.
2019-08-21 20:00:28,289 INFO  org.apache.flink.runtime.jobmaster.slotpool.SlotPoolImpl      - Cannot serve slot request, no ResourceManager connected. Adding as pending request [SlotRequestId{0621470f796aa3ea4291d1b8b423e428}]
2019-08-21 20:00:28,289 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - window: (TumblingGroupWindow('w$, 'rowtime, 60000.millis)), select: (COUNT(*) AS pv, COUNT(DISTINCT curuserid) AS uv, start('w$) AS w$start, end('w$) AS w$end, rowtime('w$) AS w$rowtime, proctime('w$) AS w$proctime) -> select: (pv, uv, utc2local(w$end) AS EXPR$2) -> to: Row (1/1) (0071e1c7c4f70646914329f81dcbd349) switched from CREATED to SCHEDULED.
2019-08-21 20:00:28,291 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print to Std. Out (1/4) (0a206444b10553cce42fa34ea0b15497) switched from CREATED to SCHEDULED.
2019-08-21 20:00:28,291 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print to Std. Out (2/4) (a5d1bb759ee6c8001313fe39c7982a8b) switched from CREATED to SCHEDULED.
2019-08-21 20:00:28,291 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print to Std. Out (3/4) (9543bbde6538ea9d407af3012fede5c8) switched from CREATED to SCHEDULED.
2019-08-21 20:00:28,291 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print to Std. Out (4/4) (0e8c4210bd81e581fbf354b86397ce41) switched from CREATED to SCHEDULED.
2019-08-21 20:00:28,291 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map -> Sink: Print to Std. Out (1/4) (39d437bceb033f5bd37747e9142c7f0e) switched from CREATED to SCHEDULED.
2019-08-21 20:00:28,291 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map -> Sink: Print to Std. Out (2/4) (aa4f64edc4f10da2e8bdbf6ebf0aeb83) switched from CREATED to SCHEDULED.
2019-08-21 20:00:28,291 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map -> Sink: Print to Std. Out (3/4) (f109918ce195d1c88c2c63a44934217f) switched from CREATED to SCHEDULED.
2019-08-21 20:00:28,292 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map -> Sink: Print to Std. Out (4/4) (9aa5bb28f3ba75afc1de47bd9a4e5d01) switched from CREATED to SCHEDULED.
2019-08-21 20:00:28,295 INFO  org.apache.flink.runtime.jobmaster.JobMaster                  - Connecting to ResourceManager akka.tcp://flink@host53:59609/user/resourcemanager(97537c0d61bbf091e37cb6c45b784568)
2019-08-21 20:00:28,300 INFO  org.apache.flink.runtime.jobmaster.JobMaster                  - Resolved ResourceManager address, beginning registration
2019-08-21 20:00:28,300 INFO  org.apache.flink.runtime.jobmaster.JobMaster                  - Registration at ResourceManager attempt 1 (timeout=100ms)
2019-08-21 20:00:28,304 INFO  org.apache.flink.runtime.leaderretrieval.ZooKeeperLeaderRetrievalService  - Starting ZooKeeperLeaderRetrievalService /leader/e5ea83238d721e932b4645589660406c/job_manager_lock.
2019-08-21 20:00:28,304 INFO  org.apache.flink.yarn.YarnResourceManager                     - Registering job manager a4576f0320d71bc112c96c94ebbd4991@akka.tcp://flink@host53:59609/user/jobmanager_0 for job e5ea83238d721e932b4645589660406c.
2019-08-21 20:00:28,310 INFO  org.apache.flink.yarn.YarnResourceManager                     - Registered job manager a4576f0320d71bc112c96c94ebbd4991@akka.tcp://flink@host53:59609/user/jobmanager_0 for job e5ea83238d721e932b4645589660406c.
2019-08-21 20:00:28,313 INFO  org.apache.flink.runtime.jobmaster.JobMaster                  - JobManager successfully registered at ResourceManager, leader id: 97537c0d61bbf091e37cb6c45b784568.
2019-08-21 20:00:28,314 INFO  org.apache.flink.runtime.jobmaster.slotpool.SlotPoolImpl      - Requesting new slot [SlotRequestId{13970fab9312acf9d5063b1d1a33aa0e}] and profile ResourceProfile{cpuCores=-1.0, heapMemoryInMB=-1, directMemoryInMB=0, nativeMemoryInMB=0, networkMemoryInMB=0} from resource manager.
2019-08-21 20:00:28,315 INFO  org.apache.flink.yarn.YarnResourceManager                     - Request slot with profile ResourceProfile{cpuCores=-1.0, heapMemoryInMB=-1, directMemoryInMB=0, nativeMemoryInMB=0, networkMemoryInMB=0} for job e5ea83238d721e932b4645589660406c with allocation id faa6a521758e5be4b3dc1955ca0f51b4.
2019-08-21 20:00:28,316 INFO  org.apache.flink.runtime.jobmaster.slotpool.SlotPoolImpl      - Requesting new slot [SlotRequestId{fdbb2e71c9bc795452687396abff7bf8}] and profile ResourceProfile{cpuCores=-1.0, heapMemoryInMB=-1, directMemoryInMB=0, nativeMemoryInMB=0, networkMemoryInMB=0} from resource manager.
2019-08-21 20:00:28,316 INFO  org.apache.flink.runtime.jobmaster.slotpool.SlotPoolImpl      - Requesting new slot [SlotRequestId{0621470f796aa3ea4291d1b8b423e428}] and profile ResourceProfile{cpuCores=-1.0, heapMemoryInMB=-1, directMemoryInMB=0, nativeMemoryInMB=0, networkMemoryInMB=0} from resource manager.
2019-08-21 20:00:28,316 INFO  org.apache.flink.runtime.jobmaster.slotpool.SlotPoolImpl      - Requesting new slot [SlotRequestId{f6123820ffb8b3e906b876e9dea4497b}] and profile ResourceProfile{cpuCores=-1.0, heapMemoryInMB=-1, directMemoryInMB=0, nativeMemoryInMB=0, networkMemoryInMB=0} from resource manager.
2019-08-21 20:00:28,321 INFO  org.apache.flink.yarn.YarnResourceManager                     - Requesting new TaskExecutor container with resources <memory:8192, vCores:2>. Number pending requests 1.
2019-08-21 20:00:28,322 INFO  org.apache.flink.yarn.YarnResourceManager                     - Request slot with profile ResourceProfile{cpuCores=-1.0, heapMemoryInMB=-1, directMemoryInMB=0, nativeMemoryInMB=0, networkMemoryInMB=0} for job e5ea83238d721e932b4645589660406c with allocation id a2b419aea4a373686eeed6c46163a528.
2019-08-21 20:00:28,323 INFO  org.apache.flink.yarn.YarnResourceManager                     - Request slot with profile ResourceProfile{cpuCores=-1.0, heapMemoryInMB=-1, directMemoryInMB=0, nativeMemoryInMB=0, networkMemoryInMB=0} for job e5ea83238d721e932b4645589660406c with allocation id cbc11737ce6729cfe45f24c43180b5c8.
2019-08-21 20:00:28,323 INFO  org.apache.flink.yarn.YarnResourceManager                     - Requesting new TaskExecutor container with resources <memory:8192, vCores:2>. Number pending requests 2.
2019-08-21 20:00:28,323 INFO  org.apache.flink.yarn.YarnResourceManager                     - Request slot with profile ResourceProfile{cpuCores=-1.0, heapMemoryInMB=-1, directMemoryInMB=0, nativeMemoryInMB=0, networkMemoryInMB=0} for job e5ea83238d721e932b4645589660406c with allocation id f6566567c173a55b2bc52b72bcb24ff8.
2019-08-21 20:00:31,597 INFO  org.apache.flink.runtime.checkpoint.CheckpointCoordinator     - Checkpoint triggering task Source: Custom Source -> Flat Map -> Filter -> Filter -> Map -> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) -> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (1/4) of job e5ea83238d721e932b4645589660406c is not in state RUNNING but SCHEDULED instead. Aborting checkpoint.
2019-08-21 20:00:33,749 INFO  org.apache.hadoop.yarn.client.api.impl.AMRMClientImpl         - Received new token for : host53:59938
2019-08-21 20:00:33,749 INFO  org.apache.hadoop.yarn.client.api.impl.AMRMClientImpl         - Received new token for : host52:33246
2019-08-21 20:00:33,765 INFO  org.apache.flink.yarn.YarnResourceManager                     - Received new container: container_1566383236573_0003_01_000002 - Remaining pending container requests: 2
2019-08-21 20:00:33,765 INFO  org.apache.flink.yarn.YarnResourceManager                     - Removing container request Capability[<memory:8192, vCores:2>]Priority[1]. Pending container requests 1.
2019-08-21 20:00:34,009 INFO  org.apache.flink.yarn.YarnResourceManager                     - Creating container launch context for TaskManagers
2019-08-21 20:00:34,010 INFO  org.apache.flink.yarn.YarnResourceManager                     - Starting TaskManagers
2019-08-21 20:00:34,021 INFO  org.apache.hadoop.yarn.client.api.impl.ContainerManagementProtocolProxy  - Opening proxy : host53:59938
2019-08-21 20:00:34,066 INFO  org.apache.flink.yarn.YarnResourceManager                     - Received new container: container_1566383236573_0003_01_000003 - Remaining pending container requests: 1
2019-08-21 20:00:34,066 INFO  org.apache.flink.yarn.YarnResourceManager                     - Removing container request Capability[<memory:8192, vCores:2>]Priority[1]. Pending container requests 0.
2019-08-21 20:00:34,088 INFO  org.apache.flink.yarn.YarnResourceManager                     - Creating container launch context for TaskManagers
2019-08-21 20:00:34,089 INFO  org.apache.flink.yarn.YarnResourceManager                     - Starting TaskManagers
2019-08-21 20:00:34,090 INFO  org.apache.hadoop.yarn.client.api.impl.ContainerManagementProtocolProxy  - Opening proxy : host52:33246
2019-08-21 20:00:36,597 INFO  org.apache.flink.runtime.checkpoint.CheckpointCoordinator     - Checkpoint triggering task Source: Custom Source -> Flat Map -> Filter -> Filter -> Map -> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) -> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (1/4) of job e5ea83238d721e932b4645589660406c is not in state RUNNING but SCHEDULED instead. Aborting checkpoint.
2019-08-21 20:00:37,856 INFO  org.apache.flink.yarn.YarnResourceManager                     - Registering TaskManager with ResourceID container_1566383236573_0003_01_000002 (akka.tcp://flink@host53:34608/user/taskmanager_0) at ResourceManager
2019-08-21 20:00:39,573 INFO  org.apache.flink.yarn.YarnResourceManager                     - Registering TaskManager with ResourceID container_1566383236573_0003_01_000003 (akka.tcp://flink@host52:32931/user/taskmanager_0) at ResourceManager
2019-08-21 20:00:39,659 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source: Custom Source -> Flat Map -> Filter -> Filter -> Map -> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) -> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (1/4) (ee39866835f7b7387f518e352787a532) switched from SCHEDULED to DEPLOYING.
2019-08-21 20:00:39,659 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying Source: Custom Source -> Flat Map -> Filter -> Filter -> Map -> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) -> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (1/4) (attempt #0) to container_1566383236573_0003_01_000002 @ host53 (dataPort=52456)
2019-08-21 20:00:39,664 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source: Custom Source -> Flat Map -> Filter -> Filter -> Map -> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) -> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (2/4) (f1a893fc5e2440331c8e6db1874ad220) switched from SCHEDULED to DEPLOYING.
2019-08-21 20:00:39,664 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying Source: Custom Source -> Flat Map -> Filter -> Filter -> Map -> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) -> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (2/4) (attempt #0) to container_1566383236573_0003_01_000002 @ host53 (dataPort=52456)
2019-08-21 20:00:39,665 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source: Custom Source -> Flat Map -> Filter -> Filter -> Map -> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) -> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (3/4) (4756a0450881325af5fb396a18dbb8ae) switched from SCHEDULED to DEPLOYING.
2019-08-21 20:00:39,666 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying Source: Custom Source -> Flat Map -> Filter -> Filter -> Map -> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) -> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (3/4) (attempt #0) to container_1566383236573_0003_01_000003 @ host52 (dataPort=39650)
2019-08-21 20:00:39,666 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source: Custom Source -> Flat Map -> Filter -> Filter -> Map -> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) -> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (4/4) (78f520582607e26b365fca483fc98d4c) switched from SCHEDULED to DEPLOYING.
2019-08-21 20:00:39,666 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying Source: Custom Source -> Flat Map -> Filter -> Filter -> Map -> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) -> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (4/4) (attempt #0) to container_1566383236573_0003_01_000003 @ host52 (dataPort=39650)
2019-08-21 20:00:39,666 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - window: (TumblingGroupWindow('w$, 'rowtime, 60000.millis)), select: (COUNT(*) AS pv, COUNT(DISTINCT curuserid) AS uv, start('w$) AS w$start, end('w$) AS w$end, rowtime('w$) AS w$rowtime, proctime('w$) AS w$proctime) -> select: (pv, uv, utc2local(w$end) AS EXPR$2) -> to: Row (1/1) (0071e1c7c4f70646914329f81dcbd349) switched from SCHEDULED to DEPLOYING.
2019-08-21 20:00:39,666 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying window: (TumblingGroupWindow('w$, 'rowtime, 60000.millis)), select: (COUNT(*) AS pv, COUNT(DISTINCT curuserid) AS uv, start('w$) AS w$start, end('w$) AS w$end, rowtime('w$) AS w$rowtime, proctime('w$) AS w$proctime) -> select: (pv, uv, utc2local(w$end) AS EXPR$2) -> to: Row (1/1) (attempt #0) to container_1566383236573_0003_01_000002 @ host53 (dataPort=52456)
2019-08-21 20:00:39,670 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print to Std. Out (1/4) (0a206444b10553cce42fa34ea0b15497) switched from SCHEDULED to DEPLOYING.
2019-08-21 20:00:39,670 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying Sink: Print to Std. Out (1/4) (attempt #0) to container_1566383236573_0003_01_000003 @ host52 (dataPort=39650)
2019-08-21 20:00:39,671 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print to Std. Out (2/4) (a5d1bb759ee6c8001313fe39c7982a8b) switched from SCHEDULED to DEPLOYING.
2019-08-21 20:00:39,672 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying Sink: Print to Std. Out (2/4) (attempt #0) to container_1566383236573_0003_01_000003 @ host52 (dataPort=39650)
2019-08-21 20:00:39,672 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print to Std. Out (3/4) (9543bbde6538ea9d407af3012fede5c8) switched from SCHEDULED to DEPLOYING.
2019-08-21 20:00:39,672 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying Sink: Print to Std. Out (3/4) (attempt #0) to container_1566383236573_0003_01_000002 @ host53 (dataPort=52456)
2019-08-21 20:00:39,672 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print to Std. Out (4/4) (0e8c4210bd81e581fbf354b86397ce41) switched from SCHEDULED to DEPLOYING.
2019-08-21 20:00:39,673 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying Sink: Print to Std. Out (4/4) (attempt #0) to container_1566383236573_0003_01_000002 @ host53 (dataPort=52456)
2019-08-21 20:00:39,673 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map -> Sink: Print to Std. Out (1/4) (39d437bceb033f5bd37747e9142c7f0e) switched from SCHEDULED to DEPLOYING.
2019-08-21 20:00:39,673 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying Map -> Sink: Print to Std. Out (1/4) (attempt #0) to container_1566383236573_0003_01_000003 @ host52 (dataPort=39650)
2019-08-21 20:00:39,675 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map -> Sink: Print to Std. Out (2/4) (aa4f64edc4f10da2e8bdbf6ebf0aeb83) switched from SCHEDULED to DEPLOYING.
2019-08-21 20:00:39,675 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying Map -> Sink: Print to Std. Out (2/4) (attempt #0) to container_1566383236573_0003_01_000003 @ host52 (dataPort=39650)
2019-08-21 20:00:39,675 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map -> Sink: Print to Std. Out (3/4) (f109918ce195d1c88c2c63a44934217f) switched from SCHEDULED to DEPLOYING.
2019-08-21 20:00:39,675 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying Map -> Sink: Print to Std. Out (3/4) (attempt #0) to container_1566383236573_0003_01_000002 @ host53 (dataPort=52456)
2019-08-21 20:00:39,676 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map -> Sink: Print to Std. Out (4/4) (9aa5bb28f3ba75afc1de47bd9a4e5d01) switched from SCHEDULED to DEPLOYING.
2019-08-21 20:00:39,676 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying Map -> Sink: Print to Std. Out (4/4) (attempt #0) to container_1566383236573_0003_01_000002 @ host53 (dataPort=52456)
2019-08-21 20:00:39,782 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print to Std. Out (4/4) (0e8c4210bd81e581fbf354b86397ce41) switched from DEPLOYING to RUNNING.
2019-08-21 20:00:39,783 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print to Std. Out (3/4) (9543bbde6538ea9d407af3012fede5c8) switched from DEPLOYING to RUNNING.
2019-08-21 20:00:39,784 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map -> Sink: Print to Std. Out (4/4) (9aa5bb28f3ba75afc1de47bd9a4e5d01) switched from DEPLOYING to RUNNING.
2019-08-21 20:00:39,787 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map -> Sink: Print to Std. Out (3/4) (f109918ce195d1c88c2c63a44934217f) switched from DEPLOYING to RUNNING.
2019-08-21 20:00:39,806 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source: Custom Source -> Flat Map -> Filter -> Filter -> Map -> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) -> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (2/4) (f1a893fc5e2440331c8e6db1874ad220) switched from DEPLOYING to RUNNING.
2019-08-21 20:00:39,809 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source: Custom Source -> Flat Map -> Filter -> Filter -> Map -> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) -> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (1/4) (ee39866835f7b7387f518e352787a532) switched from DEPLOYING to RUNNING.
2019-08-21 20:00:39,809 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - window: (TumblingGroupWindow('w$, 'rowtime, 60000.millis)), select: (COUNT(*) AS pv, COUNT(DISTINCT curuserid) AS uv, start('w$) AS w$start, end('w$) AS w$end, rowtime('w$) AS w$rowtime, proctime('w$) AS w$proctime) -> select: (pv, uv, utc2local(w$end) AS EXPR$2) -> to: Row (1/1) (0071e1c7c4f70646914329f81dcbd349) switched from DEPLOYING to RUNNING.
2019-08-21 20:00:39,991 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print to Std. Out (1/4) (0a206444b10553cce42fa34ea0b15497) switched from DEPLOYING to RUNNING.
2019-08-21 20:00:39,992 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print to Std. Out (2/4) (a5d1bb759ee6c8001313fe39c7982a8b) switched from DEPLOYING to RUNNING.
2019-08-21 20:00:39,996 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map -> Sink: Print to Std. Out (1/4) (39d437bceb033f5bd37747e9142c7f0e) switched from DEPLOYING to RUNNING.
2019-08-21 20:00:39,997 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map -> Sink: Print to Std. Out (2/4) (aa4f64edc4f10da2e8bdbf6ebf0aeb83) switched from DEPLOYING to RUNNING.
2019-08-21 20:00:40,005 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source: Custom Source -> Flat Map -> Filter -> Filter -> Map -> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) -> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (4/4) (78f520582607e26b365fca483fc98d4c) switched from DEPLOYING to RUNNING.
2019-08-21 20:00:40,006 INFO  org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source: Custom Source -> Flat Map -> Filter -> Filter -> Map -> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) -> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (3/4) (4756a0450881325af5fb396a18dbb8ae) switched from DEPLOYING to RUNNING.
2019-08-21 20:00:41,605 INFO  org.apache.flink.runtime.checkpoint.CheckpointCoordinator     - Triggering checkpoint 1 @ 1566388841597 for job e5ea83238d721e932b4645589660406c.
2019-08-21 20:00:42,466 INFO  org.apache.flink.runtime.checkpoint.CheckpointCoordinator     - Completed checkpoint 1 for job e5ea83238d721e932b4645589660406c (7994 bytes in 821 ms).
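
To see how long the JobManager side of the startup actually took, the timestamps of two log lines can be subtracted. The snippet below is only a rough sketch: the helper names `parse_log_timestamp` and `elapsed_seconds` are made up for illustration, and it assumes every line begins with a `YYYY-MM-DD HH:MM:SS,mmm` timestamp as in the log above. The two sample lines are the first JobManager log entry (20:00:25,428) and the "Completed checkpoint 1" entry.

```python
from datetime import datetime

# Timestamp layout at the start of each log line, e.g. "2019-08-21 20:00:25,428".
LOG_TS_FORMAT = "%Y-%m-%d %H:%M:%S,%f"
LOG_TS_LENGTH = 23  # number of characters occupied by the timestamp


def parse_log_timestamp(line):
    """Parse the leading timestamp of a log line (hypothetical helper)."""
    return datetime.strptime(line[:LOG_TS_LENGTH], LOG_TS_FORMAT)


def elapsed_seconds(first_line, last_line):
    """Seconds between the timestamps of two log lines (hypothetical helper)."""
    delta = parse_log_timestamp(last_line) - parse_log_timestamp(first_line)
    return delta.total_seconds()


# First JobManager log line and the first completed checkpoint, copied from the log.
start_line = "2019-08-21 20:00:25,428 INFO  org.apache.flink.runtime.entrypoint.ClusterEntrypoint"
done_line = "2019-08-21 20:00:42,466 INFO  org.apache.flink.runtime.checkpoint.CheckpointCoordinator     - Completed checkpoint 1"

elapsed = elapsed_seconds(start_line, done_line)
print(f"JobManager start to first completed checkpoint: {elapsed:.3f} s")
```

If this gap is small while the overall wait is about 10 minutes, the delay most likely happens before the JobManager container starts, so the client-side log and the YARN ResourceManager log would be the next places to look.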



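------------------ Original Message ------------------
From: "Yun Tang"<[hidden email]>;
Sent: Thursday, August 22, 2019, 11:08 AM
To: "user-zh"<[hidden email]>;

Subject: Re: Flink startup waits for 10 minutes



Hi

A Flink on YARN job can be slow to start for many reasons, for example waiting because resources are insufficient, or containers exiting right after they were requested. The default slot request timeout is 5 minutes, so my guess is that your job hit a slot request timeout and then requested the slots again. It would be best to provide the JobManager log so this can be analyzed further.

Best regards,
Yun Tang (唐云)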
________________________________
From: 々守护々 <[hidden email]>
Sent: Thursday, August 22, 2019 11:04
To: user-zh <[hidden email]>
Subject: Flink startup waits for 10 minutes

Hello, Flink community experts!
      I am using Flink SQL (Flink 1.8.1) and adjusted the memory parameters in the yarn-site.xml file on the Hadoop cluster:
        <property>
                <name>yarn.nodemanager.resource.memory-mb</name>
                <value>16384</value>
        </property>
        <property>
                <name>yarn.scheduler.minimum-allocation-mb</name>
                <value>1024</value>
        </property>
        <property>
                <name>yarn.scheduler.maximum-allocation-mb</name>
                <value>8192</value>
        </property>



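Then I started the Flink on YARN job with these launch parameters: ./flink-1.8.1/bin/flink run -m yarn-cluster -p 4 -d -c streaming.StreamKafkaMain -yn 4 -ys 2 -yjm 2048 -ytm 8192  -ynm bigdata6 ./flink_bigdata6.jar
The job does eventually start successfully, but startup takes 10 minutes of waiting, which is far too long. Does anyone have a way to fix this?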
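
As a quick sanity check on whether these settings can explain a wait for resources, the container demand implied by the launch parameters can be compared with the YARN limits. This is a back-of-the-envelope sketch only: the variable names are made up for illustration, it assumes one TaskManager container of exactly `-ytm` MB per `-ys` slots, and it ignores vcores, other applications on the cluster, and scheduler policy.

```python
import math

# Values taken from the yarn-site.xml fragment and the flink run command in the post.
node_memory_mb = 16384     # yarn.nodemanager.resource.memory-mb
max_allocation_mb = 8192   # yarn.scheduler.maximum-allocation-mb
parallelism = 4            # -p
slots_per_tm = 2           # -ys
tm_memory_mb = 8192        # -ytm
jm_memory_mb = 2048        # -yjm

# Each TaskManager offers slots_per_tm slots, so this many containers are needed.
task_managers_needed = math.ceil(parallelism / slots_per_tm)

# Total memory the application asks YARN for (JobManager plus TaskManagers).
total_requested_mb = jm_memory_mb + task_managers_needed * tm_memory_mb

# A container larger than the scheduler maximum could not be allocated as requested.
tm_fits_max_allocation = tm_memory_mb <= max_allocation_mb

# How many TaskManager containers fit on one node, with and without the JobManager.
tms_per_empty_node = node_memory_mb // tm_memory_mb
tms_on_jobmanager_node = (node_memory_mb - jm_memory_mb) // tm_memory_mb

print("TaskManagers needed:", task_managers_needed)
print("Total memory requested (MB):", total_requested_mb)
print("TaskManager fits maximum allocation:", tm_fits_max_allocation)
print("TaskManagers per empty node:", tms_per_empty_node)
print("TaskManagers on the JobManager's node:", tms_on_jobmanager_node)
```

Under these assumptions the job needs two 8192 MB TaskManager containers, which is exactly at the scheduler maximum, so at least two NodeManagers with free memory are needed if the JobManager shares a node with one of them.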

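Re: Re: Flink startup waits for 10 minutes

tison
The image isn't visible. You could send it as an email attachment or upload it to a third-party image host...

Best,
tison.


々守护々 <[hidden email]> wrote on Thu, Aug 22, 2019 at 11:39 AM: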

> 是的,集群的资源充足,附件有截图,就停在那不往下走了。
>
>
> ------------------ 原始邮件 ------------------
> *发件人:* "[hidden email]"<[hidden email]>;
> *发送时间:* 2019年8月22日(星期四) 中午11:34
> *收件人:* "user-zh"<[hidden email]>;
> *主题:* 回复: 回复: flink启动等待10分钟问题
>
> 当时你的集群资源是否充足?
>
>
>
> [hidden email]
>
> 发件人: 々守护々
> 发送时间: 2019-08-22 11:32
> 收件人: user-zh
> 主题: 回复: 回复: flink启动等待10分钟问题
> 是这样的,在启动的是后客户端日志就停在那了,等待10分钟后才能生成yarn的application
>
>
> ------------------ 原始邮件 ------------------
> 发件人: "Yun Tang"<[hidden email]>;
> 发送时间: 2019年8月22日(星期四) 中午11:23
> 收件人: "user-zh"<[hidden email]>;
> 主题: Re: 回复: flink启动等待10分钟问题
>
> 启动时间是 20:00:25,task都处于running甚至第一次checkpoint
> completed的时间是20:00:42,一共才17秒,何来10分钟的问题?
> ________________________________
> From: 々守护々 <[hidden email]>
> Sent: Thursday, August 22, 2019 11:18
> To: user-zh <[hidden email]>
> Subject: Re: Flink startup takes 10 minutes
>
> Hello, this is my JobManager startup log. Please take a look, thanks!
>
>
> 2019-08-21 20:00:25,428 INFO
> org.apache.flink.runtime.entrypoint.ClusterEntrypoint         -
> --------------------------------------------------------------------------------
> 2019-08-21 20:00:25,430 INFO
> org.apache.flink.runtime.entrypoint.ClusterEntrypoint         - Registered
> UNIX signal handlers for [TERM, HUP, INT]
> 2019-08-21 20:00:25,433 INFO
> org.apache.flink.runtime.entrypoint.ClusterEntrypoint         - YARN daemon
> is running as: root Yarn client user obtainer: root
> 2019-08-21 20:00:25,437 INFO
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: high-availability.cluster-id,
> application_1566383236573_0003
> 2019-08-21 20:00:25,437 INFO
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: jobmanager.rpc.address, localhost
> 2019-08-21 20:00:25,437 INFO
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: jobmanager.rpc.port, 6123
> 2019-08-21 20:00:25,437 INFO
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: high-availability.zookeeper.path.root, /flink-yarn
> 2019-08-21 20:00:25,437 INFO
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: high-availability.storageDir,
> hdfs://host51:9000/flink/ha-yarn
> 2019-08-21 20:00:25,438 INFO
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: high-availability.zookeeper.quorum, host51:2181
> 2019-08-21 20:00:25,438 INFO
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: internal.cluster.execution-mode, DETACHED
> 2019-08-21 20:00:25,438 INFO
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: high-availability, zookeeper
> 2019-08-21 20:00:25,438 INFO
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: parallelism.default, 1
> 2019-08-21 20:00:25,438 INFO
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: yarn.application-attempts, 10
> 2019-08-21 20:00:25,438 INFO
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: taskmanager.numberOfTaskSlots, 2
> 2019-08-21 20:00:25,439 INFO
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: taskmanager.heap.size, 8192m
> 2019-08-21 20:00:25,439 INFO
> org.apache.flink.configuration.GlobalConfiguration            - Loading
> configuration property: jobmanager.heap.size, 2048m
> 2019-08-21 20:00:25,465 WARN
> org.apache.flink.configuration.Configuration                  - Config uses
> deprecated configuration key 'web.port' instead of proper key
> 'rest.bind-port'
> 2019-08-21 20:00:25,469 INFO
> org.apache.flink.runtime.clusterframework.BootstrapTools      - Setting
> directories for temporary files to:
> /usr/hadoop/hadoop-2.7.7/tmp/nm-local-dir/usercache/root/appcache/application_1566383236573_0003
> 2019-08-21 20:00:25,485 INFO
> org.apache.flink.runtime.entrypoint.ClusterEntrypoint         - Starting
> YarnJobClusterEntrypoint.
> 2019-08-21 20:00:25,485 INFO
> org.apache.flink.runtime.entrypoint.ClusterEntrypoint         - Install
> default filesystem.
> 2019-08-21 20:00:25,561 INFO
> org.apache.flink.runtime.security.modules.HadoopModule        - Hadoop user
> set to root (auth:SIMPLE)
> 2019-08-21 20:00:25,580 INFO
> org.apache.flink.runtime.entrypoint.ClusterEntrypoint         -
> Initializing cluster services.
> 2019-08-21 20:00:25,757 INFO
> org.apache.flink.runtime.rpc.akka.AkkaRpcServiceUtils         - Trying to
> start actor system at host53:0
> 2019-08-21 20:00:26,187 INFO
> akka.event.slf4j.Slf4jLogger                                  - Slf4jLogger
> started
> 2019-08-21 20:00:26,251 INFO
> akka.remote.Remoting                                          - Starting
> remoting
> 2019-08-21 20:00:26,439 INFO
> akka.remote.Remoting                                          - Remoting
> started; listening on addresses :[akka.tcp://flink@host53:59609]
> 2019-08-21 20:00:26,447 INFO
> org.apache.flink.runtime.rpc.akka.AkkaRpcServiceUtils         - Actor
> system started at akka.tcp://flink@host53:59609
> 2019-08-21 20:00:27,011 INFO
> org.apache.flink.runtime.blob.FileSystemBlobStore             - Creating
> highly available BLOB storage directory at
> hdfs://host51:9000/flink/ha-yarn/application_1566383236573_0003/blob
> 2019-08-21 20:00:27,097 INFO
> org.apache.flink.runtime.util.ZooKeeperUtils                  - Enforcing
> default ACL for ZK connections
> 2019-08-21 20:00:27,097 INFO
> org.apache.flink.runtime.util.ZooKeeperUtils                  - Using
> '/flink-yarn/application_1566383236573_0003' as Zookeeper namespace.
> 2019-08-21 20:00:27,161 INFO
> org.apache.flink.shaded.curator.org.apache.curator.framework.imps.CuratorFrameworkImpl
> - Starting
> 2019-08-21 20:00:27,169 INFO
> org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client
> environment:zookeeper.version=3.4.10-39d3a4f269333c922ed3db283be479f9deacaa0f,
> built on 03/23/2017 10:13 GMT
> 2019-08-21 20:00:27,169 INFO
> org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client
> environment:host.name=host53
> 2019-08-21 20:00:27,169 INFO
> org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client
> environment:java.version=1.8.0_77
> 2019-08-21 20:00:27,169 INFO
> org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client
> environment:java.vendor=Oracle Corporation
> 2019-08-21 20:00:27,169 INFO
> org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client
> 2019-08-21 20:00:27,170 INFO
> org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client
> environment:java.library.path=:/usr/hadoop/hadoop-2.7.7/lib/native:/usr/hadoop/hadoop-2.7.7/lib/native:/usr/java/packages/lib/amd64:/usr/lib64:/lib64:/lib:/usr/lib
> 2019-08-21 20:00:27,170 INFO
> org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client
> environment:java.io.tmpdir=/tmp
> 2019-08-21 20:00:27,170 INFO
> org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client
> environment:java.compiler=<NA>
> 2019-08-21 20:00:27,170 INFO
> org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client
> environment:os.name=Linux
> 2019-08-21 20:00:27,170 INFO
> org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client
> environment:os.arch=amd64
> 2019-08-21 20:00:27,170 INFO
> org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client
> environment:os.version=3.16.0-0.bpo.4-amd64
> 2019-08-21 20:00:27,170 INFO
> org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client
> environment:user.name=root
> 2019-08-21 20:00:27,170 INFO
> org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client
> environment:user.home=/root
> 2019-08-21 20:00:27,170 INFO
> org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client
> environment:user.dir=/usr/hadoop/hadoop-2.7.7/tmp/nm-local-dir/usercache/root/appcache/application_1566383236573_0003/container_1566383236573_0003_01_000001
> 2019-08-21 20:00:27,171 INFO
> org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  -
> Initiating client connection, connectString=host51:2181
> sessionTimeout=60000
> watcher=org.apache.flink.shaded.curator.org.apache.curator.ConnectionState@319854f0
> 2019-08-21 20:00:27,184 WARN
> org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ClientCnxn  - SASL
> configuration failed: javax.security.auth.login.LoginException: No JAAS
> configuration section named 'Client' was found in specified JAAS
> configuration file: '/tmp/jaas-4274640356638794035.conf'. Will continue
> connection to Zookeeper server without SASL authentication, if Zookeeper
> server allows it.
> 2019-08-21 20:00:27,186 INFO
> org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ClientCnxn  -
> Opening socket connection to server host51/10.10.10.51:2181
> 2019-08-21 20:00:27,187 ERROR
> org.apache.flink.shaded.curator.org.apache.curator.ConnectionState  -
> Authentication failed
> 2019-08-21 20:00:27,187 INFO
> org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ClientCnxn  - Socket
> connection established to host51/10.10.10.51:2181, initiating session
> 2019-08-21 20:00:27,193 INFO
> org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ClientCnxn  -
> Session establishment complete on server host51/10.10.10.51:2181,
> sessionid = 0x1001a86de810008, negotiated timeout = 40000
> 2019-08-21 20:00:27,193 INFO
> org.apache.flink.runtime.blob.BlobServer                      - Created
> BLOB server storage directory
> /usr/hadoop/hadoop-2.7.7/tmp/nm-local-dir/usercache/root/appcache/application_1566383236573_0003/blobStore-eaa2bb2b-cddb-42e3-8ce5-bf2f83e8567a
> 2019-08-21 20:00:27,195 INFO
> org.apache.flink.shaded.curator.org.apache.curator.framework.state.ConnectionStateManager
> - State change: CONNECTED
> 2019-08-21 20:00:27,198 INFO
> org.apache.flink.runtime.blob.BlobServer                      - Started
> BLOB server at 0.0.0.0:58591 - max concurrent requests: 50 - max backlog:
> 1000
> 2019-08-21 20:00:27,213 INFO
> org.apache.flink.runtime.metrics.MetricRegistryImpl           - No metrics
> reporter configured, no metrics will be exposed/reported.
> 2019-08-21 20:00:27,215 INFO
> org.apache.flink.runtime.entrypoint.ClusterEntrypoint         - Trying to
> start actor system at host53:0
> 2019-08-21 20:00:27,239 INFO
> akka.event.slf4j.Slf4jLogger                                  - Slf4jLogger
> started
> 2019-08-21 20:00:27,247 INFO
> akka.remote.Remoting                                          - Starting
> remoting
> 2019-08-21 20:00:27,255 INFO
> akka.remote.Remoting                                          - Remoting
> started; listening on addresses :[akka.tcp://flink-metrics@host53:59941]
> 2019-08-21 20:00:27,257 INFO
> org.apache.flink.runtime.entrypoint.ClusterEntrypoint         - Actor
> system started at akka.tcp://flink-metrics@host53:59941
> 2019-08-21 20:00:27,263 INFO
> org.apache.flink.runtime.blob.TransientBlobCache              - Created
> BLOB cache storage directory
> /usr/hadoop/hadoop-2.7.7/tmp/nm-local-dir/usercache/root/appcache/application_1566383236573_0003/blobStore-405a4797-824f-48da-bb50-2ee2e230d249
> 2019-08-21 20:00:27,296 WARN
> org.apache.flink.configuration.Configuration                  - Config uses
> deprecated configuration key 'web.port' instead of proper key
> 'rest.bind-port'
> 2019-08-21 20:00:27,298 WARN
> org.apache.flink.runtime.jobmaster.MiniDispatcherRestEndpoint  - Upload
> directory
> /tmp/flink-web-57c9cddd-64af-4e3d-96a9-4f9e78d20b86/flink-web-upload does
> not exist, or has been deleted externally. Previously uploaded files are no
> longer available.
> 2019-08-21 20:00:27,298 INFO
> org.apache.flink.runtime.jobmaster.MiniDispatcherRestEndpoint  - Created
> directory
> /tmp/flink-web-57c9cddd-64af-4e3d-96a9-4f9e78d20b86/flink-web-upload for
> file uploads.
> 2019-08-21 20:00:27,320 INFO
> org.apache.flink.runtime.jobmaster.MiniDispatcherRestEndpoint  - Starting
> rest endpoint.
> 2019-08-21 20:00:27,616 INFO
> org.apache.flink.runtime.webmonitor.WebMonitorUtils           - Determined
> location of main cluster component log file:
> /usr/hadoop/hadoop-2.7.7/logs/userlogs/application_1566383236573_0003/container_1566383236573_0003_01_000001/jobmanager.log
> 2019-08-21 20:00:27,616 INFO
> org.apache.flink.runtime.webmonitor.WebMonitorUtils           - Determined
> location of main cluster component stdout file:
> /usr/hadoop/hadoop-2.7.7/logs/userlogs/application_1566383236573_0003/container_1566383236573_0003_01_000001/jobmanager.out
> 2019-08-21 20:00:27,771 INFO
> org.apache.flink.runtime.jobmaster.MiniDispatcherRestEndpoint  - Rest
> endpoint listening at host53:46401
> 2019-08-21 20:00:27,772 INFO
> org.apache.flink.runtime.leaderelection.ZooKeeperLeaderElectionService  -
> Starting ZooKeeperLeaderElectionService
> ZooKeeperLeaderElectionService{leaderPath='/leader/rest_server_lock'}.
> 2019-08-21 20:00:27,795 WARN
> org.apache.flink.shaded.curator.org.apache.curator.utils.ZKPaths  - The
> version of ZooKeeper being used doesn't support Container nodes.
> CreateMode.PERSISTENT will be used instead.
> 2019-08-21 20:00:27,807 INFO
> org.apache.flink.runtime.jobmaster.MiniDispatcherRestEndpoint  - Web
> frontend listening at http://host53:46401.
> 2019-08-21 20:00:27,836 INFO
> org.apache.flink.runtime.jobmaster.MiniDispatcherRestEndpoint  -
> http://host53:46401 was granted leadership with
> leaderSessionID=8487757f-fd82-42b2-a1a6-adcc69e1eba1
> 2019-08-21 20:00:27,932 INFO
> org.apache.flink.runtime.rpc.akka.AkkaRpcService              - Starting
> RPC endpoint for org.apache.flink.yarn.YarnResourceManager at
> akka://flink/user/resourcemanager .
> 2019-08-21 20:00:28,021 INFO
> org.apache.flink.runtime.rpc.akka.AkkaRpcService              - Starting
> RPC endpoint for org.apache.flink.runtime.dispatcher.MiniDispatcher at
> akka://flink/user/dispatcher .
> 2019-08-21 20:00:28,033 INFO
> org.apache.flink.runtime.leaderretrieval.ZooKeeperLeaderRetrievalService  -
> Starting ZooKeeperLeaderRetrievalService /leader/resource_manager_lock.
> 2019-08-21 20:00:28,033 INFO
> org.apache.flink.runtime.leaderretrieval.ZooKeeperLeaderRetrievalService  -
> Starting ZooKeeperLeaderRetrievalService /leader/dispatcher_lock.
> 2019-08-21 20:00:28,033 INFO
> org.apache.flink.runtime.leaderelection.ZooKeeperLeaderElectionService  -
> Starting ZooKeeperLeaderElectionService
> ZooKeeperLeaderElectionService{leaderPath='/leader/dispatcher_lock'}.
> 2019-08-21 20:00:28,045 INFO
> org.apache.flink.runtime.dispatcher.MiniDispatcher            - Dispatcher
> akka.tcp://flink@host53:59609/user/dispatcher was granted leadership with
> fencing token 6216d17e-56b1-43bb-9bb3-f9fa5b8921be
> 2019-08-21 20:00:28,048 INFO
> org.apache.flink.runtime.dispatcher.MiniDispatcher            - Recovering
> all persisted jobs.
> 2019-08-21 20:00:28,054 INFO
> org.apache.hadoop.yarn.client.RMProxy                         - Connecting
> to ResourceManager at host51/10.10.10.51:8030
> 2019-08-21 20:00:28,078 INFO
> org.apache.flink.runtime.rpc.akka.AkkaRpcService              - Starting
> RPC endpoint for org.apache.flink.runtime.jobmaster.JobMaster at
> akka://flink/user/jobmanager_0 .
> 2019-08-21 20:00:28,086 INFO
> org.apache.flink.runtime.jobmaster.JobMaster                  -
> Initializing job StreamKafkaMain (e5ea83238d721e932b4645589660406c).
> 2019-08-21 20:00:28,099 INFO
> org.apache.flink.runtime.jobmaster.JobMaster                  - Using
> restart strategy
> FixedDelayRestartStrategy(maxNumberRestartAttempts=2147483647,
> delayBetweenRestartAttempts=0) for StreamKafkaMain
> (e5ea83238d721e932b4645589660406c).
> 2019-08-21 20:00:28,122 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Job
> recovers via failover strategy: full graph restart
> 2019-08-21 20:00:28,153 INFO
> org.apache.flink.runtime.jobmaster.JobMaster                  - Running
> initialization on master for job StreamKafkaMain
> (e5ea83238d721e932b4645589660406c).
> 2019-08-21 20:00:28,154 INFO
> org.apache.flink.runtime.jobmaster.JobMaster                  -
> Successfully ran initialization on master in 0 ms.
> 2019-08-21 20:00:28,187 INFO
> org.apache.flink.runtime.util.ZooKeeperUtils                  - Initialized
> ZooKeeperCompletedCheckpointStore in
> '/checkpoints/e5ea83238d721e932b4645589660406c'.
> 2019-08-21 20:00:28,196 INFO
> org.apache.flink.yarn.YarnResourceManager                     - Recovered 0
> containers from previous attempts ([]).
> 2019-08-21 20:00:28,199 INFO
> org.apache.hadoop.yarn.client.api.impl.ContainerManagementProtocolProxy  -
> yarn.client.max-cached-nodemanagers-proxies : 0
> 2019-08-21 20:00:28,200 INFO
> org.apache.flink.runtime.leaderelection.ZooKeeperLeaderElectionService  -
> Starting ZooKeeperLeaderElectionService
> ZooKeeperLeaderElectionService{leaderPath='/leader/resource_manager_lock'}.
> 2019-08-21 20:00:28,203 INFO
> org.apache.flink.runtime.jobmaster.JobMaster                  - No state
> backend has been configured, using default (Memory / JobManager)
> MemoryStateBackend (data in heap memory / checkpoints to JobManager)
> (checkpoints: 'null', savepoints: 'null', asynchronous: TRUE, maxStateSize:
> 5242880)
> 2019-08-21 20:00:28,211 INFO
> org.apache.flink.yarn.YarnResourceManager                     -
> ResourceManager akka.tcp://flink@host53:59609/user/resourcemanager was
> granted leadership with fencing token 97537c0d61bbf091e37cb6c45b784568
> 2019-08-21 20:00:28,214 INFO
> org.apache.flink.runtime.resourcemanager.slotmanager.SlotManager  -
> Starting the SlotManager.
> 2019-08-21 20:00:28,225 INFO
> org.apache.flink.runtime.checkpoint.ZooKeeperCompletedCheckpointStore  -
> Recovering checkpoints from ZooKeeper.
> 2019-08-21 20:00:28,229 INFO
> org.apache.flink.runtime.checkpoint.ZooKeeperCompletedCheckpointStore  -
> Found 0 checkpoints in ZooKeeper.
> 2019-08-21 20:00:28,229 INFO
> org.apache.flink.runtime.checkpoint.ZooKeeperCompletedCheckpointStore  -
> Trying to fetch 0 checkpoints from storage.
> 2019-08-21 20:00:28,233 INFO
> org.apache.flink.runtime.leaderelection.ZooKeeperLeaderElectionService  -
> Starting ZooKeeperLeaderElectionService
> ZooKeeperLeaderElectionService{leaderPath='/leader/e5ea83238d721e932b4645589660406c/job_manager_lock'}.
> 2019-08-21 20:00:28,248 INFO
> org.apache.flink.runtime.jobmaster.JobManagerRunner           - JobManager
> runner for job StreamKafkaMain (e5ea83238d721e932b4645589660406c) was
> granted leadership with session id 12c96c94-ebbd-4991-a457-6f0320d71bc1 at
> akka.tcp://flink@host53:59609/user/jobmanager_0.
> 2019-08-21 20:00:28,260 INFO
> org.apache.flink.runtime.leaderretrieval.ZooKeeperLeaderRetrievalService  -
> Starting ZooKeeperLeaderRetrievalService /leader/resource_manager_lock.
> 2019-08-21 20:00:28,260 INFO
> org.apache.flink.runtime.jobmaster.JobMaster                  - Starting
> execution of job StreamKafkaMain (e5ea83238d721e932b4645589660406c) under
> job master id a4576f0320d71bc112c96c94ebbd4991.
> 2019-08-21 20:00:28,261 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Job
> StreamKafkaMain (e5ea83238d721e932b4645589660406c) switched from state
> CREATED to RUNNING.
> 2019-08-21 20:00:28,266 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source:
> Custom Source -> Flat Map -> Filter -> Filter -> Map ->
> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) ->
> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (1/4)
> (ee39866835f7b7387f518e352787a532) switched from CREATED to SCHEDULED.
> 2019-08-21 20:00:28,280 INFO
> org.apache.flink.runtime.jobmaster.slotpool.SlotPoolImpl      - Cannot
> serve slot request, no ResourceManager connected. Adding as pending request
> [SlotRequestId{f6123820ffb8b3e906b876e9dea4497b}]
> 2019-08-21 20:00:28,287 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source:
> Custom Source -> Flat Map -> Filter -> Filter -> Map ->
> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) ->
> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (2/4)
> (f1a893fc5e2440331c8e6db1874ad220) switched from CREATED to SCHEDULED.
> 2019-08-21 20:00:28,288 INFO
> org.apache.flink.runtime.jobmaster.slotpool.SlotPoolImpl      - Cannot
> serve slot request, no ResourceManager connected. Adding as pending request
> [SlotRequestId{fdbb2e71c9bc795452687396abff7bf8}]
> 2019-08-21 20:00:28,288 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source:
> Custom Source -> Flat Map -> Filter -> Filter -> Map ->
> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) ->
> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (3/4)
> (4756a0450881325af5fb396a18dbb8ae) switched from CREATED to SCHEDULED.
> 2019-08-21 20:00:28,289 INFO
> org.apache.flink.runtime.jobmaster.slotpool.SlotPoolImpl      - Cannot
> serve slot request, no ResourceManager connected. Adding as pending request
> [SlotRequestId{13970fab9312acf9d5063b1d1a33aa0e}]
> 2019-08-21 20:00:28,289 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source:
> Custom Source -> Flat Map -> Filter -> Filter -> Map ->
> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) ->
> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (4/4)
> (78f520582607e26b365fca483fc98d4c) switched from CREATED to SCHEDULED.
> 2019-08-21 20:00:28,289 INFO
> org.apache.flink.runtime.jobmaster.slotpool.SlotPoolImpl      - Cannot
> serve slot request, no ResourceManager connected. Adding as pending request
> [SlotRequestId{0621470f796aa3ea4291d1b8b423e428}]
> 2019-08-21 20:00:28,289 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - window:
> (TumblingGroupWindow('w$, 'rowtime, 60000.millis)), select: (COUNT(*) AS
> pv, COUNT(DISTINCT curuserid) AS uv, start('w$) AS w$start, end('w$) AS
> w$end, rowtime('w$) AS w$rowtime, proctime('w$) AS w$proctime) -> select:
> (pv, uv, utc2local(w$end) AS EXPR$2) -> to: Row (1/1)
> (0071e1c7c4f70646914329f81dcbd349) switched from CREATED to SCHEDULED.
> 2019-08-21 20:00:28,291 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print
> to Std. Out (1/4) (0a206444b10553cce42fa34ea0b15497) switched from CREATED
> to SCHEDULED.
> 2019-08-21 20:00:28,291 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print
> to Std. Out (2/4) (a5d1bb759ee6c8001313fe39c7982a8b) switched from CREATED
> to SCHEDULED.
> 2019-08-21 20:00:28,291 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print
> to Std. Out (3/4) (9543bbde6538ea9d407af3012fede5c8) switched from CREATED
> to SCHEDULED.
> 2019-08-21 20:00:28,291 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print
> to Std. Out (4/4) (0e8c4210bd81e581fbf354b86397ce41) switched from CREATED
> to SCHEDULED.
> 2019-08-21 20:00:28,291 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map ->
> Sink: Print to Std. Out (1/4) (39d437bceb033f5bd37747e9142c7f0e) switched
> from CREATED to SCHEDULED.
> 2019-08-21 20:00:28,291 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map ->
> Sink: Print to Std. Out (2/4) (aa4f64edc4f10da2e8bdbf6ebf0aeb83) switched
> from CREATED to SCHEDULED.
> 2019-08-21 20:00:28,291 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map ->
> Sink: Print to Std. Out (3/4) (f109918ce195d1c88c2c63a44934217f) switched
> from CREATED to SCHEDULED.
> 2019-08-21 20:00:28,292 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map ->
> Sink: Print to Std. Out (4/4) (9aa5bb28f3ba75afc1de47bd9a4e5d01) switched
> from CREATED to SCHEDULED.
> 2019-08-21 20:00:28,295 INFO
> org.apache.flink.runtime.jobmaster.JobMaster                  - Connecting
> to ResourceManager akka.tcp://flink@host53
> :59609/user/resourcemanager(97537c0d61bbf091e37cb6c45b784568)
> 2019-08-21 20:00:28,300 INFO
> org.apache.flink.runtime.jobmaster.JobMaster                  - Resolved
> ResourceManager address, beginning registration
> 2019-08-21 20:00:28,300 INFO
> org.apache.flink.runtime.jobmaster.JobMaster                  -
> Registration at ResourceManager attempt 1 (timeout=100ms)
> 2019-08-21 20:00:28,304 INFO
> org.apache.flink.runtime.leaderretrieval.ZooKeeperLeaderRetrievalService  -
> Starting ZooKeeperLeaderRetrievalService
> /leader/e5ea83238d721e932b4645589660406c/job_manager_lock.
> 2019-08-21 20:00:28,304 INFO
> org.apache.flink.yarn.YarnResourceManager                     - Registering
> job manager [hidden email]://flink@host53:59609/user/jobmanager_0
> for job e5ea83238d721e932b4645589660406c.
> 2019-08-21 20:00:28,310 INFO
> org.apache.flink.yarn.YarnResourceManager                     - Registered
> job manager [hidden email]://flink@host53:59609/user/jobmanager_0
> for job e5ea83238d721e932b4645589660406c.
> 2019-08-21 20:00:28,313 INFO
> org.apache.flink.runtime.jobmaster.JobMaster                  - JobManager
> successfully registered at ResourceManager, leader id:
> 97537c0d61bbf091e37cb6c45b784568.
> 2019-08-21 20:00:28,314 INFO
> org.apache.flink.runtime.jobmaster.slotpool.SlotPoolImpl      - Requesting
> new slot [SlotRequestId{13970fab9312acf9d5063b1d1a33aa0e}] and profile
> ResourceProfile{cpuCores=-1.0, heapMemoryInMB=-1, directMemoryInMB=0,
> nativeMemoryInMB=0, networkMemoryInMB=0} from resource manager.
> 2019-08-21 20:00:28,315 INFO
> org.apache.flink.yarn.YarnResourceManager                     - Request
> slot with profile ResourceProfile{cpuCores=-1.0, heapMemoryInMB=-1,
> directMemoryInMB=0, nativeMemoryInMB=0, networkMemoryInMB=0} for job
> e5ea83238d721e932b4645589660406c with allocation id
> faa6a521758e5be4b3dc1955ca0f51b4.
> 2019-08-21 20:00:28,316 INFO
> org.apache.flink.runtime.jobmaster.slotpool.SlotPoolImpl      - Requesting
> new slot [SlotRequestId{fdbb2e71c9bc795452687396abff7bf8}] and profile
> ResourceProfile{cpuCores=-1.0, heapMemoryInMB=-1, directMemoryInMB=0,
> nativeMemoryInMB=0, networkMemoryInMB=0} from resource manager.
> 2019-08-21 20:00:28,316 INFO
> org.apache.flink.runtime.jobmaster.slotpool.SlotPoolImpl      - Requesting
> new slot [SlotRequestId{0621470f796aa3ea4291d1b8b423e428}] and profile
> ResourceProfile{cpuCores=-1.0, heapMemoryInMB=-1, directMemoryInMB=0,
> nativeMemoryInMB=0, networkMemoryInMB=0} from resource manager.
> 2019-08-21 20:00:28,316 INFO
> org.apache.flink.runtime.jobmaster.slotpool.SlotPoolImpl      - Requesting
> new slot [SlotRequestId{f6123820ffb8b3e906b876e9dea4497b}] and profile
> ResourceProfile{cpuCores=-1.0, heapMemoryInMB=-1, directMemoryInMB=0,
> nativeMemoryInMB=0, networkMemoryInMB=0} from resource manager.
> 2019-08-21 20:00:28,321 INFO
> org.apache.flink.yarn.YarnResourceManager                     - Requesting
> new TaskExecutor container with resources <memory:8192, vCores:2>. Number
> pending requests 1.
> 2019-08-21 20:00:28,322 INFO
> org.apache.flink.yarn.YarnResourceManager                     - Request
> slot with profile ResourceProfile{cpuCores=-1.0, heapMemoryInMB=-1,
> directMemoryInMB=0, nativeMemoryInMB=0, networkMemoryInMB=0} for job
> e5ea83238d721e932b4645589660406c with allocation id
> a2b419aea4a373686eeed6c46163a528.
> 2019-08-21 20:00:28,323 INFO
> org.apache.flink.yarn.YarnResourceManager                     - Request
> slot with profile ResourceProfile{cpuCores=-1.0, heapMemoryInMB=-1,
> directMemoryInMB=0, nativeMemoryInMB=0, networkMemoryInMB=0} for job
> e5ea83238d721e932b4645589660406c with allocation id
> cbc11737ce6729cfe45f24c43180b5c8.
> 2019-08-21 20:00:28,323 INFO
> org.apache.flink.yarn.YarnResourceManager                     - Requesting
> new TaskExecutor container with resources <memory:8192, vCores:2>. Number
> pending requests 2.
> 2019-08-21 20:00:28,323 INFO
> org.apache.flink.yarn.YarnResourceManager                     - Request
> slot with profile ResourceProfile{cpuCores=-1.0, heapMemoryInMB=-1,
> directMemoryInMB=0, nativeMemoryInMB=0, networkMemoryInMB=0} for job
> e5ea83238d721e932b4645589660406c with allocation id
> f6566567c173a55b2bc52b72bcb24ff8.
> 2019-08-21 20:00:31,597 INFO
> org.apache.flink.runtime.checkpoint.CheckpointCoordinator     - Checkpoint
> triggering task Source: Custom Source -> Flat Map -> Filter -> Filter ->
> Map -> Timestamps/Watermarks -> from: (request, curuserid, timelong,
> rowtime) -> select: (rowtime, 0 AS $f1, curuserid) -> time attribute:
> (rowtime) (1/4) of job e5ea83238d721e932b4645589660406c is not in state
> RUNNING but SCHEDULED instead. Aborting checkpoint.
> 2019-08-21 20:00:33,749 INFO
> org.apache.hadoop.yarn.client.api.impl.AMRMClientImpl         - Received
> new token for : host53:59938
> 2019-08-21 20:00:33,749 INFO
> org.apache.hadoop.yarn.client.api.impl.AMRMClientImpl         - Received
> new token for : host52:33246
> 2019-08-21 20:00:33,765 INFO
> org.apache.flink.yarn.YarnResourceManager                     - Received
> new container: container_1566383236573_0003_01_000002 - Remaining pending
> container requests: 2
> 2019-08-21 20:00:33,765 INFO
> org.apache.flink.yarn.YarnResourceManager                     - Removing
> container request Capability[<memory:8192, vCores:2>]Priority[1]. Pending
> container requests 1.
> 2019-08-21 20:00:34,009 INFO
> org.apache.flink.yarn.YarnResourceManager                     - Creating
> container launch context for TaskManagers
> 2019-08-21 20:00:34,010 INFO
> org.apache.flink.yarn.YarnResourceManager                     - Starting
> TaskManagers
> 2019-08-21 20:00:34,021 INFO
> org.apache.hadoop.yarn.client.api.impl.ContainerManagementProtocolProxy  -
> Opening proxy : host53:59938
> 2019-08-21 20:00:34,066 INFO
> org.apache.flink.yarn.YarnResourceManager                     - Received
> new container: container_1566383236573_0003_01_000003 - Remaining pending
> container requests: 1
> 2019-08-21 20:00:34,066 INFO
> org.apache.flink.yarn.YarnResourceManager                     - Removing
> container request Capability[<memory:8192, vCores:2>]Priority[1]. Pending
> container requests 0.
> 2019-08-21 20:00:34,088 INFO
> org.apache.flink.yarn.YarnResourceManager                     - Creating
> container launch context for TaskManagers
> 2019-08-21 20:00:34,089 INFO
> org.apache.flink.yarn.YarnResourceManager                     - Starting
> TaskManagers
> 2019-08-21 20:00:34,090 INFO
> org.apache.hadoop.yarn.client.api.impl.ContainerManagementProtocolProxy  -
> Opening proxy : host52:33246
> 2019-08-21 20:00:36,597 INFO
> org.apache.flink.runtime.checkpoint.CheckpointCoordinator     - Checkpoint
> triggering task Source: Custom Source -> Flat Map -> Filter -> Filter ->
> Map -> Timestamps/Watermarks -> from: (request, curuserid, timelong,
> rowtime) -> select: (rowtime, 0 AS $f1, curuserid) -> time attribute:
> (rowtime) (1/4) of job e5ea83238d721e932b4645589660406c is not in state
> RUNNING but SCHEDULED instead. Aborting checkpoint.
> 2019-08-21 20:00:37,856 INFO
> org.apache.flink.yarn.YarnResourceManager                     - Registering
> TaskManager with ResourceID container_1566383236573_0003_01_000002
> (akka.tcp://flink@host53:34608/user/taskmanager_0) at ResourceManager
> 2019-08-21 20:00:39,573 INFO
> org.apache.flink.yarn.YarnResourceManager                     - Registering
> TaskManager with ResourceID container_1566383236573_0003_01_000003
> (akka.tcp://flink@host52:32931/user/taskmanager_0) at ResourceManager
> 2019-08-21 20:00:39,659 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source:
> Custom Source -> Flat Map -> Filter -> Filter -> Map ->
> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) ->
> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (1/4)
> (ee39866835f7b7387f518e352787a532) switched from SCHEDULED to DEPLOYING.
> 2019-08-21 20:00:39,659 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying
> Source: Custom Source -> Flat Map -> Filter -> Filter -> Map ->
> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) ->
> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (1/4)
> (attempt #0) to container_1566383236573_0003_01_000002 @ host53
> (dataPort=52456)
> 2019-08-21 20:00:39,664 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source:
> Custom Source -> Flat Map -> Filter -> Filter -> Map ->
> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) ->
> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (2/4)
> (f1a893fc5e2440331c8e6db1874ad220) switched from SCHEDULED to DEPLOYING.
> 2019-08-21 20:00:39,664 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying
> Source: Custom Source -> Flat Map -> Filter -> Filter -> Map ->
> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) ->
> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (2/4)
> (attempt #0) to container_1566383236573_0003_01_000002 @ host53
> (dataPort=52456)
> 2019-08-21 20:00:39,665 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source:
> Custom Source -> Flat Map -> Filter -> Filter -> Map ->
> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) ->
> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (3/4)
> (4756a0450881325af5fb396a18dbb8ae) switched from SCHEDULED to DEPLOYING.
> 2019-08-21 20:00:39,666 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying
> Source: Custom Source -> Flat Map -> Filter -> Filter -> Map ->
> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) ->
> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (3/4)
> (attempt #0) to container_1566383236573_0003_01_000003 @ host52
> (dataPort=39650)
> 2019-08-21 20:00:39,666 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source:
> Custom Source -> Flat Map -> Filter -> Filter -> Map ->
> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) ->
> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (4/4)
> (78f520582607e26b365fca483fc98d4c) switched from SCHEDULED to DEPLOYING.
> 2019-08-21 20:00:39,666 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying
> Source: Custom Source -> Flat Map -> Filter -> Filter -> Map ->
> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) ->
> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (4/4)
> (attempt #0) to container_1566383236573_0003_01_000003 @ host52
> (dataPort=39650)
> 2019-08-21 20:00:39,666 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - window:
> (TumblingGroupWindow('w$, 'rowtime, 60000.millis)), select: (COUNT(*) AS
> pv, COUNT(DISTINCT curuserid) AS uv, start('w$) AS w$start, end('w$) AS
> w$end, rowtime('w$) AS w$rowtime, proctime('w$) AS w$proctime) -> select:
> (pv, uv, utc2local(w$end) AS EXPR$2) -> to: Row (1/1)
> (0071e1c7c4f70646914329f81dcbd349) switched from SCHEDULED to DEPLOYING.
> 2019-08-21 20:00:39,666 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying
> window: (TumblingGroupWindow('w$, 'rowtime, 60000.millis)), select:
> (COUNT(*) AS pv, COUNT(DISTINCT curuserid) AS uv, start('w$) AS w$start,
> end('w$) AS w$end, rowtime('w$) AS w$rowtime, proctime('w$) AS w$proctime)
> -> select: (pv, uv, utc2local(w$end) AS EXPR$2) -> to: Row (1/1) (attempt
> #0) to container_1566383236573_0003_01_000002 @ host53 (dataPort=52456)
> 2019-08-21 20:00:39,670 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print
> to Std. Out (1/4) (0a206444b10553cce42fa34ea0b15497) switched from
> SCHEDULED to DEPLOYING.
> 2019-08-21 20:00:39,670 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying
> Sink: Print to Std. Out (1/4) (attempt #0) to
> container_1566383236573_0003_01_000003 @ host52 (dataPort=39650)
> 2019-08-21 20:00:39,671 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print
> to Std. Out (2/4) (a5d1bb759ee6c8001313fe39c7982a8b) switched from
> SCHEDULED to DEPLOYING.
> 2019-08-21 20:00:39,672 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying
> Sink: Print to Std. Out (2/4) (attempt #0) to
> container_1566383236573_0003_01_000003 @ host52 (dataPort=39650)
> 2019-08-21 20:00:39,672 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print
> to Std. Out (3/4) (9543bbde6538ea9d407af3012fede5c8) switched from
> SCHEDULED to DEPLOYING.
> 2019-08-21 20:00:39,672 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying
> Sink: Print to Std. Out (3/4) (attempt #0) to
> container_1566383236573_0003_01_000002 @ host53 (dataPort=52456)
> 2019-08-21 20:00:39,672 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print
> to Std. Out (4/4) (0e8c4210bd81e581fbf354b86397ce41) switched from
> SCHEDULED to DEPLOYING.
> 2019-08-21 20:00:39,673 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying
> Sink: Print to Std. Out (4/4) (attempt #0) to
> container_1566383236573_0003_01_000002 @ host53 (dataPort=52456)
> 2019-08-21 20:00:39,673 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map ->
> Sink: Print to Std. Out (1/4) (39d437bceb033f5bd37747e9142c7f0e) switched
> from SCHEDULED to DEPLOYING.
> 2019-08-21 20:00:39,673 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying
> Map -> Sink: Print to Std. Out (1/4) (attempt #0) to
> container_1566383236573_0003_01_000003 @ host52 (dataPort=39650)
> 2019-08-21 20:00:39,675 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map ->
> Sink: Print to Std. Out (2/4) (aa4f64edc4f10da2e8bdbf6ebf0aeb83) switched
> from SCHEDULED to DEPLOYING.
> 2019-08-21 20:00:39,675 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying
> Map -> Sink: Print to Std. Out (2/4) (attempt #0) to
> container_1566383236573_0003_01_000003 @ host52 (dataPort=39650)
> 2019-08-21 20:00:39,675 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map ->
> Sink: Print to Std. Out (3/4) (f109918ce195d1c88c2c63a44934217f) switched
> from SCHEDULED to DEPLOYING.
> 2019-08-21 20:00:39,675 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying
> Map -> Sink: Print to Std. Out (3/4) (attempt #0) to
> container_1566383236573_0003_01_000002 @ host53 (dataPort=52456)
> 2019-08-21 20:00:39,676 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map ->
> Sink: Print to Std. Out (4/4) (9aa5bb28f3ba75afc1de47bd9a4e5d01) switched
> from SCHEDULED to DEPLOYING.
> 2019-08-21 20:00:39,676 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying
> Map -> Sink: Print to Std. Out (4/4) (attempt #0) to
> container_1566383236573_0003_01_000002 @ host53 (dataPort=52456)
> 2019-08-21 20:00:39,782 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print
> to Std. Out (4/4) (0e8c4210bd81e581fbf354b86397ce41) switched from
> DEPLOYING to RUNNING.
> 2019-08-21 20:00:39,783 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print
> to Std. Out (3/4) (9543bbde6538ea9d407af3012fede5c8) switched from
> DEPLOYING to RUNNING.
> 2019-08-21 20:00:39,784 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map ->
> Sink: Print to Std. Out (4/4) (9aa5bb28f3ba75afc1de47bd9a4e5d01) switched
> from DEPLOYING to RUNNING.
> 2019-08-21 20:00:39,787 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map ->
> Sink: Print to Std. Out (3/4) (f109918ce195d1c88c2c63a44934217f) switched
> from DEPLOYING to RUNNING.
> 2019-08-21 20:00:39,806 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source:
> Custom Source -> Flat Map -> Filter -> Filter -> Map ->
> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) ->
> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (2/4)
> (f1a893fc5e2440331c8e6db1874ad220) switched from DEPLOYING to RUNNING.
> 2019-08-21 20:00:39,809 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source:
> Custom Source -> Flat Map -> Filter -> Filter -> Map ->
> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) ->
> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (1/4)
> (ee39866835f7b7387f518e352787a532) switched from DEPLOYING to RUNNING.
> 2019-08-21 20:00:39,809 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - window:
> (TumblingGroupWindow('w$, 'rowtime, 60000.millis)), select: (COUNT(*) AS
> pv, COUNT(DISTINCT curuserid) AS uv, start('w$) AS w$start, end('w$) AS
> w$end, rowtime('w$) AS w$rowtime, proctime('w$) AS w$proctime) -> select:
> (pv, uv, utc2local(w$end) AS EXPR$2) -> to: Row (1/1)
> (0071e1c7c4f70646914329f81dcbd349) switched from DEPLOYING to RUNNING.
> 2019-08-21 20:00:39,991 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print
> to Std. Out (1/4) (0a206444b10553cce42fa34ea0b15497) switched from
> DEPLOYING to RUNNING.
> 2019-08-21 20:00:39,992 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print
> to Std. Out (2/4) (a5d1bb759ee6c8001313fe39c7982a8b) switched from
> DEPLOYING to RUNNING.
> 2019-08-21 20:00:39,996 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map ->
> Sink: Print to Std. Out (1/4) (39d437bceb033f5bd37747e9142c7f0e) switched
> from DEPLOYING to RUNNING.
> 2019-08-21 20:00:39,997 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map ->
> Sink: Print to Std. Out (2/4) (aa4f64edc4f10da2e8bdbf6ebf0aeb83) switched
> from DEPLOYING to RUNNING.
> 2019-08-21 20:00:40,005 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source:
> Custom Source -> Flat Map -> Filter -> Filter -> Map ->
> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) ->
> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (4/4)
> (78f520582607e26b365fca483fc98d4c) switched from DEPLOYING to RUNNING.
> 2019-08-21 20:00:40,006 INFO
> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source:
> Custom Source -> Flat Map -> Filter -> Filter -> Map ->
> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) ->
> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (3/4)
> (4756a0450881325af5fb396a18dbb8ae) switched from DEPLOYING to RUNNING.
> 2019-08-21 20:00:41,605 INFO
> org.apache.flink.runtime.checkpoint.CheckpointCoordinator     - Triggering
> checkpoint 1 @ 1566388841597 for job e5ea83238d721e932b4645589660406c.
> 2019-08-21 20:00:42,466 INFO
> org.apache.flink.runtime.checkpoint.CheckpointCoordinator     - Completed
> checkpoint 1 for job e5ea83238d721e932b4645589660406c (7994 bytes in 821
> ms).
>
>
>
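> ------------------ Original Message ------------------
> From: "Yun Tang"<[hidden email]>;
> Sent: Thursday, August 22, 2019, 11:08 AM
> To: "user-zh"<[hidden email]>;
>
> Subject: Re: Flink startup waits 10 minutes
>
>
>
> Hi
>
> A Flink on YARN job can be slow to start for many reasons, for example
> waiting because resources are insufficient, or a container exiting right
> after it was requested. The default slot request timeout is 5 minutes, so my
> guess is that your job hit one slot request timeout and then requested the
> slot again. Please provide the JobManager log so we can analyze this further.
>
> Best regards,
> Yun Tang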
> ________________________________
> From: 々守护々 <[hidden email]>
> Sent: Thursday, August 22, 2019 11:04
> To: user-zh <[hidden email]>
> Subject: Flink startup waits 10 minutes
>
> Hello, Flink community experts!
>       I am using Flink SQL (Flink 1.8.1) and adjusted the memory parameters in yarn-site.xml on the Hadoop cluster:
>         <property>
>                 <name>yarn.nodemanager.resource.memory-mb</name>
>                 <value>16384</value>
>         </property>
>         <property>
>                 <name>yarn.scheduler.minimum-allocation-mb</name>
>                 <value>1024</value>
>         </property>
>         <property>
>                 <name>yarn.scheduler.maximum-allocation-mb</name>
>                 <value>8192</value>
>         </property>
>
>
>
> I then launched the Flink on YARN job with these arguments: ./flink-1.8.1/bin/flink run -m yarn-cluster -p
> 4 -d -c streaming.StreamKafkaMain -yn 4 -ys 2 -yjm 2048 -ytm 8192  -ynm
> bigdata6 ./flink_bigdata6.jar
> It does start successfully in the end, but startup takes 10 minutes, which is far too long. Does anyone know how to fix this?
>

Re: Re: Flink startup waits 10 minutes

守护
In reply to this post by tison
This is the log printed on the terminal. Only after ten minutes did it reach this line: Submitting application master application_1566383236573_0004



2019-08-22 11:28:21,766 WARN  org.apache.flink.yarn.AbstractYarnClusterDescriptor           - This YARN session requires 34816MB of memory in the cluster. There are currently only 26624MB available.
The Flink YARN client will try to allocate the YARN session, but maybe not all TaskManagers are connecting from the beginning because the resources are currently not available in the cluster. The allocation might take more time than usual because the Flink YARN client needs to wait until the resources become available.
2019-08-22 11:28:21,766 WARN  org.apache.flink.yarn.AbstractYarnClusterDescriptor           - There is not enough memory available in the YARN cluster. The TaskManager(s) require 8192MB each. NodeManagers available: [14336, 12288]
After allocating the JobManager (2048MB) and (2/4) TaskManagers, the following NodeManagers are available: [4096, 4096]
The Flink YARN client will try to allocate the YARN session, but maybe not all TaskManagers are connecting from the beginning because the resources are currently not available in the cluster. The allocation might take more time than usual because the Flink YARN client needs to wait until the resources become available.
2019-08-22 11:28:21,766 WARN  org.apache.flink.yarn.AbstractYarnClusterDescriptor           - There is not enough memory available in the YARN cluster. The TaskManager(s) require 8192MB each. NodeManagers available: [14336, 12288]
After allocating the JobManager (2048MB) and (3/4) TaskManagers, the following NodeManagers are available: [4096, 4096]
The Flink YARN client will try to allocate the YARN session, but maybe not all TaskManagers are connecting from the beginning because the resources are currently not available in the cluster. The allocation might take more time than usual because the Flink YARN client needs to wait until the resources become available.
2019-08-22 11:28:21,767 INFO  org.apache.flink.yarn.AbstractYarnClusterDescriptor           - Cluster specification: ClusterSpecification{masterMemoryMB=2048, taskManagerMemoryMB=8192, numberTaskManagers=4, slotsPerTaskManager=2}
2019-08-22 11:28:22,317 WARN  org.apache.flink.yarn.AbstractYarnClusterDescriptor           - The configuration directory ('/usr/flink/flink-1.8.1/conf') contains both LOG4J and Logback configuration files. Please delete or rename one of them.
2019-08-22 11:38:02,185 INFO  org.apache.flink.yarn.AbstractYarnClusterDescriptor           - Submitting application master application_1566383236573_0004
2019-08-22 11:38:02,226 INFO  org.apache.hadoop.yarn.client.api.impl.YarnClientImpl         - Submitted application application_1566383236573_0004
2019-08-22 11:38:02,226 INFO  org.apache.flink.yarn.AbstractYarnClusterDescriptor           - Waiting for the cluster to be allocated
2019-08-22 11:38:02,228 INFO  org.apache.flink.yarn.AbstractYarnClusterDescriptor           - Deploying cluster, current state ACCEPTED
2019-08-22 11:38:07,244 INFO  org.apache.flink.yarn.AbstractYarnClusterDescriptor           - YARN application has been deployed successfully.
2019-08-22 11:38:07,244 INFO  org.apache.flink.yarn.AbstractYarnClusterDescriptor           - The Flink YARN client has been started in detached mode. In order to stop Flink on YARN, use the following command or a YARN web interface to stop it:
yarn application -kill application_1566383236573_0004
Please also note that the temporary files of the YARN session in the home directory will not be removed.
Job has been submitted with JobID c16c4fc1556ccb2ceaaa2f0e8f32ec88
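
For reference, the figures in the client warnings above can be reproduced with simple arithmetic. The Python sketch below is only an illustration: the function names are hypothetical, and the greedy placement (JobManager on the node with the most free memory, then as many TaskManager-sized containers as fit) is an assumption about how such an estimate can be made, not Flink's actual code. All numbers are copied from the warnings and from the launch command.

```python
import math


def required_memory_mb(jm_mb, tm_mb, num_tms):
    """Total memory the session asks for: one JobManager plus N TaskManagers."""
    return jm_mb + tm_mb * num_tms


def placeable_taskmanagers(node_free_mb, jm_mb, tm_mb):
    """Greedy estimate (assumption): put the JobManager on the node with the
    most free memory, then count how many TaskManager containers still fit.
    Returns (number of TaskManagers placed, free memory left per node)."""
    free = sorted(node_free_mb, reverse=True)
    free[0] -= jm_mb
    placed = 0
    for i, mb in enumerate(free):
        fit = mb // tm_mb
        placed += fit
        free[i] = mb - fit * tm_mb
    return placed, free


def taskmanagers_for_parallelism(parallelism, slots_per_tm):
    """Smallest number of TaskManagers that offers `parallelism` slots."""
    return math.ceil(parallelism / slots_per_tm)


if __name__ == "__main__":
    nodes = [14336, 12288]  # "NodeManagers available" from the warning
    print("requested MB:", required_memory_mb(2048, 8192, 4))
    print("available MB:", sum(nodes))
    print("placeable TMs, leftover:", placeable_taskmanagers(nodes, 2048, 8192))
    print("TMs needed for -p 4 -ys 2:", taskmanagers_for_parallelism(4, 2))
```

With -p 4 and -ys 2, two TaskManagers already provide four slots, which matches the two containers that registered in the JobManager log quoted earlier in the thread. Whether the oversized request has anything to do with the ten-minute wait is not something this arithmetic can show.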

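------------------ Original Message ------------------
From: "Zili Chen"<[hidden email]>;
Sent: Thursday, August 22, 2019, 11:37 AM
To: "user-zh"<[hidden email]>;
Subject: Re: Re: Flink startup waits 10 minutes

When you say it "just stops there", do you mean the terminal output of flink run stops moving? Check in that terminal output when YARN accepted your application. I suspect
the YARN cluster was busy and did not respond for 10 minutes.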

Best,
tison.
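
One way to follow this suggestion is to look for the largest jump between consecutive timestamps in the client output. The Python sketch below is a minimal illustration, assuming log lines start with a `yyyy-MM-dd HH:mm:ss,SSS` timestamp as in the logs in this thread; the helper names are hypothetical, and the sample lines keep the original timestamps but abbreviate the messages.

```python
import re
from datetime import datetime

# Matches the leading timestamp of a log line, e.g. "2019-08-22 11:28:22,317".
TS = re.compile(r"^(\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2}),(\d{3})")


def parse_ts(line):
    """Return the line's timestamp, or None for continuation lines."""
    m = TS.match(line)
    if m is None:
        return None
    base = datetime.strptime(m.group(1), "%Y-%m-%d %H:%M:%S")
    return base.replace(microsecond=int(m.group(2)) * 1000)


def largest_gap(lines):
    """Return (gap in seconds, line before the gap, line after the gap),
    or None if fewer than two timestamped lines are present."""
    stamped = [(parse_ts(l), l) for l in lines]
    stamped = [(t, l) for t, l in stamped if t is not None]
    best = None
    for (t0, l0), (t1, l1) in zip(stamped, stamped[1:]):
        gap = (t1 - t0).total_seconds()
        if best is None or gap > best[0]:
            best = (gap, l0, l1)
    return best


# Timestamps copied from the client log above; messages abbreviated.
SAMPLE = [
    "2019-08-22 11:28:21,767 INFO  Cluster specification",
    "2019-08-22 11:28:22,317 WARN  both LOG4J and Logback configuration files",
    "2019-08-22 11:38:02,185 INFO  Submitting application master",
    "2019-08-22 11:38:02,228 INFO  Deploying cluster, current state ACCEPTED",
    "2019-08-22 11:38:07,244 INFO  YARN application has been deployed successfully.",
]

if __name__ == "__main__":
    gap, before, after = largest_gap(SAMPLE)
    print(f"largest gap: {gap:.3f} s")
    print("  before:", before)
    print("  after: ", after)
```

On the client log above, the largest gap sits between the LOG4J/Logback warning at 11:28:22 and "Submitting application master" at 11:38:02, that is, before the application was handed to YARN. From ACCEPTED to deployed took about five seconds.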


Zili Chen <[hidden email]> wrote on Thursday, August 22, 2019 at 11:35 AM:

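> The user-zh list does not support inline images. Upload the image to third-party storage and paste a link, or, as far as I remember, you can send it as an email attachment.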
>
> Best,
> tison.
>
>
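> 々守护々 <[hidden email]> wrote on Thursday, August 22, 2019 at 11:33 AM:
>
>> Here is what happens: at startup the client log just stops there, and the YARN application is only created after waiting 10 minutes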
>> <http://hadoop.changba.com/cluster/app/application_1566383236573_0003>
>>
>>
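>> ------------------ Original Message ------------------
>> *From:* "Yun Tang"<[hidden email]>;
>> *Sent:* Thursday, August 22, 2019, 11:23 AM
>> *To:* "user-zh"<[hidden email]>;
>> *Subject:* Re: Re: Flink startup waits 10 minutes
>>
>> The start time is 20:00:25, and by 20:00:42 all tasks were RUNNING and even
>> the first checkpoint had completed. That is only 17 seconds in total, so where does the 10-minute problem come from?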
>> ________________________________
>> From: 々守护々 <[hidden email]>
>> Sent: Thursday, August 22, 2019 11:18
>> To: user-zh <[hidden email]>
>> Subject: Re: Flink startup waits 10 minutes
>>
>> Hello, this is my JobManager startup log. Please take a look, thank you!
>>
>>
>> 2019-08-21 20:00:25,428 INFO
>> org.apache.flink.runtime.entrypoint.ClusterEntrypoint         -
>> --------------------------------------------------------------------------------
>> 2019-08-21 20:00:25,430 INFO
>> org.apache.flink.runtime.entrypoint.ClusterEntrypoint         - Registered
>> UNIX signal handlers for [TERM, HUP, INT]
>> 2019-08-21 20:00:25,433 INFO
>> org.apache.flink.runtime.entrypoint.ClusterEntrypoint         - YARN daemon
>> is running as: root Yarn client user obtainer: root
>> 2019-08-21 20:00:25,437 INFO
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: high-availability.cluster-id,
>> application_1566383236573_0003
>> 2019-08-21 20:00:25,437 INFO
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: jobmanager.rpc.address, localhost
>> 2019-08-21 20:00:25,437 INFO
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: jobmanager.rpc.port, 6123
>> 2019-08-21 20:00:25,437 INFO
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: high-availability.zookeeper.path.root, /flink-yarn
>> 2019-08-21 20:00:25,437 INFO
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: high-availability.storageDir,
>> hdfs://host51:9000/flink/ha-yarn
>> 2019-08-21 20:00:25,438 INFO
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: high-availability.zookeeper.quorum, host51:2181
>> 2019-08-21 20:00:25,438 INFO
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: internal.cluster.execution-mode, DETACHED
>> 2019-08-21 20:00:25,438 INFO
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: high-availability, zookeeper
>> 2019-08-21 20:00:25,438 INFO
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: parallelism.default, 1
>> 2019-08-21 20:00:25,438 INFO
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: yarn.application-attempts, 10
>> 2019-08-21 20:00:25,438 INFO
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: taskmanager.numberOfTaskSlots, 2
>> 2019-08-21 20:00:25,439 INFO
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: taskmanager.heap.size, 8192m
>> 2019-08-21 20:00:25,439 INFO
>> org.apache.flink.configuration.GlobalConfiguration            - Loading
>> configuration property: jobmanager.heap.size, 2048m
>> 2019-08-21 20:00:25,465 WARN
>> org.apache.flink.configuration.Configuration                  - Config uses
>> deprecated configuration key 'web.port' instead of proper key
>> 'rest.bind-port'
>> 2019-08-21 20:00:25,469 INFO
>> org.apache.flink.runtime.clusterframework.BootstrapTools      - Setting
>> directories for temporary files to:
>> /usr/hadoop/hadoop-2.7.7/tmp/nm-local-dir/usercache/root/appcache/application_1566383236573_0003
>> 2019-08-21 20:00:25,485 INFO
>> org.apache.flink.runtime.entrypoint.ClusterEntrypoint         - Starting
>> YarnJobClusterEntrypoint.
>> 2019-08-21 20:00:25,485 INFO
>> org.apache.flink.runtime.entrypoint.ClusterEntrypoint         - Install
>> default filesystem.
>> 2019-08-21 20:00:25,561 INFO
>> org.apache.flink.runtime.security.modules.HadoopModule        - Hadoop user
>> set to root (auth:SIMPLE)
>> 2019-08-21 20:00:25,580 INFO
>> org.apache.flink.runtime.entrypoint.ClusterEntrypoint         -
>> Initializing cluster services.
>> 2019-08-21 20:00:25,757 INFO
>> org.apache.flink.runtime.rpc.akka.AkkaRpcServiceUtils         - Trying to
>> start actor system at host53:0
>> 2019-08-21 20:00:26,187 INFO
>> akka.event.slf4j.Slf4jLogger                                  - Slf4jLogger
>> started
>> 2019-08-21 20:00:26,251 INFO
>> akka.remote.Remoting                                          - Starting
>> remoting
>> 2019-08-21 20:00:26,439 INFO
>> akka.remote.Remoting                                          - Remoting
>> started; listening on addresses :[akka.tcp://flink@host53:59609]
>> 2019-08-21 20:00:26,447 INFO
>> org.apache.flink.runtime.rpc.akka.AkkaRpcServiceUtils         - Actor
>> system started at akka.tcp://flink@host53:59609
>> 2019-08-21 20:00:27,011 INFO
>> org.apache.flink.runtime.blob.FileSystemBlobStore             - Creating
>> highly available BLOB storage directory at
>> hdfs://host51:9000/flink/ha-yarn/application_1566383236573_0003/blob
>> 2019-08-21 20:00:27,097 INFO
>> org.apache.flink.runtime.util.ZooKeeperUtils                  - Enforcing
>> default ACL for ZK connections
>> 2019-08-21 20:00:27,097 INFO
>> org.apache.flink.runtime.util.ZooKeeperUtils                  - Using
>> '/flink-yarn/application_1566383236573_0003' as Zookeeper namespace.
>> 2019-08-21 20:00:27,161 INFO
>> org.apache.flink.shaded.curator.org.apache.curator.framework.imps.CuratorFrameworkImpl
>> - Starting
>> 2019-08-21 20:00:27,169 INFO
>> org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client
>> environment:zookeeper.version=3.4.10-39d3a4f269333c922ed3db283be479f9deacaa0f,
>> built on 03/23/2017 10:13 GMT
>> 2019-08-21 20:00:27,169 INFO
>> org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client
>> environment:host.name=host53
>> 2019-08-21 20:00:27,169 INFO
>> org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client
>> environment:java.version=1.8.0_77
>> 2019-08-21 20:00:27,169 INFO
>> org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client
>> environment:java.vendor=Oracle Corporation
>> 2019-08-21 20:00:27,169 INFO
>> org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client
>> 2019-08-21 20:00:27,170 INFO
>> org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client
>> environment:java.library.path=:/usr/hadoop/hadoop-2.7.7/lib/native:/usr/hadoop/hadoop-2.7.7/lib/native:/usr/java/packages/lib/amd64:/usr/lib64:/lib64:/lib:/usr/lib
>> 2019-08-21 20:00:27,170 INFO
>> org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client
>> environment:java.io.tmpdir=/tmp
>> 2019-08-21 20:00:27,170 INFO
>> org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client
>> environment:java.compiler=<NA>
>> 2019-08-21 20:00:27,170 INFO
>> org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client
>> environment:os.name=Linux
>> 2019-08-21 20:00:27,170 INFO
>> org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client
>> environment:os.arch=amd64
>> 2019-08-21 20:00:27,170 INFO
>> org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client
>> environment:os.version=3.16.0-0.bpo.4-amd64
>> 2019-08-21 20:00:27,170 INFO
>> org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client
>> environment:user.name=root
>> 2019-08-21 20:00:27,170 INFO
>> org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client
>> environment:user.home=/root
>> 2019-08-21 20:00:27,170 INFO
>> org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  - Client
>> environment:user.dir=/usr/hadoop/hadoop-2.7.7/tmp/nm-local-dir/usercache/root/appcache/application_1566383236573_0003/container_1566383236573_0003_01_000001
>> 2019-08-21 20:00:27,171 INFO
>> org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  -
>> Initiating client connection, connectString=host51:2181
>> sessionTimeout=60000
>> watcher=org.apache.flink.shaded.curator.org.apache.curator.ConnectionState@319854f0
>> 2019-08-21 20:00:27,184 WARN
>> org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ClientCnxn  - SASL
>> configuration failed: javax.security.auth.login.LoginException: No JAAS
>> configuration section named 'Client' was found in specified JAAS
>> configuration file: '/tmp/jaas-4274640356638794035.conf'. Will continue
>> connection to Zookeeper server without SASL authentication, if Zookeeper
>> server allows it.
>> 2019-08-21 20:00:27,186 INFO
>> org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ClientCnxn  -
>> Opening socket connection to server host51/10.10.10.51:2181
>> 2019-08-21 20:00:27,187 ERROR
>> org.apache.flink.shaded.curator.org.apache.curator.ConnectionState  -
>> Authentication failed
>> 2019-08-21 20:00:27,187 INFO
>> org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ClientCnxn  - Socket
>> connection established to host51/10.10.10.51:2181, initiating session
>> 2019-08-21 20:00:27,193 INFO
>> org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ClientCnxn  -
>> Session establishment complete on server host51/10.10.10.51:2181,
>> sessionid = 0x1001a86de810008, negotiated timeout = 40000
>> 2019-08-21 20:00:27,193 INFO
>> org.apache.flink.runtime.blob.BlobServer                      - Created
>> BLOB server storage directory
>> /usr/hadoop/hadoop-2.7.7/tmp/nm-local-dir/usercache/root/appcache/application_1566383236573_0003/blobStore-eaa2bb2b-cddb-42e3-8ce5-bf2f83e8567a
>> 2019-08-21 20:00:27,195 INFO
>> org.apache.flink.shaded.curator.org.apache.curator.framework.state.ConnectionStateManager
>> - State change: CONNECTED
>> 2019-08-21 20:00:27,198 INFO
>> org.apache.flink.runtime.blob.BlobServer                      - Started
>> BLOB server at 0.0.0.0:58591 - max concurrent requests: 50 - max
>> backlog: 1000
>> 2019-08-21 20:00:27,213 INFO
>> org.apache.flink.runtime.metrics.MetricRegistryImpl           - No metrics
>> reporter configured, no metrics will be exposed/reported.
>> 2019-08-21 20:00:27,215 INFO
>> org.apache.flink.runtime.entrypoint.ClusterEntrypoint         - Trying to
>> start actor system at host53:0
>> 2019-08-21 20:00:27,239 INFO
>> akka.event.slf4j.Slf4jLogger                                  - Slf4jLogger
>> started
>> 2019-08-21 20:00:27,247 INFO
>> akka.remote.Remoting                                          - Starting
>> remoting
>> 2019-08-21 20:00:27,255 INFO
>> akka.remote.Remoting                                          - Remoting
>> started; listening on addresses :[akka.tcp://flink-metrics@host53:59941]
>> 2019-08-21 20:00:27,257 INFO
>> org.apache.flink.runtime.entrypoint.ClusterEntrypoint         - Actor
>> system started at akka.tcp://flink-metrics@host53:59941
>> 2019-08-21 20:00:27,263 INFO
>> org.apache.flink.runtime.blob.TransientBlobCache              - Created
>> BLOB cache storage directory
>> /usr/hadoop/hadoop-2.7.7/tmp/nm-local-dir/usercache/root/appcache/application_1566383236573_0003/blobStore-405a4797-824f-48da-bb50-2ee2e230d249
>> 2019-08-21 20:00:27,296 WARN
>> org.apache.flink.configuration.Configuration                  - Config uses
>> deprecated configuration key 'web.port' instead of proper key
>> 'rest.bind-port'
>> 2019-08-21 20:00:27,298 WARN
>> org.apache.flink.runtime.jobmaster.MiniDispatcherRestEndpoint  - Upload
>> directory
>> /tmp/flink-web-57c9cddd-64af-4e3d-96a9-4f9e78d20b86/flink-web-upload does
>> not exist, or has been deleted externally. Previously uploaded files are no
>> longer available.
>> 2019-08-21 20:00:27,298 INFO
>> org.apache.flink.runtime.jobmaster.MiniDispatcherRestEndpoint  - Created
>> directory
>> /tmp/flink-web-57c9cddd-64af-4e3d-96a9-4f9e78d20b86/flink-web-upload for
>> file uploads.
>> 2019-08-21 20:00:27,320 INFO
>> org.apache.flink.runtime.jobmaster.MiniDispatcherRestEndpoint  - Starting
>> rest endpoint.
>> 2019-08-21 20:00:27,616 INFO
>> org.apache.flink.runtime.webmonitor.WebMonitorUtils           - Determined
>> location of main cluster component log file:
>> /usr/hadoop/hadoop-2.7.7/logs/userlogs/application_1566383236573_0003/container_1566383236573_0003_01_000001/jobmanager.log
>> 2019-08-21 20:00:27,616 INFO
>> org.apache.flink.runtime.webmonitor.WebMonitorUtils           - Determined
>> location of main cluster component stdout file:
>> /usr/hadoop/hadoop-2.7.7/logs/userlogs/application_1566383236573_0003/container_1566383236573_0003_01_000001/jobmanager.out
>> 2019-08-21 20:00:27,771 INFO
>> org.apache.flink.runtime.jobmaster.MiniDispatcherRestEndpoint  - Rest
>> endpoint listening at host53:46401
>> 2019-08-21 20:00:27,772 INFO
>> org.apache.flink.runtime.leaderelection.ZooKeeperLeaderElectionService  -
>> Starting ZooKeeperLeaderElectionService
>> ZooKeeperLeaderElectionService{leaderPath='/leader/rest_server_lock'}.
>> 2019-08-21 20:00:27,795 WARN
>> org.apache.flink.shaded.curator.org.apache.curator.utils.ZKPaths  - The
>> version of ZooKeeper being used doesn't support Container nodes.
>> CreateMode.PERSISTENT will be used instead.
>> 2019-08-21 20:00:27,807 INFO
>> org.apache.flink.runtime.jobmaster.MiniDispatcherRestEndpoint  - Web
>> frontend listening at http://host53:46401.
>> 2019-08-21 20:00:27,836 INFO
>> org.apache.flink.runtime.jobmaster.MiniDispatcherRestEndpoint  -
>> http://host53:46401 was granted leadership with
>> leaderSessionID=8487757f-fd82-42b2-a1a6-adcc69e1eba1
>> 2019-08-21 20:00:27,932 INFO
>> org.apache.flink.runtime.rpc.akka.AkkaRpcService              - Starting
>> RPC endpoint for org.apache.flink.yarn.YarnResourceManager at
>> akka://flink/user/resourcemanager .
>> 2019-08-21 20:00:28,021 INFO
>> org.apache.flink.runtime.rpc.akka.AkkaRpcService              - Starting
>> RPC endpoint for org.apache.flink.runtime.dispatcher.MiniDispatcher at
>> akka://flink/user/dispatcher .
>> 2019-08-21 20:00:28,033 INFO
>> org.apache.flink.runtime.leaderretrieval.ZooKeeperLeaderRetrievalService  -
>> Starting ZooKeeperLeaderRetrievalService /leader/resource_manager_lock.
>> 2019-08-21 20:00:28,033 INFO
>> org.apache.flink.runtime.leaderretrieval.ZooKeeperLeaderRetrievalService  -
>> Starting ZooKeeperLeaderRetrievalService /leader/dispatcher_lock.
>> 2019-08-21 20:00:28,033 INFO
>> org.apache.flink.runtime.leaderelection.ZooKeeperLeaderElectionService  -
>> Starting ZooKeeperLeaderElectionService
>> ZooKeeperLeaderElectionService{leaderPath='/leader/dispatcher_lock'}.
>> 2019-08-21 20:00:28,045 INFO
>> org.apache.flink.runtime.dispatcher.MiniDispatcher            - Dispatcher
>> akka.tcp://flink@host53:59609/user/dispatcher was granted leadership
>> with fencing token 6216d17e-56b1-43bb-9bb3-f9fa5b8921be
>> 2019-08-21 20:00:28,048 INFO
>> org.apache.flink.runtime.dispatcher.MiniDispatcher            - Recovering
>> all persisted jobs.
>> 2019-08-21 20:00:28,054 INFO
>> org.apache.hadoop.yarn.client.RMProxy                         - Connecting
>> to ResourceManager at host51/10.10.10.51:8030
>> 2019-08-21 20:00:28,078 INFO
>> org.apache.flink.runtime.rpc.akka.AkkaRpcService              - Starting
>> RPC endpoint for org.apache.flink.runtime.jobmaster.JobMaster at
>> akka://flink/user/jobmanager_0 .
>> 2019-08-21 20:00:28,086 INFO
>> org.apache.flink.runtime.jobmaster.JobMaster                  -
>> Initializing job StreamKafkaMain (e5ea83238d721e932b4645589660406c).
>> 2019-08-21 20:00:28,099 INFO
>> org.apache.flink.runtime.jobmaster.JobMaster                  - Using
>> restart strategy
>> FixedDelayRestartStrategy(maxNumberRestartAttempts=2147483647,
>> delayBetweenRestartAttempts=0) for StreamKafkaMain
>> (e5ea83238d721e932b4645589660406c).
>> 2019-08-21 20:00:28,122 INFO
>> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Job
>> recovers via failover strategy: full graph restart
>> 2019-08-21 20:00:28,153 INFO
>> org.apache.flink.runtime.jobmaster.JobMaster                  - Running
>> initialization on master for job StreamKafkaMain
>> (e5ea83238d721e932b4645589660406c).
>> 2019-08-21 20:00:28,154 INFO
>> org.apache.flink.runtime.jobmaster.JobMaster                  -
>> Successfully ran initialization on master in 0 ms.
>> 2019-08-21 20:00:28,187 INFO
>> org.apache.flink.runtime.util.ZooKeeperUtils                  - Initialized
>> ZooKeeperCompletedCheckpointStore in
>> '/checkpoints/e5ea83238d721e932b4645589660406c'.
>> 2019-08-21 20:00:28,196 INFO
>> org.apache.flink.yarn.YarnResourceManager                     - Recovered 0
>> containers from previous attempts ([]).
>> 2019-08-21 20:00:28,199 INFO
>> org.apache.hadoop.yarn.client.api.impl.ContainerManagementProtocolProxy  -
>> yarn.client.max-cached-nodemanagers-proxies : 0
>> 2019-08-21 20:00:28,200 INFO
>> org.apache.flink.runtime.leaderelection.ZooKeeperLeaderElectionService  -
>> Starting ZooKeeperLeaderElectionService
>> ZooKeeperLeaderElectionService{leaderPath='/leader/resource_manager_lock'}.
>> 2019-08-21 20:00:28,203 INFO
>> org.apache.flink.runtime.jobmaster.JobMaster                  - No state
>> backend has been configured, using default (Memory / JobManager)
>> MemoryStateBackend (data in heap memory / checkpoints to JobManager)
>> (checkpoints: 'null', savepoints: 'null', asynchronous: TRUE, maxStateSize:
>> 5242880)
>> 2019-08-21 20:00:28,211 INFO
>> org.apache.flink.yarn.YarnResourceManager                     -
>> ResourceManager akka.tcp://flink@host53:59609/user/resourcemanager was
>> granted leadership with fencing token 97537c0d61bbf091e37cb6c45b784568
>> 2019-08-21 20:00:28,214 INFO
>> org.apache.flink.runtime.resourcemanager.slotmanager.SlotManager  -
>> Starting the SlotManager.
>> 2019-08-21 20:00:28,225 INFO
>> org.apache.flink.runtime.checkpoint.ZooKeeperCompletedCheckpointStore  -
>> Recovering checkpoints from ZooKeeper.
>> 2019-08-21 20:00:28,229 INFO
>> org.apache.flink.runtime.checkpoint.ZooKeeperCompletedCheckpointStore  -
>> Found 0 checkpoints in ZooKeeper.
>> 2019-08-21 20:00:28,229 INFO
>> org.apache.flink.runtime.checkpoint.ZooKeeperCompletedCheckpointStore  -
>> Trying to fetch 0 checkpoints from storage.
>> 2019-08-21 20:00:28,233 INFO
>> org.apache.flink.runtime.leaderelection.ZooKeeperLeaderElectionService  -
>> Starting ZooKeeperLeaderElectionService
>> ZooKeeperLeaderElectionService{leaderPath='/leader/e5ea83238d721e932b4645589660406c/job_manager_lock'}.
>> 2019-08-21 20:00:28,248 INFO
>> org.apache.flink.runtime.jobmaster.JobManagerRunner           - JobManager
>> runner for job StreamKafkaMain (e5ea83238d721e932b4645589660406c) was
>> granted leadership with session id 12c96c94-ebbd-4991-a457-6f0320d71bc1 at
>> akka.tcp://flink@host53:59609/user/jobmanager_0.
>> 2019-08-21 20:00:28,260 INFO
>> org.apache.flink.runtime.leaderretrieval.ZooKeeperLeaderRetrievalService  -
>> Starting ZooKeeperLeaderRetrievalService /leader/resource_manager_lock.
>> 2019-08-21 20:00:28,260 INFO
>> org.apache.flink.runtime.jobmaster.JobMaster                  - Starting
>> execution of job StreamKafkaMain (e5ea83238d721e932b4645589660406c) under
>> job master id a4576f0320d71bc112c96c94ebbd4991.
>> 2019-08-21 20:00:28,261 INFO
>> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Job
>> StreamKafkaMain (e5ea83238d721e932b4645589660406c) switched from state
>> CREATED to RUNNING.
>> 2019-08-21 20:00:28,266 INFO
>> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source:
>> Custom Source -> Flat Map -> Filter -> Filter -> Map ->
>> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) ->
>> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (1/4)
>> (ee39866835f7b7387f518e352787a532) switched from CREATED to SCHEDULED.
>> 2019-08-21 20:00:28,280 INFO
>> org.apache.flink.runtime.jobmaster.slotpool.SlotPoolImpl      - Cannot
>> serve slot request, no ResourceManager connected. Adding as pending request
>> [SlotRequestId{f6123820ffb8b3e906b876e9dea4497b}]
>> 2019-08-21 20:00:28,287 INFO
>> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source:
>> Custom Source -> Flat Map -> Filter -> Filter -> Map ->
>> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) ->
>> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (2/4)
>> (f1a893fc5e2440331c8e6db1874ad220) switched from CREATED to SCHEDULED.
>> 2019-08-21 20:00:28,288 INFO
>> org.apache.flink.runtime.jobmaster.slotpool.SlotPoolImpl      - Cannot
>> serve slot request, no ResourceManager connected. Adding as pending request
>> [SlotRequestId{fdbb2e71c9bc795452687396abff7bf8}]
>> 2019-08-21 20:00:28,288 INFO
>> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source:
>> Custom Source -> Flat Map -> Filter -> Filter -> Map ->
>> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) ->
>> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (3/4)
>> (4756a0450881325af5fb396a18dbb8ae) switched from CREATED to SCHEDULED.
>> 2019-08-21 20:00:28,289 INFO
>> org.apache.flink.runtime.jobmaster.slotpool.SlotPoolImpl      - Cannot
>> serve slot request, no ResourceManager connected. Adding as pending request
>> [SlotRequestId{13970fab9312acf9d5063b1d1a33aa0e}]
>> 2019-08-21 20:00:28,289 INFO
>> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source:
>> Custom Source -> Flat Map -> Filter -> Filter -> Map ->
>> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) ->
>> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (4/4)
>> (78f520582607e26b365fca483fc98d4c) switched from CREATED to SCHEDULED.
>> 2019-08-21 20:00:28,289 INFO
>> org.apache.flink.runtime.jobmaster.slotpool.SlotPoolImpl      - Cannot
>> serve slot request, no ResourceManager connected. Adding as pending request
>> [SlotRequestId{0621470f796aa3ea4291d1b8b423e428}]
>> 2019-08-21 20:00:28,289 INFO
>> org.apache.flink.runtime.executiongraph.ExecutionGraph        - window:
>> (TumblingGroupWindow('w$, 'rowtime, 60000.millis)), select: (COUNT(*) AS
>> pv, COUNT(DISTINCT curuserid) AS uv, start('w$) AS w$start, end('w$) AS
>> w$end, rowtime('w$) AS w$rowtime, proctime('w$) AS w$proctime) -> select:
>> (pv, uv, utc2local(w$end) AS EXPR$2) -> to: Row (1/1)
>> (0071e1c7c4f70646914329f81dcbd349) switched from CREATED to SCHEDULED.
>> 2019-08-21 20:00:28,291 INFO
>> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print
>> to Std. Out (1/4) (0a206444b10553cce42fa34ea0b15497) switched from CREATED
>> to SCHEDULED.
>> 2019-08-21 20:00:28,291 INFO
>> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print
>> to Std. Out (2/4) (a5d1bb759ee6c8001313fe39c7982a8b) switched from CREATED
>> to SCHEDULED.
>> 2019-08-21 20:00:28,291 INFO
>> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print
>> to Std. Out (3/4) (9543bbde6538ea9d407af3012fede5c8) switched from CREATED
>> to SCHEDULED.
>> 2019-08-21 20:00:28,291 INFO
>> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print
>> to Std. Out (4/4) (0e8c4210bd81e581fbf354b86397ce41) switched from CREATED
>> to SCHEDULED.
>> 2019-08-21 20:00:28,291 INFO
>> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map ->
>> Sink: Print to Std. Out (1/4) (39d437bceb033f5bd37747e9142c7f0e) switched
>> from CREATED to SCHEDULED.
>> 2019-08-21 20:00:28,291 INFO
>> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map ->
>> Sink: Print to Std. Out (2/4) (aa4f64edc4f10da2e8bdbf6ebf0aeb83) switched
>> from CREATED to SCHEDULED.
>> 2019-08-21 20:00:28,291 INFO
>> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map ->
>> Sink: Print to Std. Out (3/4) (f109918ce195d1c88c2c63a44934217f) switched
>> from CREATED to SCHEDULED.
>> 2019-08-21 20:00:28,292 INFO
>> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map ->
>> Sink: Print to Std. Out (4/4) (9aa5bb28f3ba75afc1de47bd9a4e5d01) switched
>> from CREATED to SCHEDULED.
>> 2019-08-21 20:00:28,295 INFO
>> org.apache.flink.runtime.jobmaster.JobMaster                  - Connecting
>> to ResourceManager akka.tcp://flink@host53
>> :59609/user/resourcemanager(97537c0d61bbf091e37cb6c45b784568)
>> 2019-08-21 20:00:28,300 INFO
>> org.apache.flink.runtime.jobmaster.JobMaster                  - Resolved
>> ResourceManager address, beginning registration
>> 2019-08-21 20:00:28,300 INFO
>> org.apache.flink.runtime.jobmaster.JobMaster                  -
>> Registration at ResourceManager attempt 1 (timeout=100ms)
>> 2019-08-21 20:00:28,304 INFO
>> org.apache.flink.runtime.leaderretrieval.ZooKeeperLeaderRetrievalService  -
>> Starting ZooKeeperLeaderRetrievalService
>> /leader/e5ea83238d721e932b4645589660406c/job_manager_lock.
>> 2019-08-21 20:00:28,304 INFO
>> org.apache.flink.yarn.YarnResourceManager                     - Registering
>> job manager [hidden email]://flink@host53:59609/user/jobmanager_0
>> for job e5ea83238d721e932b4645589660406c.
>> 2019-08-21 20:00:28,310 INFO
>> org.apache.flink.yarn.YarnResourceManager                     - Registered
>> job manager [hidden email]://flink@host53:59609/user/jobmanager_0
>> for job e5ea83238d721e932b4645589660406c.
>> 2019-08-21 20:00:28,313 INFO
>> org.apache.flink.runtime.jobmaster.JobMaster                  - JobManager
>> successfully registered at ResourceManager, leader id:
>> 97537c0d61bbf091e37cb6c45b784568.
>> 2019-08-21 20:00:28,314 INFO
>> org.apache.flink.runtime.jobmaster.slotpool.SlotPoolImpl      - Requesting
>> new slot [SlotRequestId{13970fab9312acf9d5063b1d1a33aa0e}] and profile
>> ResourceProfile{cpuCores=-1.0, heapMemoryInMB=-1, directMemoryInMB=0,
>> nativeMemoryInMB=0, networkMemoryInMB=0} from resource manager.
>> 2019-08-21 20:00:28,315 INFO
>> org.apache.flink.yarn.YarnResourceManager                     - Request
>> slot with profile ResourceProfile{cpuCores=-1.0, heapMemoryInMB=-1,
>> directMemoryInMB=0, nativeMemoryInMB=0, networkMemoryInMB=0} for job
>> e5ea83238d721e932b4645589660406c with allocation id
>> faa6a521758e5be4b3dc1955ca0f51b4.
>> 2019-08-21 20:00:28,316 INFO
>> org.apache.flink.runtime.jobmaster.slotpool.SlotPoolImpl      - Requesting
>> new slot [SlotRequestId{fdbb2e71c9bc795452687396abff7bf8}] and profile
>> ResourceProfile{cpuCores=-1.0, heapMemoryInMB=-1, directMemoryInMB=0,
>> nativeMemoryInMB=0, networkMemoryInMB=0} from resource manager.
>> 2019-08-21 20:00:28,316 INFO
>> org.apache.flink.runtime.jobmaster.slotpool.SlotPoolImpl      - Requesting
>> new slot [SlotRequestId{0621470f796aa3ea4291d1b8b423e428}] and profile
>> ResourceProfile{cpuCores=-1.0, heapMemoryInMB=-1, directMemoryInMB=0,
>> nativeMemoryInMB=0, networkMemoryInMB=0} from resource manager.
>> 2019-08-21 20:00:28,316 INFO
>> org.apache.flink.runtime.jobmaster.slotpool.SlotPoolImpl      - Requesting
>> new slot [SlotRequestId{f6123820ffb8b3e906b876e9dea4497b}] and profile
>> ResourceProfile{cpuCores=-1.0, heapMemoryInMB=-1, directMemoryInMB=0,
>> nativeMemoryInMB=0, networkMemoryInMB=0} from resource manager.
>> 2019-08-21 20:00:28,321 INFO
>> org.apache.flink.yarn.YarnResourceManager                     - Requesting
>> new TaskExecutor container with resources <memory:8192, vCores:2>. Number
>> pending requests 1.
>> 2019-08-21 20:00:28,322 INFO
>> org.apache.flink.yarn.YarnResourceManager                     - Request
>> slot with profile ResourceProfile{cpuCores=-1.0, heapMemoryInMB=-1,
>> directMemoryInMB=0, nativeMemoryInMB=0, networkMemoryInMB=0} for job
>> e5ea83238d721e932b4645589660406c with allocation id
>> a2b419aea4a373686eeed6c46163a528.
>> 2019-08-21 20:00:28,323 INFO
>> org.apache.flink.yarn.YarnResourceManager                     - Request
>> slot with profile ResourceProfile{cpuCores=-1.0, heapMemoryInMB=-1,
>> directMemoryInMB=0, nativeMemoryInMB=0, networkMemoryInMB=0} for job
>> e5ea83238d721e932b4645589660406c with allocation id
>> cbc11737ce6729cfe45f24c43180b5c8.
>> 2019-08-21 20:00:28,323 INFO
>> org.apache.flink.yarn.YarnResourceManager                     - Requesting
>> new TaskExecutor container with resources <memory:8192, vCores:2>. Number
>> pending requests 2.
>> 2019-08-21 20:00:28,323 INFO
>> org.apache.flink.yarn.YarnResourceManager                     - Request
>> slot with profile ResourceProfile{cpuCores=-1.0, heapMemoryInMB=-1,
>> directMemoryInMB=0, nativeMemoryInMB=0, networkMemoryInMB=0} for job
>> e5ea83238d721e932b4645589660406c with allocation id
>> f6566567c173a55b2bc52b72bcb24ff8.
>> 2019-08-21 20:00:31,597 INFO
>> org.apache.flink.runtime.checkpoint.CheckpointCoordinator     - Checkpoint
>> triggering task Source: Custom Source -> Flat Map -> Filter -> Filter ->
>> Map -> Timestamps/Watermarks -> from: (request, curuserid, timelong,
>> rowtime) -> select: (rowtime, 0 AS $f1, curuserid) -> time attribute:
>> (rowtime) (1/4) of job e5ea83238d721e932b4645589660406c is not in state
>> RUNNING but SCHEDULED instead. Aborting checkpoint.
>> 2019-08-21 20:00:33,749 INFO
>> org.apache.hadoop.yarn.client.api.impl.AMRMClientImpl         - Received
>> new token for : host53:59938
>> 2019-08-21 20:00:33,749 INFO
>> org.apache.hadoop.yarn.client.api.impl.AMRMClientImpl         - Received
>> new token for : host52:33246
>> 2019-08-21 20:00:33,765 INFO
>> org.apache.flink.yarn.YarnResourceManager                     - Received
>> new container: container_1566383236573_0003_01_000002 - Remaining pending
>> container requests: 2
>> 2019-08-21 20:00:33,765 INFO
>> org.apache.flink.yarn.YarnResourceManager                     - Removing
>> container request Capability[<memory:8192, vCores:2>]Priority[1]. Pending
>> container requests 1.
>> 2019-08-21 20:00:34,009 INFO
>> org.apache.flink.yarn.YarnResourceManager                     - Creating
>> container launch context for TaskManagers
>> 2019-08-21 20:00:34,010 INFO
>> org.apache.flink.yarn.YarnResourceManager                     - Starting
>> TaskManagers
>> 2019-08-21 20:00:34,021 INFO
>> org.apache.hadoop.yarn.client.api.impl.ContainerManagementProtocolProxy  -
>> Opening proxy : host53:59938
>> 2019-08-21 20:00:34,066 INFO
>> org.apache.flink.yarn.YarnResourceManager                     - Received
>> new container: container_1566383236573_0003_01_000003 - Remaining pending
>> container requests: 1
>> 2019-08-21 20:00:34,066 INFO
>> org.apache.flink.yarn.YarnResourceManager                     - Removing
>> container request Capability[<memory:8192, vCores:2>]Priority[1]. Pending
>> container requests 0.
>> 2019-08-21 20:00:34,088 INFO
>> org.apache.flink.yarn.YarnResourceManager                     - Creating
>> container launch context for TaskManagers
>> 2019-08-21 20:00:34,089 INFO
>> org.apache.flink.yarn.YarnResourceManager                     - Starting
>> TaskManagers
>> 2019-08-21 20:00:34,090 INFO
>> org.apache.hadoop.yarn.client.api.impl.ContainerManagementProtocolProxy  -
>> Opening proxy : host52:33246
>> 2019-08-21 20:00:36,597 INFO
>> org.apache.flink.runtime.checkpoint.CheckpointCoordinator     - Checkpoint
>> triggering task Source: Custom Source -> Flat Map -> Filter -> Filter ->
>> Map -> Timestamps/Watermarks -> from: (request, curuserid, timelong,
>> rowtime) -> select: (rowtime, 0 AS $f1, curuserid) -> time attribute:
>> (rowtime) (1/4) of job e5ea83238d721e932b4645589660406c is not in state
>> RUNNING but SCHEDULED instead. Aborting checkpoint.
>> 2019-08-21 20:00:37,856 INFO
>> org.apache.flink.yarn.YarnResourceManager                     - Registering
>> TaskManager with ResourceID container_1566383236573_0003_01_000002
>> (akka.tcp://flink@host53:34608/user/taskmanager_0) at ResourceManager
>> 2019-08-21 20:00:39,573 INFO
>> org.apache.flink.yarn.YarnResourceManager                     - Registering
>> TaskManager with ResourceID container_1566383236573_0003_01_000003
>> (akka.tcp://flink@host52:32931/user/taskmanager_0) at ResourceManager
>> 2019-08-21 20:00:39,659 INFO
>> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source:
>> Custom Source -> Flat Map -> Filter -> Filter -> Map ->
>> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) ->
>> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (1/4)
>> (ee39866835f7b7387f518e352787a532) switched from SCHEDULED to DEPLOYING.
>> 2019-08-21 20:00:39,659 INFO
>> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying
>> Source: Custom Source -> Flat Map -> Filter -> Filter -> Map ->
>> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) ->
>> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (1/4)
>> (attempt #0) to container_1566383236573_0003_01_000002 @ host53
>> (dataPort=52456)
>> 2019-08-21 20:00:39,664 INFO
>> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source:
>> Custom Source -> Flat Map -> Filter -> Filter -> Map ->
>> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) ->
>> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (2/4)
>> (f1a893fc5e2440331c8e6db1874ad220) switched from SCHEDULED to DEPLOYING.
>> 2019-08-21 20:00:39,664 INFO
>> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying
>> Source: Custom Source -> Flat Map -> Filter -> Filter -> Map ->
>> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) ->
>> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (2/4)
>> (attempt #0) to container_1566383236573_0003_01_000002 @ host53
>> (dataPort=52456)
>> 2019-08-21 20:00:39,665 INFO
>> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source:
>> Custom Source -> Flat Map -> Filter -> Filter -> Map ->
>> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) ->
>> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (3/4)
>> (4756a0450881325af5fb396a18dbb8ae) switched from SCHEDULED to DEPLOYING.
>> 2019-08-21 20:00:39,666 INFO
>> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying
>> Source: Custom Source -> Flat Map -> Filter -> Filter -> Map ->
>> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) ->
>> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (3/4)
>> (attempt #0) to container_1566383236573_0003_01_000003 @ host52
>> (dataPort=39650)
>> 2019-08-21 20:00:39,666 INFO
>> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source:
>> Custom Source -> Flat Map -> Filter -> Filter -> Map ->
>> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) ->
>> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (4/4)
>> (78f520582607e26b365fca483fc98d4c) switched from SCHEDULED to DEPLOYING.
>> 2019-08-21 20:00:39,666 INFO
>> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying
>> Source: Custom Source -> Flat Map -> Filter -> Filter -> Map ->
>> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) ->
>> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (4/4)
>> (attempt #0) to container_1566383236573_0003_01_000003 @ host52
>> (dataPort=39650)
>> 2019-08-21 20:00:39,666 INFO
>> org.apache.flink.runtime.executiongraph.ExecutionGraph        - window:
>> (TumblingGroupWindow('w$, 'rowtime, 60000.millis)), select: (COUNT(*) AS
>> pv, COUNT(DISTINCT curuserid) AS uv, start('w$) AS w$start, end('w$) AS
>> w$end, rowtime('w$) AS w$rowtime, proctime('w$) AS w$proctime) -> select:
>> (pv, uv, utc2local(w$end) AS EXPR$2) -> to: Row (1/1)
>> (0071e1c7c4f70646914329f81dcbd349) switched from SCHEDULED to DEPLOYING.
>> 2019-08-21 20:00:39,666 INFO
>> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying
>> window: (TumblingGroupWindow('w$, 'rowtime, 60000.millis)), select:
>> (COUNT(*) AS pv, COUNT(DISTINCT curuserid) AS uv, start('w$) AS w$start,
>> end('w$) AS w$end, rowtime('w$) AS w$rowtime, proctime('w$) AS w$proctime)
>> -> select: (pv, uv, utc2local(w$end) AS EXPR$2) -> to: Row (1/1) (attempt
>> #0) to container_1566383236573_0003_01_000002 @ host53 (dataPort=52456)
>> 2019-08-21 20:00:39,670 INFO
>> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print
>> to Std. Out (1/4) (0a206444b10553cce42fa34ea0b15497) switched from
>> SCHEDULED to DEPLOYING.
>> 2019-08-21 20:00:39,670 INFO
>> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying
>> Sink: Print to Std. Out (1/4) (attempt #0) to
>> container_1566383236573_0003_01_000003 @ host52 (dataPort=39650)
>> 2019-08-21 20:00:39,671 INFO
>> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print
>> to Std. Out (2/4) (a5d1bb759ee6c8001313fe39c7982a8b) switched from
>> SCHEDULED to DEPLOYING.
>> 2019-08-21 20:00:39,672 INFO
>> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying
>> Sink: Print to Std. Out (2/4) (attempt #0) to
>> container_1566383236573_0003_01_000003 @ host52 (dataPort=39650)
>> 2019-08-21 20:00:39,672 INFO
>> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print
>> to Std. Out (3/4) (9543bbde6538ea9d407af3012fede5c8) switched from
>> SCHEDULED to DEPLOYING.
>> 2019-08-21 20:00:39,672 INFO
>> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying
>> Sink: Print to Std. Out (3/4) (attempt #0) to
>> container_1566383236573_0003_01_000002 @ host53 (dataPort=52456)
>> 2019-08-21 20:00:39,672 INFO
>> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print
>> to Std. Out (4/4) (0e8c4210bd81e581fbf354b86397ce41) switched from
>> SCHEDULED to DEPLOYING.
>> 2019-08-21 20:00:39,673 INFO
>> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying
>> Sink: Print to Std. Out (4/4) (attempt #0) to
>> container_1566383236573_0003_01_000002 @ host53 (dataPort=52456)
>> 2019-08-21 20:00:39,673 INFO
>> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map ->
>> Sink: Print to Std. Out (1/4) (39d437bceb033f5bd37747e9142c7f0e) switched
>> from SCHEDULED to DEPLOYING.
>> 2019-08-21 20:00:39,673 INFO
>> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying
>> Map -> Sink: Print to Std. Out (1/4) (attempt #0) to
>> container_1566383236573_0003_01_000003 @ host52 (dataPort=39650)
>> 2019-08-21 20:00:39,675 INFO
>> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map ->
>> Sink: Print to Std. Out (2/4) (aa4f64edc4f10da2e8bdbf6ebf0aeb83) switched
>> from SCHEDULED to DEPLOYING.
>> 2019-08-21 20:00:39,675 INFO
>> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying
>> Map -> Sink: Print to Std. Out (2/4) (attempt #0) to
>> container_1566383236573_0003_01_000003 @ host52 (dataPort=39650)
>> 2019-08-21 20:00:39,675 INFO
>> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map ->
>> Sink: Print to Std. Out (3/4) (f109918ce195d1c88c2c63a44934217f) switched
>> from SCHEDULED to DEPLOYING.
>> 2019-08-21 20:00:39,675 INFO
>> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying
>> Map -> Sink: Print to Std. Out (3/4) (attempt #0) to
>> container_1566383236573_0003_01_000002 @ host53 (dataPort=52456)
>> 2019-08-21 20:00:39,676 INFO
>> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map ->
>> Sink: Print to Std. Out (4/4) (9aa5bb28f3ba75afc1de47bd9a4e5d01) switched
>> from SCHEDULED to DEPLOYING.
>> 2019-08-21 20:00:39,676 INFO
>> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Deploying
>> Map -> Sink: Print to Std. Out (4/4) (attempt #0) to
>> container_1566383236573_0003_01_000002 @ host53 (dataPort=52456)
>> 2019-08-21 20:00:39,782 INFO
>> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print
>> to Std. Out (4/4) (0e8c4210bd81e581fbf354b86397ce41) switched from
>> DEPLOYING to RUNNING.
>> 2019-08-21 20:00:39,783 INFO
>> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print
>> to Std. Out (3/4) (9543bbde6538ea9d407af3012fede5c8) switched from
>> DEPLOYING to RUNNING.
>> 2019-08-21 20:00:39,784 INFO
>> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map ->
>> Sink: Print to Std. Out (4/4) (9aa5bb28f3ba75afc1de47bd9a4e5d01) switched
>> from DEPLOYING to RUNNING.
>> 2019-08-21 20:00:39,787 INFO
>> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map ->
>> Sink: Print to Std. Out (3/4) (f109918ce195d1c88c2c63a44934217f) switched
>> from DEPLOYING to RUNNING.
>> 2019-08-21 20:00:39,806 INFO
>> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source:
>> Custom Source -> Flat Map -> Filter -> Filter -> Map ->
>> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) ->
>> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (2/4)
>> (f1a893fc5e2440331c8e6db1874ad220) switched from DEPLOYING to RUNNING.
>> 2019-08-21 20:00:39,809 INFO
>> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source:
>> Custom Source -> Flat Map -> Filter -> Filter -> Map ->
>> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) ->
>> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (1/4)
>> (ee39866835f7b7387f518e352787a532) switched from DEPLOYING to RUNNING.
>> 2019-08-21 20:00:39,809 INFO
>> org.apache.flink.runtime.executiongraph.ExecutionGraph        - window:
>> (TumblingGroupWindow('w$, 'rowtime, 60000.millis)), select: (COUNT(*) AS
>> pv, COUNT(DISTINCT curuserid) AS uv, start('w$) AS w$start, end('w$) AS
>> w$end, rowtime('w$) AS w$rowtime, proctime('w$) AS w$proctime) -> select:
>> (pv, uv, utc2local(w$end) AS EXPR$2) -> to: Row (1/1)
>> (0071e1c7c4f70646914329f81dcbd349) switched from DEPLOYING to RUNNING.
>> 2019-08-21 20:00:39,991 INFO
>> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print
>> to Std. Out (1/4) (0a206444b10553cce42fa34ea0b15497) switched from
>> DEPLOYING to RUNNING.
>> 2019-08-21 20:00:39,992 INFO
>> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink: Print
>> to Std. Out (2/4) (a5d1bb759ee6c8001313fe39c7982a8b) switched from
>> DEPLOYING to RUNNING.
>> 2019-08-21 20:00:39,996 INFO
>> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map ->
>> Sink: Print to Std. Out (1/4) (39d437bceb033f5bd37747e9142c7f0e) switched
>> from DEPLOYING to RUNNING.
>> 2019-08-21 20:00:39,997 INFO
>> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map ->
>> Sink: Print to Std. Out (2/4) (aa4f64edc4f10da2e8bdbf6ebf0aeb83) switched
>> from DEPLOYING to RUNNING.
>> 2019-08-21 20:00:40,005 INFO
>> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source:
>> Custom Source -> Flat Map -> Filter -> Filter -> Map ->
>> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) ->
>> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (4/4)
>> (78f520582607e26b365fca483fc98d4c) switched from DEPLOYING to RUNNING.
>> 2019-08-21 20:00:40,006 INFO
>> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source:
>> Custom Source -> Flat Map -> Filter -> Filter -> Map ->
>> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime) ->
>> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime) (3/4)
>> (4756a0450881325af5fb396a18dbb8ae) switched from DEPLOYING to RUNNING.
>> 2019-08-21 20:00:41,605 INFO
>> org.apache.flink.runtime.checkpoint.CheckpointCoordinator     - Triggering
>> checkpoint 1 @ 1566388841597 for job e5ea83238d721e932b4645589660406c.
>> 2019-08-21 20:00:42,466 INFO
>> org.apache.flink.runtime.checkpoint.CheckpointCoordinator     - Completed
>> checkpoint 1 for job e5ea83238d721e932b4645589660406c (7994 bytes in 821
>> ms).
>>
>>
>>
>> ------------------ Original Message ------------------
>> From: "Yun Tang"<[hidden email]>;
>> Sent: Thursday, August 22, 2019, 11:08 AM
>> To: "user-zh"<[hidden email]>;
>>
>> Subject: Re: Flink job waits 10 minutes at startup
>>
>>
>>
>> Hi
>>
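>> A Flink on YARN job can be slow to start for many reasons, for example waiting
>> because resources are insufficient, or a container exiting right after it was
>> requested. The default slot request timeout is 5 minutes, so I suspect your job
>> hit a slot request timeout and then requested the slot again. Please provide
>> the JobManager log so that we can analyze this further.
>>
>> Best regards,
>> Yun Tang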
>> ________________________________
>> From: 々守护々 <[hidden email]>
>> Sent: Thursday, August 22, 2019 11:04
>> To: user-zh <[hidden email]>
>> Subject: Flink job waits 10 minutes at startup
>>
>> Hello, Flink community experts!
>>       I am using Flink SQL (Flink 1.8.1), and I adjusted the memory parameters in yarn-site.xml on the Hadoop cluster:
>>         <property>
>>                 <name>yarn.nodemanager.resource.memory-mb</name>
>>                 <value>16384</value>
>>         </property>
>>         <property>
>>                 <name>yarn.scheduler.minimum-allocation-mb</name>
>>                 <value>1024</value>
>>         </property>
>>         <property>
>>                 <name>yarn.scheduler.maximum-allocation-mb</name>
>>                 <value>8192</value>
>>         </property>
>>
>>
>>
>> Then I start the Flink on YARN job with these parameters: ./flink-1.8.1/bin/flink run -m yarn-cluster -p
>> 4 -d -c streaming.StreamKafkaMain -yn 4 -ys 2 -yjm 2048 -ytm 8192  -ynm
>> bigdata6 ./flink_bigdata6.jar
>> It does start successfully in the end, but startup takes 10 minutes, which is far too long. Does anyone know how to fix this?
>>
>


Re: Re: Flink job waits 10 minutes at startup

tison
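You are basically stuck at the step that uploads the user jar; going from submitting the application to a successful deployment takes only a moment.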

2019-08-22 11:38:02,185 INFO
org.apache.flink.yarn.AbstractYarnClusterDescriptor           - Submitting
application master application_1566383236573_0004
2019-08-22 11:38:02,226 INFO
org.apache.hadoop.yarn.client.api.impl.YarnClientImpl         - Submitted
application application_1566383236573_0004

What Flink does during the gap before these lines is upload the jars that the user job depends on.
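
To see where the ten minutes go, the timestamps in the client log can be compared directly. The following is a minimal sketch in Python, not part of Flink: the helper names `parse_ts` and `largest_gap` are made up for illustration, and the log messages are abbreviated from the terminal output quoted in this thread. It finds the largest pause between consecutive log lines.

```python
from datetime import datetime

# Timestamps copied from the terminal output quoted in this thread;
# the message text after each timestamp is abbreviated.
LOG_LINES = [
    "2019-08-22 11:28:21,767 INFO  Cluster specification",
    "2019-08-22 11:28:22,317 WARN  conf dir contains both LOG4J and Logback files",
    "2019-08-22 11:38:02,185 INFO  Submitting application master",
    "2019-08-22 11:38:02,226 INFO  Submitted application",
    "2019-08-22 11:38:07,244 INFO  YARN application has been deployed successfully",
]


def parse_ts(line):
    """Parse the leading 'YYYY-MM-DD HH:MM:SS,mmm' timestamp of a log line."""
    return datetime.strptime(line[:23], "%Y-%m-%d %H:%M:%S,%f")


def largest_gap(lines):
    """Return (gap in seconds, line before the gap, line after the gap)."""
    stamps = [parse_ts(line) for line in lines]
    gaps = [
        (later - earlier, index)
        for index, (earlier, later) in enumerate(zip(stamps, stamps[1:]))
    ]
    gap, index = max(gaps)
    return gap.total_seconds(), lines[index], lines[index + 1]


if __name__ == "__main__":
    seconds, before, after = largest_gap(LOG_LINES)
    print(f"largest gap: {seconds:.3f} s")
    print(f"  before: {before}")
    print(f"  after:  {after}")
```

With these lines, the largest pause falls between the configuration warning at 11:28:22 and "Submitting application master" at 11:38:02, which is the window in which the client is still preparing and uploading files before it submits to YARN.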

Best,
tison.


々守护々 <[hidden email]> wrote on Thu, Aug 22, 2019 at 11:49 AM:

> This is the log printed in the terminal; only after ten minutes does it reach - Submitting application master
> application_1566383236573_0004
>
>
>
> 2019-08-22 11:28:21,766 WARN
> org.apache.flink.yarn.AbstractYarnClusterDescriptor           - This YARN
> session requires 34816MB of memory in the cluster. There are currently only
> 26624MB available.
> The Flink YARN client will try to allocate the YARN session, but maybe not
> all TaskManagers are connecting from the beginning because the resources
> are currently not available in the cluster. The allocation might take more
> time than usual because the Flink YARN client needs to wait until the
> resources become available.
> 2019-08-22 11:28:21,766 WARN
> org.apache.flink.yarn.AbstractYarnClusterDescriptor           - There is
> not enough memory available in the YARN cluster. The TaskManager(s) require
> 8192MB each. NodeManagers available: [14336, 12288]
> After allocating the JobManager (2048MB) and (2/4) TaskManagers, the
> following NodeManagers are available: [4096, 4096]
> The Flink YARN client will try to allocate the YARN session, but maybe not
> all TaskManagers are connecting from the beginning because the resources
> are currently not available in the cluster. The allocation might take more
> time than usual because the Flink YARN client needs to wait until the
> resources become available.
> 2019-08-22 11:28:21,766 WARN
> org.apache.flink.yarn.AbstractYarnClusterDescriptor           - There is
> not enough memory available in the YARN cluster. The TaskManager(s) require
> 8192MB each. NodeManagers available: [14336, 12288]
> After allocating the JobManager (2048MB) and (3/4) TaskManagers, the
> following NodeManagers are available: [4096, 4096]
> The Flink YARN client will try to allocate the YARN session, but maybe not
> all TaskManagers are connecting from the beginning because the resources
> are currently not available in the cluster. The allocation might take more
> time than usual because the Flink YARN client needs to wait until the
> resources become available.
> 2019-08-22 11:28:21,767 INFO
> org.apache.flink.yarn.AbstractYarnClusterDescriptor           - Cluster
> specification: ClusterSpecification{masterMemoryMB=2048,
> taskManagerMemoryMB=8192, numberTaskManagers=4, slotsPerTaskManager=2}
> 2019-08-22 11:28:22,317 WARN
> org.apache.flink.yarn.AbstractYarnClusterDescriptor           - The
> configuration directory ('/usr/flink/flink-1.8.1/conf') contains both LOG4J
> and Logback configuration files. Please delete or rename one of them.
> 2019-08-22 11:38:02,185 INFO
> org.apache.flink.yarn.AbstractYarnClusterDescriptor           - Submitting
> application master application_1566383236573_0004
> 2019-08-22 11:38:02,226 INFO
> org.apache.hadoop.yarn.client.api.impl.YarnClientImpl         - Submitted
> application application_1566383236573_0004
> 2019-08-22 11:38:02,226 INFO
> org.apache.flink.yarn.AbstractYarnClusterDescriptor           - Waiting for
> the cluster to be allocated
> 2019-08-22 11:38:02,228 INFO
> org.apache.flink.yarn.AbstractYarnClusterDescriptor           - Deploying
> cluster, current state ACCEPTED
> 2019-08-22 11:38:07,244 INFO
> org.apache.flink.yarn.AbstractYarnClusterDescriptor           - YARN
> application has been deployed successfully.
> 2019-08-22 11:38:07,244 INFO
> org.apache.flink.yarn.AbstractYarnClusterDescriptor           - The Flink
> YARN client has been started in detached mode. In order to stop Flink on
> YARN, use the following command or a YARN web interface to stop it:
> yarn application -kill application_1566383236573_0004
> Please also note that the temporary files of the YARN session in the home
> directory will not be removed.
> Job has been submitted with JobID c16c4fc1556ccb2ceaaa2f0e8f32ec88
>
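The numbers in the client warnings above can be checked by hand. The sketch below is only an illustration: it uses a simple greedy placement (JobManager on the node with the most free memory, then TaskManagers first-fit), which is an assumption and not Flink's actual code. The function names `total_required_mb` and `simulate_allocation` are hypothetical. With the values from the log it gives the same totals and the same leftover per NodeManager.

```python
def total_required_mb(jm_mb, tm_mb, tm_count):
    """Memory the whole session asks for: one JobManager plus all TaskManagers."""
    return jm_mb + tm_mb * tm_count


def simulate_allocation(node_free_mb, jm_mb, tm_mb, tm_count):
    """Greedy placement sketch (an assumption, not Flink's real algorithm).

    Returns (number of TaskManagers that fit, free memory left per node).
    """
    free = list(node_free_mb)
    # Put the JobManager on the node that currently has the most free memory.
    jm_node = max(range(len(free)), key=lambda i: free[i])
    if free[jm_node] < jm_mb:
        return 0, free
    free[jm_node] -= jm_mb

    placed = 0
    for _ in range(tm_count):
        for i, mb in enumerate(free):
            if mb >= tm_mb:
                free[i] -= tm_mb
                placed += 1
                break
        else:
            # No node can hold another TaskManager of this size.
            break
    return placed, free


# Values taken from the client log: -yjm 2048 -ytm 8192, 4 TaskManagers,
# NodeManagers reporting [14336, 12288] MB free.
required = total_required_mb(2048, 8192, 4)
available = sum([14336, 12288])
placed, leftover = simulate_allocation([14336, 12288], 2048, 8192, 4)
print(required, available, placed, leftover)
```

So only two 8192MB containers fit at once on this cluster, which is what the "(2/4) TaskManagers" warning is saying.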
> ------------------ Original Message ------------------
> *From:* "Zili Chen"<[hidden email]>;
> *Sent:* Thursday, August 22, 2019, 11:37 AM
> *To:* "user-zh"<[hidden email]>;
> *Subject:* Re: Re: Flink takes 10 minutes to start
>
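> By "stuck there", do you mean that the terminal output of flink run stopped moving? Check in that terminal output when YARN accepted your application. I suspect the
> YARN cluster was busy and did not respond for 10 minutes.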
>
> Best,
> tison.
>
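One way to find where the client output stalls is to look for the largest gap between consecutive log timestamps. The sketch below assumes the log4j timestamp layout seen in this thread (`yyyy-MM-dd HH:mm:ss,SSS`). The helper name `largest_gap` is hypothetical.

```python
import re
from datetime import datetime

# Matches timestamps such as "2019-08-22 11:28:22,317" anywhere in a line.
TIMESTAMP = re.compile(r"(\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2},\d{3})")


def largest_gap(lines):
    """Return (gap_seconds, before, after) for the longest pause, or None."""
    stamps = []
    for line in lines:
        match = TIMESTAMP.search(line)
        if match:
            stamps.append(datetime.strptime(match.group(1), "%Y-%m-%d %H:%M:%S,%f"))

    best = None
    for before, after in zip(stamps, stamps[1:]):
        gap = (after - before).total_seconds()
        if best is None or gap > best[0]:
            best = (gap, before, after)
    return best


# Timestamps copied from the client log quoted in this thread.
sample = [
    "> 2019-08-22 11:28:21,767 INFO",
    "> 2019-08-22 11:28:22,317 WARN",
    "> 2019-08-22 11:38:02,185 INFO",
    "> 2019-08-22 11:38:02,226 INFO",
]
print(largest_gap(sample))
```

On the client log in this thread the longest pause falls between the 11:28:22,317 warning and the 11:38:02,185 "Submitting application master" line, so the wait happens before the application is submitted to YARN.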
>
> Zili Chen <[hidden email]> wrote on Thursday, August 22, 2019 at 11:35 AM:
>
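> > user-zh does not support inline images. Upload the image to third-party storage and paste a link, or, as far as I remember, you can send it as an email attachment.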
> >
> > Best,
> > tison.
> >
> >
> > 々守护々 <[hidden email]> wrote on Thursday, August 22, 2019 at 11:33 AM:
> >
> >> Here is what happens: at startup the client log stops at that point, and the YARN application is only created after waiting 10 minutes.
> >> <http://hadoop.changba.com/cluster/app/application_1566383236573_0003>
> >>
> >>
> >> ------------------ Original Message ------------------
> >> *From:* "Yun Tang"<[hidden email]>;
> >> *Sent:* Thursday, August 22, 2019, 11:23 AM
> >> *To:* "user-zh"<[hidden email]>;
> >> *Subject:* Re: Re: Flink takes 10 minutes to start
> >>
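> >> The start time is 20:00:25, and by 20:00:42 all tasks were running and the first checkpoint had even
> >> completed. That is only 17 seconds in total, so where does the 10-minute problem come from?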
> >> ________________________________
> >> From: 々守护々 <[hidden email]>
> >> Sent: Thursday, August 22, 2019 11:18
> >> To: user-zh <[hidden email]>
> >> Subject: Re: Flink takes 10 minutes to start
> >>
> >> Hello, this is my JobManager startup log. Please take a look, thanks!
> >>
> >>
> >> 2019-08-21 20:00:25,428 INFO
> >> org.apache.flink.runtime.entrypoint.ClusterEntrypoint         -
> >>
> --------------------------------------------------------------------------------
> >> 2019-08-21 20:00:25,430 INFO
> >> org.apache.flink.runtime.entrypoint.ClusterEntrypoint         -
> Registered
> >> UNIX signal handlers for [TERM, HUP, INT]
> >> 2019-08-21 20:00:25,433 INFO
> >> org.apache.flink.runtime.entrypoint.ClusterEntrypoint         - YARN
> daemon
> >> is running as: root Yarn client user obtainer: root
> >> 2019-08-21 20:00:25,437 INFO
> >> org.apache.flink.configuration.GlobalConfiguration            - Loading
> >> configuration property: high-availability.cluster-id,
> >> application_1566383236573_0003
> >> 2019-08-21 20:00:25,437 INFO
> >> org.apache.flink.configuration.GlobalConfiguration            - Loading
> >> configuration property: jobmanager.rpc.address, localhost
> >> 2019-08-21 20:00:25,437 INFO
> >> org.apache.flink.configuration.GlobalConfiguration            - Loading
> >> configuration property: jobmanager.rpc.port, 6123
> >> 2019-08-21 20:00:25,437 INFO
> >> org.apache.flink.configuration.GlobalConfiguration            - Loading
> >> configuration property: high-availability.zookeeper.path.root,
> /flink-yarn
> >> 2019-08-21 20:00:25,437 INFO
> >> org.apache.flink.configuration.GlobalConfiguration            - Loading
> >> configuration property: high-availability.storageDir,
> >> hdfs://host51:9000/flink/ha-yarn
> >> 2019-08-21 20:00:25,438 INFO
> >> org.apache.flink.configuration.GlobalConfiguration            - Loading
> >> configuration property: high-availability.zookeeper.quorum, host51:2181
> >> 2019-08-21 20:00:25,438 INFO
> >> org.apache.flink.configuration.GlobalConfiguration            - Loading
> >> configuration property: internal.cluster.execution-mode, DETACHED
> >> 2019-08-21 20:00:25,438 INFO
> >> org.apache.flink.configuration.GlobalConfiguration            - Loading
> >> configuration property: high-availability, zookeeper
> >> 2019-08-21 20:00:25,438 INFO
> >> org.apache.flink.configuration.GlobalConfiguration            - Loading
> >> configuration property: parallelism.default, 1
> >> 2019-08-21 20:00:25,438 INFO
> >> org.apache.flink.configuration.GlobalConfiguration            - Loading
> >> configuration property: yarn.application-attempts, 10
> >> 2019-08-21 20:00:25,438 INFO
> >> org.apache.flink.configuration.GlobalConfiguration            - Loading
> >> configuration property: taskmanager.numberOfTaskSlots, 2
> >> 2019-08-21 20:00:25,439 INFO
> >> org.apache.flink.configuration.GlobalConfiguration            - Loading
> >> configuration property: taskmanager.heap.size, 8192m
> >> 2019-08-21 20:00:25,439 INFO
> >> org.apache.flink.configuration.GlobalConfiguration            - Loading
> >> configuration property: jobmanager.heap.size, 2048m
> >> 2019-08-21 20:00:25,465 WARN
> >> org.apache.flink.configuration.Configuration                  - Config
> uses
> >> deprecated configuration key 'web.port' instead of proper key
> >> 'rest.bind-port'
> >> 2019-08-21 20:00:25,469 INFO
> >> org.apache.flink.runtime.clusterframework.BootstrapTools      - Setting
> >> directories for temporary files to:
> >>
> /usr/hadoop/hadoop-2.7.7/tmp/nm-local-dir/usercache/root/appcache/application_1566383236573_0003
> >> 2019-08-21 20:00:25,485 INFO
> >> org.apache.flink.runtime.entrypoint.ClusterEntrypoint         - Starting
> >> YarnJobClusterEntrypoint.
> >> 2019-08-21 20:00:25,485 INFO
> >> org.apache.flink.runtime.entrypoint.ClusterEntrypoint         - Install
> >> default filesystem.
> >> 2019-08-21 20:00:25,561 INFO
> >> org.apache.flink.runtime.security.modules.HadoopModule        - Hadoop
> user
> >> set to root (auth:SIMPLE)
> >> 2019-08-21 20:00:25,580 INFO
> >> org.apache.flink.runtime.entrypoint.ClusterEntrypoint         -
> >> Initializing cluster services.
> >> 2019-08-21 20:00:25,757 INFO
> >> org.apache.flink.runtime.rpc.akka.AkkaRpcServiceUtils         - Trying
> to
> >> start actor system at host53:0
> >> 2019-08-21 20:00:26,187 INFO
> >> akka.event.slf4j.Slf4jLogger                                  -
> Slf4jLogger
> >> started
> >> 2019-08-21 20:00:26,251 INFO
> >> akka.remote.Remoting                                          - Starting
> >> remoting
> >> 2019-08-21 20:00:26,439 INFO
> >> akka.remote.Remoting                                          - Remoting
> >> started; listening on addresses :[akka.tcp://flink@host53:59609]
> >> 2019-08-21 20:00:26,447 INFO
> >> org.apache.flink.runtime.rpc.akka.AkkaRpcServiceUtils         - Actor
> >> system started at akka.tcp://flink@host53:59609
> >> 2019-08-21 20:00:27,011 INFO
> >> org.apache.flink.runtime.blob.FileSystemBlobStore             - Creating
> >> highly available BLOB storage directory at
> >> hdfs://host51:9000/flink/ha-yarn/application_1566383236573_0003/blob
> >> 2019-08-21 20:00:27,097 INFO
> >> org.apache.flink.runtime.util.ZooKeeperUtils                  -
> Enforcing
> >> default ACL for ZK connections
> >> 2019-08-21 20:00:27,097 INFO
> >> org.apache.flink.runtime.util.ZooKeeperUtils                  - Using
> >> '/flink-yarn/application_1566383236573_0003' as Zookeeper namespace.
> >> 2019-08-21 20:00:27,161 INFO
> >>
> org.apache.flink.shaded.curator.org.apache.curator.framework.imps.CuratorFrameworkImpl
> >> - Starting
> >> 2019-08-21 20:00:27,169 INFO
> >> org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  -
> Client
> >>
> environment:zookeeper.version=3.4.10-39d3a4f269333c922ed3db283be479f9deacaa0f,
> >> built on 03/23/2017 10:13 GMT
> >> 2019-08-21 20:00:27,169 INFO
> >> org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  -
> Client
> >> environment:host.name=host53
> >> 2019-08-21 20:00:27,169 INFO
> >> org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  -
> Client
> >> environment:java.version=1.8.0_77
> >> 2019-08-21 20:00:27,169 INFO
> >> org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  -
> Client
> >> environment:java.vendor=Oracle Corporation
> >> 2019-08-21 20:00:27,169 INFO
> >> org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  -
> Client
> >> 2019-08-21 20:00:27,170 INFO
> >> org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  -
> Client
> >>
> environment:java.library.path=:/usr/hadoop/hadoop-2.7.7/lib/native:/usr/hadoop/hadoop-2.7.7/lib/native:/usr/java/packages/lib/amd64:/usr/lib64:/lib64:/lib:/usr/lib
> >> 2019-08-21 20:00:27,170 INFO
> >> org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  -
> Client
> >> environment:java.io.tmpdir=/tmp
> >> 2019-08-21 20:00:27,170 INFO
> >> org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  -
> Client
> >> environment:java.compiler=<NA>
> >> 2019-08-21 20:00:27,170 INFO
> >> org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  -
> Client
> >> environment:os.name=Linux
> >> 2019-08-21 20:00:27,170 INFO
> >> org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  -
> Client
> >> environment:os.arch=amd64
> >> 2019-08-21 20:00:27,170 INFO
> >> org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  -
> Client
> >> environment:os.version=3.16.0-0.bpo.4-amd64
> >> 2019-08-21 20:00:27,170 INFO
> >> org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  -
> Client
> >> environment:user.name=root
> >> 2019-08-21 20:00:27,170 INFO
> >> org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  -
> Client
> >> environment:user.home=/root
> >> 2019-08-21 20:00:27,170 INFO
> >> org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  -
> Client
> >>
> environment:user.dir=/usr/hadoop/hadoop-2.7.7/tmp/nm-local-dir/usercache/root/appcache/application_1566383236573_0003/container_1566383236573_0003_01_000001
> >> 2019-08-21 20:00:27,171 INFO
> >> org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper  -
> >> Initiating client connection, connectString=host51:2181
> >> sessionTimeout=60000
> >>
> watcher=org.apache.flink.shaded.curator.org.apache.curator.ConnectionState@319854f0
> >> 2019-08-21 20:00:27,184 WARN
> >> org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ClientCnxn  -
> SASL
> >> configuration failed: javax.security.auth.login.LoginException: No JAAS
> >> configuration section named 'Client' was found in specified JAAS
> >> configuration file: '/tmp/jaas-4274640356638794035.conf'. Will continue
> >> connection to Zookeeper server without SASL authentication, if Zookeeper
> >> server allows it.
> >> 2019-08-21 20:00:27,186 INFO
> >> org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ClientCnxn  -
> >> Opening socket connection to server host51/10.10.10.51:2181
> >> 2019-08-21 20:00:27,187 ERROR
> >> org.apache.flink.shaded.curator.org.apache.curator.ConnectionState  -
> >> Authentication failed
> >> 2019-08-21 20:00:27,187 INFO
> >> org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ClientCnxn  -
> Socket
> >> connection established to host51/10.10.10.51:2181, initiating session
> >> 2019-08-21 20:00:27,193 INFO
> >> org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ClientCnxn  -
> >> Session establishment complete on server host51/10.10.10.51:2181,
> >> sessionid = 0x1001a86de810008, negotiated timeout = 40000
> >> 2019-08-21 20:00:27,193 INFO
> >> org.apache.flink.runtime.blob.BlobServer                      - Created
> >> BLOB server storage directory
> >>
> /usr/hadoop/hadoop-2.7.7/tmp/nm-local-dir/usercache/root/appcache/application_1566383236573_0003/blobStore-eaa2bb2b-cddb-42e3-8ce5-bf2f83e8567a
> >> 2019-08-21 20:00:27,195 INFO
> >>
> org.apache.flink.shaded.curator.org.apache.curator.framework.state.ConnectionStateManager
> >> - State change: CONNECTED
> >> 2019-08-21 20:00:27,198 INFO
> >> org.apache.flink.runtime.blob.BlobServer                      - Started
> >> BLOB server at 0.0.0.0:58591 - max concurrent requests: 50 - max
> >> backlog: 1000
> >> 2019-08-21 20:00:27,213 INFO
> >> org.apache.flink.runtime.metrics.MetricRegistryImpl           - No
> metrics
> >> reporter configured, no metrics will be exposed/reported.
> >> 2019-08-21 20:00:27,215 INFO
> >> org.apache.flink.runtime.entrypoint.ClusterEntrypoint         - Trying
> to
> >> start actor system at host53:0
> >> 2019-08-21 20:00:27,239 INFO
> >> akka.event.slf4j.Slf4jLogger                                  -
> Slf4jLogger
> >> started
> >> 2019-08-21 20:00:27,247 INFO
> >> akka.remote.Remoting                                          - Starting
> >> remoting
> >> 2019-08-21 20:00:27,255 INFO
> >> akka.remote.Remoting                                          - Remoting
> >> started; listening on addresses :[akka.tcp://flink-metrics@host53
> :59941]
> >> 2019-08-21 20:00:27,257 INFO
> >> org.apache.flink.runtime.entrypoint.ClusterEntrypoint         - Actor
> >> system started at akka.tcp://flink-metrics@host53:59941
> >> 2019-08-21 20:00:27,263 INFO
> >> org.apache.flink.runtime.blob.TransientBlobCache              - Created
> >> BLOB cache storage directory
> >>
> /usr/hadoop/hadoop-2.7.7/tmp/nm-local-dir/usercache/root/appcache/application_1566383236573_0003/blobStore-405a4797-824f-48da-bb50-2ee2e230d249
> >> 2019-08-21 20:00:27,296 WARN
> >> org.apache.flink.configuration.Configuration                  - Config
> uses
> >> deprecated configuration key 'web.port' instead of proper key
> >> 'rest.bind-port'
> >> 2019-08-21 20:00:27,298 WARN
> >> org.apache.flink.runtime.jobmaster.MiniDispatcherRestEndpoint  - Upload
> >> directory
> >> /tmp/flink-web-57c9cddd-64af-4e3d-96a9-4f9e78d20b86/flink-web-upload
> does
> >> not exist, or has been deleted externally. Previously uploaded files
> are no
> >> longer available.
> >> 2019-08-21 20:00:27,298 INFO
> >> org.apache.flink.runtime.jobmaster.MiniDispatcherRestEndpoint  - Created
> >> directory
> >> /tmp/flink-web-57c9cddd-64af-4e3d-96a9-4f9e78d20b86/flink-web-upload for
> >> file uploads.
> >> 2019-08-21 20:00:27,320 INFO
> >> org.apache.flink.runtime.jobmaster.MiniDispatcherRestEndpoint  -
> Starting
> >> rest endpoint.
> >> 2019-08-21 20:00:27,616 INFO
> >> org.apache.flink.runtime.webmonitor.WebMonitorUtils           -
> Determined
> >> location of main cluster component log file:
> >>
> /usr/hadoop/hadoop-2.7.7/logs/userlogs/application_1566383236573_0003/container_1566383236573_0003_01_000001/jobmanager.log
> >> 2019-08-21 20:00:27,616 INFO
> >> org.apache.flink.runtime.webmonitor.WebMonitorUtils           -
> Determined
> >> location of main cluster component stdout file:
> >>
> /usr/hadoop/hadoop-2.7.7/logs/userlogs/application_1566383236573_0003/container_1566383236573_0003_01_000001/jobmanager.out
> >> 2019-08-21 20:00:27,771 INFO
> >> org.apache.flink.runtime.jobmaster.MiniDispatcherRestEndpoint  - Rest
> >> endpoint listening at host53:46401
> >> 2019-08-21 20:00:27,772 INFO
> >> org.apache.flink.runtime.leaderelection.ZooKeeperLeaderElectionService
> -
> >> Starting ZooKeeperLeaderElectionService
> >> ZooKeeperLeaderElectionService{leaderPath='/leader/rest_server_lock'}.
> >> 2019-08-21 20:00:27,795 WARN
> >> org.apache.flink.shaded.curator.org.apache.curator.utils.ZKPaths  - The
> >> version of ZooKeeper being used doesn't support Container nodes.
> >> CreateMode.PERSISTENT will be used instead.
> >> 2019-08-21 20:00:27,807 INFO
> >> org.apache.flink.runtime.jobmaster.MiniDispatcherRestEndpoint  - Web
> >> frontend listening at http://host53:46401.
> >> 2019-08-21 20:00:27,836 INFO
> >> org.apache.flink.runtime.jobmaster.MiniDispatcherRestEndpoint  -
> >> http://host53:46401 was granted leadership with
> >> leaderSessionID=8487757f-fd82-42b2-a1a6-adcc69e1eba1
> >> 2019-08-21 20:00:27,932 INFO
> >> org.apache.flink.runtime.rpc.akka.AkkaRpcService              - Starting
> >> RPC endpoint for org.apache.flink.yarn.YarnResourceManager at
> >> akka://flink/user/resourcemanager .
> >> 2019-08-21 20:00:28,021 INFO
> >> org.apache.flink.runtime.rpc.akka.AkkaRpcService              - Starting
> >> RPC endpoint for org.apache.flink.runtime.dispatcher.MiniDispatcher at
> >> akka://flink/user/dispatcher .
> >> 2019-08-21 20:00:28,033 INFO
> >>
> org.apache.flink.runtime.leaderretrieval.ZooKeeperLeaderRetrievalService  -
> >> Starting ZooKeeperLeaderRetrievalService /leader/resource_manager_lock.
> >> 2019-08-21 20:00:28,033 INFO
> >>
> org.apache.flink.runtime.leaderretrieval.ZooKeeperLeaderRetrievalService  -
> >> Starting ZooKeeperLeaderRetrievalService /leader/dispatcher_lock.
> >> 2019-08-21 20:00:28,033 INFO
> >> org.apache.flink.runtime.leaderelection.ZooKeeperLeaderElectionService
> -
> >> Starting ZooKeeperLeaderElectionService
> >> ZooKeeperLeaderElectionService{leaderPath='/leader/dispatcher_lock'}.
> >> 2019-08-21 20:00:28,045 INFO
> >> org.apache.flink.runtime.dispatcher.MiniDispatcher            -
> Dispatcher
> >> akka.tcp://flink@host53:59609/user/dispatcher was granted leadership
> >> with fencing token 6216d17e-56b1-43bb-9bb3-f9fa5b8921be
> >> 2019-08-21 20:00:28,048 INFO
> >> org.apache.flink.runtime.dispatcher.MiniDispatcher            -
> Recovering
> >> all persisted jobs.
> >> 2019-08-21 20:00:28,054 INFO
> >> org.apache.hadoop.yarn.client.RMProxy                         -
> Connecting
> >> to ResourceManager at host51/10.10.10.51:8030
> >> 2019-08-21 20:00:28,078 INFO
> >> org.apache.flink.runtime.rpc.akka.AkkaRpcService              - Starting
> >> RPC endpoint for org.apache.flink.runtime.jobmaster.JobMaster at
> >> akka://flink/user/jobmanager_0 .
> >> 2019-08-21 20:00:28,086 INFO
> >> org.apache.flink.runtime.jobmaster.JobMaster                  -
> >> Initializing job StreamKafkaMain (e5ea83238d721e932b4645589660406c).
> >> 2019-08-21 20:00:28,099 INFO
> >> org.apache.flink.runtime.jobmaster.JobMaster                  - Using
> >> restart strategy
> >> FixedDelayRestartStrategy(maxNumberRestartAttempts=2147483647,
> >> delayBetweenRestartAttempts=0) for StreamKafkaMain
> >> (e5ea83238d721e932b4645589660406c).
> >> 2019-08-21 20:00:28,122 INFO
> >> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Job
> >> recovers via failover strategy: full graph restart
> >> 2019-08-21 20:00:28,153 INFO
> >> org.apache.flink.runtime.jobmaster.JobMaster                  - Running
> >> initialization on master for job StreamKafkaMain
> >> (e5ea83238d721e932b4645589660406c).
> >> 2019-08-21 20:00:28,154 INFO
> >> org.apache.flink.runtime.jobmaster.JobMaster                  -
> >> Successfully ran initialization on master in 0 ms.
> >> 2019-08-21 20:00:28,187 INFO
> >> org.apache.flink.runtime.util.ZooKeeperUtils                  -
> Initialized
> >> ZooKeeperCompletedCheckpointStore in
> >> '/checkpoints/e5ea83238d721e932b4645589660406c'.
> >> 2019-08-21 20:00:28,196 INFO
> >> org.apache.flink.yarn.YarnResourceManager                     -
> Recovered 0
> >> containers from previous attempts ([]).
> >> 2019-08-21 20:00:28,199 INFO
> >>
> org.apache.hadoop.yarn.client.api.impl.ContainerManagementProtocolProxy  -
> >> yarn.client.max-cached-nodemanagers-proxies : 0
> >> 2019-08-21 20:00:28,200 INFO
> >> org.apache.flink.runtime.leaderelection.ZooKeeperLeaderElectionService
> -
> >> Starting ZooKeeperLeaderElectionService
> >>
> ZooKeeperLeaderElectionService{leaderPath='/leader/resource_manager_lock'}.
> >> 2019-08-21 20:00:28,203 INFO
> >> org.apache.flink.runtime.jobmaster.JobMaster                  - No state
> >> backend has been configured, using default (Memory / JobManager)
> >> MemoryStateBackend (data in heap memory / checkpoints to JobManager)
> >> (checkpoints: 'null', savepoints: 'null', asynchronous: TRUE,
> maxStateSize:
> >> 5242880)
> >> 2019-08-21 20:00:28,211 INFO
> >> org.apache.flink.yarn.YarnResourceManager                     -
> >> ResourceManager akka.tcp://flink@host53:59609/user/resourcemanager was
> >> granted leadership with fencing token 97537c0d61bbf091e37cb6c45b784568
> >> 2019-08-21 20:00:28,214 INFO
> >> org.apache.flink.runtime.resourcemanager.slotmanager.SlotManager  -
> >> Starting the SlotManager.
> >> 2019-08-21 20:00:28,225 INFO
> >> org.apache.flink.runtime.checkpoint.ZooKeeperCompletedCheckpointStore  -
> >> Recovering checkpoints from ZooKeeper.
> >> 2019-08-21 20:00:28,229 INFO
> >> org.apache.flink.runtime.checkpoint.ZooKeeperCompletedCheckpointStore  -
> >> Found 0 checkpoints in ZooKeeper.
> >> 2019-08-21 20:00:28,229 INFO
> >> org.apache.flink.runtime.checkpoint.ZooKeeperCompletedCheckpointStore  -
> >> Trying to fetch 0 checkpoints from storage.
> >> 2019-08-21 20:00:28,233 INFO
> >> org.apache.flink.runtime.leaderelection.ZooKeeperLeaderElectionService
> -
> >> Starting ZooKeeperLeaderElectionService
> >>
> ZooKeeperLeaderElectionService{leaderPath='/leader/e5ea83238d721e932b4645589660406c/job_manager_lock'}.
> >> 2019-08-21 20:00:28,248 INFO
> >> org.apache.flink.runtime.jobmaster.JobManagerRunner           -
> JobManager
> >> runner for job StreamKafkaMain (e5ea83238d721e932b4645589660406c) was
> >> granted leadership with session id 12c96c94-ebbd-4991-a457-6f0320d71bc1
> at
> >> akka.tcp://flink@host53:59609/user/jobmanager_0.
> >> 2019-08-21 20:00:28,260 INFO
> >>
> org.apache.flink.runtime.leaderretrieval.ZooKeeperLeaderRetrievalService  -
> >> Starting ZooKeeperLeaderRetrievalService /leader/resource_manager_lock.
> >> 2019-08-21 20:00:28,260 INFO
> >> org.apache.flink.runtime.jobmaster.JobMaster                  - Starting
> >> execution of job StreamKafkaMain (e5ea83238d721e932b4645589660406c)
> under
> >> job master id a4576f0320d71bc112c96c94ebbd4991.
> >> 2019-08-21 20:00:28,261 INFO
> >> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Job
> >> StreamKafkaMain (e5ea83238d721e932b4645589660406c) switched from state
> >> CREATED to RUNNING.
> >> 2019-08-21 20:00:28,266 INFO
> >> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source:
> >> Custom Source -> Flat Map -> Filter -> Filter -> Map ->
> >> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime)
> ->
> >> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime)
> (1/4)
> >> (ee39866835f7b7387f518e352787a532) switched from CREATED to SCHEDULED.
> >> 2019-08-21 20:00:28,280 INFO
> >> org.apache.flink.runtime.jobmaster.slotpool.SlotPoolImpl      - Cannot
> >> serve slot request, no ResourceManager connected. Adding as pending
> request
> >> [SlotRequestId{f6123820ffb8b3e906b876e9dea4497b}]
> >> 2019-08-21 20:00:28,287 INFO
> >> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source:
> >> Custom Source -> Flat Map -> Filter -> Filter -> Map ->
> >> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime)
> ->
> >> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime)
> (2/4)
> >> (f1a893fc5e2440331c8e6db1874ad220) switched from CREATED to SCHEDULED.
> >> 2019-08-21 20:00:28,288 INFO
> >> org.apache.flink.runtime.jobmaster.slotpool.SlotPoolImpl      - Cannot
> >> serve slot request, no ResourceManager connected. Adding as pending
> request
> >> [SlotRequestId{fdbb2e71c9bc795452687396abff7bf8}]
> >> 2019-08-21 20:00:28,288 INFO
> >> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source:
> >> Custom Source -> Flat Map -> Filter -> Filter -> Map ->
> >> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime)
> ->
> >> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime)
> (3/4)
> >> (4756a0450881325af5fb396a18dbb8ae) switched from CREATED to SCHEDULED.
> >> 2019-08-21 20:00:28,289 INFO
> >> org.apache.flink.runtime.jobmaster.slotpool.SlotPoolImpl      - Cannot
> >> serve slot request, no ResourceManager connected. Adding as pending
> request
> >> [SlotRequestId{13970fab9312acf9d5063b1d1a33aa0e}]
> >> 2019-08-21 20:00:28,289 INFO
> >> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source:
> >> Custom Source -> Flat Map -> Filter -> Filter -> Map ->
> >> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime)
> ->
> >> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime)
> (4/4)
> >> (78f520582607e26b365fca483fc98d4c) switched from CREATED to SCHEDULED.
> >> 2019-08-21 20:00:28,289 INFO
> >> org.apache.flink.runtime.jobmaster.slotpool.SlotPoolImpl      - Cannot
> >> serve slot request, no ResourceManager connected. Adding as pending
> request
> >> [SlotRequestId{0621470f796aa3ea4291d1b8b423e428}]
> >> 2019-08-21 20:00:28,289 INFO
> >> org.apache.flink.runtime.executiongraph.ExecutionGraph        - window:
> >> (TumblingGroupWindow('w$, 'rowtime, 60000.millis)), select: (COUNT(*) AS
> >> pv, COUNT(DISTINCT curuserid) AS uv, start('w$) AS w$start, end('w$) AS
> >> w$end, rowtime('w$) AS w$rowtime, proctime('w$) AS w$proctime) ->
> select:
> >> (pv, uv, utc2local(w$end) AS EXPR$2) -> to: Row (1/1)
> >> (0071e1c7c4f70646914329f81dcbd349) switched from CREATED to SCHEDULED.
> >> 2019-08-21 20:00:28,291 INFO
> >> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink:
> Print
> >> to Std. Out (1/4) (0a206444b10553cce42fa34ea0b15497) switched from
> CREATED
> >> to SCHEDULED.
> >> 2019-08-21 20:00:28,291 INFO
> >> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink:
> Print
> >> to Std. Out (2/4) (a5d1bb759ee6c8001313fe39c7982a8b) switched from
> CREATED
> >> to SCHEDULED.
> >> 2019-08-21 20:00:28,291 INFO
> >> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink:
> Print
> >> to Std. Out (3/4) (9543bbde6538ea9d407af3012fede5c8) switched from
> CREATED
> >> to SCHEDULED.
> >> 2019-08-21 20:00:28,291 INFO
> >> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink:
> Print
> >> to Std. Out (4/4) (0e8c4210bd81e581fbf354b86397ce41) switched from
> CREATED
> >> to SCHEDULED.
> >> 2019-08-21 20:00:28,291 INFO
> >> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map ->
> >> Sink: Print to Std. Out (1/4) (39d437bceb033f5bd37747e9142c7f0e)
> switched
> >> from CREATED to SCHEDULED.
> >> 2019-08-21 20:00:28,291 INFO
> >> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map ->
> >> Sink: Print to Std. Out (2/4) (aa4f64edc4f10da2e8bdbf6ebf0aeb83)
> switched
> >> from CREATED to SCHEDULED.
> >> 2019-08-21 20:00:28,291 INFO
> >> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map ->
> >> Sink: Print to Std. Out (3/4) (f109918ce195d1c88c2c63a44934217f)
> switched
> >> from CREATED to SCHEDULED.
> >> 2019-08-21 20:00:28,292 INFO
> >> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map ->
> >> Sink: Print to Std. Out (4/4) (9aa5bb28f3ba75afc1de47bd9a4e5d01)
> switched
> >> from CREATED to SCHEDULED.
> >> 2019-08-21 20:00:28,295 INFO
> >> org.apache.flink.runtime.jobmaster.JobMaster                  -
> Connecting
> >> to ResourceManager akka.tcp://flink@host53
> >> :59609/user/resourcemanager(97537c0d61bbf091e37cb6c45b784568)
> >> 2019-08-21 20:00:28,300 INFO
> >> org.apache.flink.runtime.jobmaster.JobMaster                  - Resolved
> >> ResourceManager address, beginning registration
> >> 2019-08-21 20:00:28,300 INFO
> >> org.apache.flink.runtime.jobmaster.JobMaster                  -
> >> Registration at ResourceManager attempt 1 (timeout=100ms)
> >> 2019-08-21 20:00:28,304 INFO
> >>
> org.apache.flink.runtime.leaderretrieval.ZooKeeperLeaderRetrievalService  -
> >> Starting ZooKeeperLeaderRetrievalService
> >> /leader/e5ea83238d721e932b4645589660406c/job_manager_lock.
> >> 2019-08-21 20:00:28,304 INFO
> >> org.apache.flink.yarn.YarnResourceManager                     -
> Registering
> >> job manager [hidden email]://flink@host53
> :59609/user/jobmanager_0
> >> for job e5ea83238d721e932b4645589660406c.
> >> 2019-08-21 20:00:28,310 INFO
> >> org.apache.flink.yarn.YarnResourceManager                     -
> Registered
> >> job manager [hidden email]://flink@host53
> :59609/user/jobmanager_0
> >> for job e5ea83238d721e932b4645589660406c.
> >> 2019-08-21 20:00:28,313 INFO
> >> org.apache.flink.runtime.jobmaster.JobMaster                  -
> JobManager
> >> successfully registered at ResourceManager, leader id:
> >> 97537c0d61bbf091e37cb6c45b784568.
> >> 2019-08-21 20:00:28,314 INFO
> >> org.apache.flink.runtime.jobmaster.slotpool.SlotPoolImpl      -
> Requesting
> >> new slot [SlotRequestId{13970fab9312acf9d5063b1d1a33aa0e}] and profile
> >> ResourceProfile{cpuCores=-1.0, heapMemoryInMB=-1, directMemoryInMB=0,
> >> nativeMemoryInMB=0, networkMemoryInMB=0} from resource manager.
> >> 2019-08-21 20:00:28,315 INFO
> >> org.apache.flink.yarn.YarnResourceManager                     - Request
> >> slot with profile ResourceProfile{cpuCores=-1.0, heapMemoryInMB=-1,
> >> directMemoryInMB=0, nativeMemoryInMB=0, networkMemoryInMB=0} for job
> >> e5ea83238d721e932b4645589660406c with allocation id
> >> faa6a521758e5be4b3dc1955ca0f51b4.
> >> 2019-08-21 20:00:28,316 INFO
> >> org.apache.flink.runtime.jobmaster.slotpool.SlotPoolImpl      -
> Requesting
> >> new slot [SlotRequestId{fdbb2e71c9bc795452687396abff7bf8}] and profile
> >> ResourceProfile{cpuCores=-1.0, heapMemoryInMB=-1, directMemoryInMB=0,
> >> nativeMemoryInMB=0, networkMemoryInMB=0} from resource manager.
> >> 2019-08-21 20:00:28,316 INFO
> >> org.apache.flink.runtime.jobmaster.slotpool.SlotPoolImpl      -
> Requesting
> >> new slot [SlotRequestId{0621470f796aa3ea4291d1b8b423e428}] and profile
> >> ResourceProfile{cpuCores=-1.0, heapMemoryInMB=-1, directMemoryInMB=0,
> >> nativeMemoryInMB=0, networkMemoryInMB=0} from resource manager.
> >> 2019-08-21 20:00:28,316 INFO
> >> org.apache.flink.runtime.jobmaster.slotpool.SlotPoolImpl      -
> Requesting
> >> new slot [SlotRequestId{f6123820ffb8b3e906b876e9dea4497b}] and profile
> >> ResourceProfile{cpuCores=-1.0, heapMemoryInMB=-1, directMemoryInMB=0,
> >> nativeMemoryInMB=0, networkMemoryInMB=0} from resource manager.
> >> 2019-08-21 20:00:28,321 INFO
> >> org.apache.flink.yarn.YarnResourceManager                     -
> Requesting
> >> new TaskExecutor container with resources <memory:8192, vCores:2>.
> Number
> >> pending requests 1.
> >> 2019-08-21 20:00:28,322 INFO
> >> org.apache.flink.yarn.YarnResourceManager                     - Request
> >> slot with profile ResourceProfile{cpuCores=-1.0, heapMemoryInMB=-1,
> >> directMemoryInMB=0, nativeMemoryInMB=0, networkMemoryInMB=0} for job
> >> e5ea83238d721e932b4645589660406c with allocation id
> >> a2b419aea4a373686eeed6c46163a528.
> >> 2019-08-21 20:00:28,323 INFO
> >> org.apache.flink.yarn.YarnResourceManager                     - Request
> >> slot with profile ResourceProfile{cpuCores=-1.0, heapMemoryInMB=-1,
> >> directMemoryInMB=0, nativeMemoryInMB=0, networkMemoryInMB=0} for job
> >> e5ea83238d721e932b4645589660406c with allocation id
> >> cbc11737ce6729cfe45f24c43180b5c8.
> >> 2019-08-21 20:00:28,323 INFO
> >> org.apache.flink.yarn.YarnResourceManager                     -
> Requesting
> >> new TaskExecutor container with resources <memory:8192, vCores:2>.
> Number
> >> pending requests 2.
> >> 2019-08-21 20:00:28,323 INFO
> >> org.apache.flink.yarn.YarnResourceManager                     - Request
> >> slot with profile ResourceProfile{cpuCores=-1.0, heapMemoryInMB=-1,
> >> directMemoryInMB=0, nativeMemoryInMB=0, networkMemoryInMB=0} for job
> >> e5ea83238d721e932b4645589660406c with allocation id
> >> f6566567c173a55b2bc52b72bcb24ff8.
> >> 2019-08-21 20:00:31,597 INFO
> >> org.apache.flink.runtime.checkpoint.CheckpointCoordinator     -
> Checkpoint
> >> triggering task Source: Custom Source -> Flat Map -> Filter -> Filter ->
> >> Map -> Timestamps/Watermarks -> from: (request, curuserid, timelong,
> >> rowtime) -> select: (rowtime, 0 AS $f1, curuserid) -> time attribute:
> >> (rowtime) (1/4) of job e5ea83238d721e932b4645589660406c is not in state
> >> RUNNING but SCHEDULED instead. Aborting checkpoint.
> >> 2019-08-21 20:00:33,749 INFO
> >> org.apache.hadoop.yarn.client.api.impl.AMRMClientImpl         - Received
> >> new token for : host53:59938
> >> 2019-08-21 20:00:33,749 INFO
> >> org.apache.hadoop.yarn.client.api.impl.AMRMClientImpl         - Received
> >> new token for : host52:33246
> >> 2019-08-21 20:00:33,765 INFO
> >> org.apache.flink.yarn.YarnResourceManager                     - Received
> >> new container: container_1566383236573_0003_01_000002 - Remaining
> pending
> >> container requests: 2
> >> 2019-08-21 20:00:33,765 INFO
> >> org.apache.flink.yarn.YarnResourceManager                     - Removing
> >> container request Capability[<memory:8192, vCores:2>]Priority[1].
> Pending
> >> container requests 1.
> >> 2019-08-21 20:00:34,009 INFO
> >> org.apache.flink.yarn.YarnResourceManager                     - Creating
> >> container launch context for TaskManagers
> >> 2019-08-21 20:00:34,010 INFO
> >> org.apache.flink.yarn.YarnResourceManager                     - Starting
> >> TaskManagers
> >> 2019-08-21 20:00:34,021 INFO
> >>
> org.apache.hadoop.yarn.client.api.impl.ContainerManagementProtocolProxy  -
> >> Opening proxy : host53:59938
> >> 2019-08-21 20:00:34,066 INFO
> >> org.apache.flink.yarn.YarnResourceManager                     - Received
> >> new container: container_1566383236573_0003_01_000003 - Remaining
> pending
> >> container requests: 1
> >> 2019-08-21 20:00:34,066 INFO
> >> org.apache.flink.yarn.YarnResourceManager                     - Removing
> >> container request Capability[<memory:8192, vCores:2>]Priority[1].
> Pending
> >> container requests 0.
> >> 2019-08-21 20:00:34,088 INFO
> >> org.apache.flink.yarn.YarnResourceManager                     - Creating
> >> container launch context for TaskManagers
> >> 2019-08-21 20:00:34,089 INFO
> >> org.apache.flink.yarn.YarnResourceManager                     - Starting
> >> TaskManagers
> >> 2019-08-21 20:00:34,090 INFO
> >>
> org.apache.hadoop.yarn.client.api.impl.ContainerManagementProtocolProxy  -
> >> Opening proxy : host52:33246
> >> 2019-08-21 20:00:36,597 INFO
> >> org.apache.flink.runtime.checkpoint.CheckpointCoordinator     -
> Checkpoint
> >> triggering task Source: Custom Source -> Flat Map -> Filter -> Filter ->
> >> Map -> Timestamps/Watermarks -> from: (request, curuserid, timelong,
> >> rowtime) -> select: (rowtime, 0 AS $f1, curuserid) -> time attribute:
> >> (rowtime) (1/4) of job e5ea83238d721e932b4645589660406c is not in state
> >> RUNNING but SCHEDULED instead. Aborting checkpoint.
> >> 2019-08-21 20:00:37,856 INFO
> >> org.apache.flink.yarn.YarnResourceManager                     -
> Registering
> >> TaskManager with ResourceID container_1566383236573_0003_01_000002
> >> (akka.tcp://flink@host53:34608/user/taskmanager_0) at ResourceManager
> >> 2019-08-21 20:00:39,573 INFO
> >> org.apache.flink.yarn.YarnResourceManager                     -
> Registering
> >> TaskManager with ResourceID container_1566383236573_0003_01_000003
> >> (akka.tcp://flink@host52:32931/user/taskmanager_0) at ResourceManager
> >> 2019-08-21 20:00:39,659 INFO
> >> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source:
> >> Custom Source -> Flat Map -> Filter -> Filter -> Map ->
> >> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime)
> ->
> >> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime)
> (1/4)
> >> (ee39866835f7b7387f518e352787a532) switched from SCHEDULED to DEPLOYING.
> >> 2019-08-21 20:00:39,659 INFO
> >> org.apache.flink.runtime.executiongraph.ExecutionGraph        -
> Deploying
> >> Source: Custom Source -> Flat Map -> Filter -> Filter -> Map ->
> >> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime)
> ->
> >> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime)
> (1/4)
> >> (attempt #0) to container_1566383236573_0003_01_000002 @ host53
> >> (dataPort=52456)
> >> 2019-08-21 20:00:39,664 INFO
> >> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source:
> >> Custom Source -> Flat Map -> Filter -> Filter -> Map ->
> >> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime)
> ->
> >> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime)
> (2/4)
> >> (f1a893fc5e2440331c8e6db1874ad220) switched from SCHEDULED to DEPLOYING.
> >> 2019-08-21 20:00:39,664 INFO
> >> org.apache.flink.runtime.executiongraph.ExecutionGraph        -
> Deploying
> >> Source: Custom Source -> Flat Map -> Filter -> Filter -> Map ->
> >> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime)
> ->
> >> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime)
> (2/4)
> >> (attempt #0) to container_1566383236573_0003_01_000002 @ host53
> >> (dataPort=52456)
> >> 2019-08-21 20:00:39,665 INFO
> >> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source:
> >> Custom Source -> Flat Map -> Filter -> Filter -> Map ->
> >> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime)
> ->
> >> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime)
> (3/4)
> >> (4756a0450881325af5fb396a18dbb8ae) switched from SCHEDULED to DEPLOYING.
> >> 2019-08-21 20:00:39,666 INFO
> >> org.apache.flink.runtime.executiongraph.ExecutionGraph        -
> Deploying
> >> Source: Custom Source -> Flat Map -> Filter -> Filter -> Map ->
> >> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime)
> ->
> >> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime)
> (3/4)
> >> (attempt #0) to container_1566383236573_0003_01_000003 @ host52
> >> (dataPort=39650)
> >> 2019-08-21 20:00:39,666 INFO
> >> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source:
> >> Custom Source -> Flat Map -> Filter -> Filter -> Map ->
> >> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime)
> ->
> >> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime)
> (4/4)
> >> (78f520582607e26b365fca483fc98d4c) switched from SCHEDULED to DEPLOYING.
> >> 2019-08-21 20:00:39,666 INFO
> >> org.apache.flink.runtime.executiongraph.ExecutionGraph        -
> Deploying
> >> Source: Custom Source -> Flat Map -> Filter -> Filter -> Map ->
> >> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime)
> ->
> >> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime)
> (4/4)
> >> (attempt #0) to container_1566383236573_0003_01_000003 @ host52
> >> (dataPort=39650)
> >> 2019-08-21 20:00:39,666 INFO
> >> org.apache.flink.runtime.executiongraph.ExecutionGraph        - window:
> >> (TumblingGroupWindow('w$, 'rowtime, 60000.millis)), select: (COUNT(*) AS
> >> pv, COUNT(DISTINCT curuserid) AS uv, start('w$) AS w$start, end('w$) AS
> >> w$end, rowtime('w$) AS w$rowtime, proctime('w$) AS w$proctime) ->
> select:
> >> (pv, uv, utc2local(w$end) AS EXPR$2) -> to: Row (1/1)
> >> (0071e1c7c4f70646914329f81dcbd349) switched from SCHEDULED to DEPLOYING.
> >> 2019-08-21 20:00:39,666 INFO
> >> org.apache.flink.runtime.executiongraph.ExecutionGraph        -
> Deploying
> >> window: (TumblingGroupWindow('w$, 'rowtime, 60000.millis)), select:
> >> (COUNT(*) AS pv, COUNT(DISTINCT curuserid) AS uv, start('w$) AS w$start,
> >> end('w$) AS w$end, rowtime('w$) AS w$rowtime, proctime('w$) AS
> w$proctime)
> >> -> select: (pv, uv, utc2local(w$end) AS EXPR$2) -> to: Row (1/1)
> (attempt
> >> #0) to container_1566383236573_0003_01_000002 @ host53 (dataPort=52456)
> >> 2019-08-21 20:00:39,670 INFO
> >> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink:
> Print
> >> to Std. Out (1/4) (0a206444b10553cce42fa34ea0b15497) switched from
> >> SCHEDULED to DEPLOYING.
> >> 2019-08-21 20:00:39,670 INFO
> >> org.apache.flink.runtime.executiongraph.ExecutionGraph        -
> Deploying
> >> Sink: Print to Std. Out (1/4) (attempt #0) to
> >> container_1566383236573_0003_01_000003 @ host52 (dataPort=39650)
> >> 2019-08-21 20:00:39,671 INFO
> >> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink:
> Print
> >> to Std. Out (2/4) (a5d1bb759ee6c8001313fe39c7982a8b) switched from
> >> SCHEDULED to DEPLOYING.
> >> 2019-08-21 20:00:39,672 INFO
> >> org.apache.flink.runtime.executiongraph.ExecutionGraph        -
> Deploying
> >> Sink: Print to Std. Out (2/4) (attempt #0) to
> >> container_1566383236573_0003_01_000003 @ host52 (dataPort=39650)
> >> 2019-08-21 20:00:39,672 INFO
> >> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink:
> Print
> >> to Std. Out (3/4) (9543bbde6538ea9d407af3012fede5c8) switched from
> >> SCHEDULED to DEPLOYING.
> >> 2019-08-21 20:00:39,672 INFO
> >> org.apache.flink.runtime.executiongraph.ExecutionGraph        -
> Deploying
> >> Sink: Print to Std. Out (3/4) (attempt #0) to
> >> container_1566383236573_0003_01_000002 @ host53 (dataPort=52456)
> >> 2019-08-21 20:00:39,672 INFO
> >> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink:
> Print
> >> to Std. Out (4/4) (0e8c4210bd81e581fbf354b86397ce41) switched from
> >> SCHEDULED to DEPLOYING.
> >> 2019-08-21 20:00:39,673 INFO
> >> org.apache.flink.runtime.executiongraph.ExecutionGraph        -
> Deploying
> >> Sink: Print to Std. Out (4/4) (attempt #0) to
> >> container_1566383236573_0003_01_000002 @ host53 (dataPort=52456)
> >> 2019-08-21 20:00:39,673 INFO
> >> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map ->
> >> Sink: Print to Std. Out (1/4) (39d437bceb033f5bd37747e9142c7f0e)
> switched
> >> from SCHEDULED to DEPLOYING.
> >> 2019-08-21 20:00:39,673 INFO
> >> org.apache.flink.runtime.executiongraph.ExecutionGraph        -
> Deploying
> >> Map -> Sink: Print to Std. Out (1/4) (attempt #0) to
> >> container_1566383236573_0003_01_000003 @ host52 (dataPort=39650)
> >> 2019-08-21 20:00:39,675 INFO
> >> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map ->
> >> Sink: Print to Std. Out (2/4) (aa4f64edc4f10da2e8bdbf6ebf0aeb83)
> switched
> >> from SCHEDULED to DEPLOYING.
> >> 2019-08-21 20:00:39,675 INFO
> >> org.apache.flink.runtime.executiongraph.ExecutionGraph        -
> Deploying
> >> Map -> Sink: Print to Std. Out (2/4) (attempt #0) to
> >> container_1566383236573_0003_01_000003 @ host52 (dataPort=39650)
> >> 2019-08-21 20:00:39,675 INFO
> >> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map ->
> >> Sink: Print to Std. Out (3/4) (f109918ce195d1c88c2c63a44934217f)
> switched
> >> from SCHEDULED to DEPLOYING.
> >> 2019-08-21 20:00:39,675 INFO
> >> org.apache.flink.runtime.executiongraph.ExecutionGraph        -
> Deploying
> >> Map -> Sink: Print to Std. Out (3/4) (attempt #0) to
> >> container_1566383236573_0003_01_000002 @ host53 (dataPort=52456)
> >> 2019-08-21 20:00:39,676 INFO
> >> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map ->
> >> Sink: Print to Std. Out (4/4) (9aa5bb28f3ba75afc1de47bd9a4e5d01)
> switched
> >> from SCHEDULED to DEPLOYING.
> >> 2019-08-21 20:00:39,676 INFO
> >> org.apache.flink.runtime.executiongraph.ExecutionGraph        -
> Deploying
> >> Map -> Sink: Print to Std. Out (4/4) (attempt #0) to
> >> container_1566383236573_0003_01_000002 @ host53 (dataPort=52456)
> >> 2019-08-21 20:00:39,782 INFO
> >> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink:
> Print
> >> to Std. Out (4/4) (0e8c4210bd81e581fbf354b86397ce41) switched from
> >> DEPLOYING to RUNNING.
> >> 2019-08-21 20:00:39,783 INFO
> >> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink:
> Print
> >> to Std. Out (3/4) (9543bbde6538ea9d407af3012fede5c8) switched from
> >> DEPLOYING to RUNNING.
> >> 2019-08-21 20:00:39,784 INFO
> >> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map ->
> >> Sink: Print to Std. Out (4/4) (9aa5bb28f3ba75afc1de47bd9a4e5d01)
> switched
> >> from DEPLOYING to RUNNING.
> >> 2019-08-21 20:00:39,787 INFO
> >> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map ->
> >> Sink: Print to Std. Out (3/4) (f109918ce195d1c88c2c63a44934217f)
> switched
> >> from DEPLOYING to RUNNING.
> >> 2019-08-21 20:00:39,806 INFO
> >> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source:
> >> Custom Source -> Flat Map -> Filter -> Filter -> Map ->
> >> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime)
> ->
> >> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime)
> (2/4)
> >> (f1a893fc5e2440331c8e6db1874ad220) switched from DEPLOYING to RUNNING.
> >> 2019-08-21 20:00:39,809 INFO
> >> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source:
> >> Custom Source -> Flat Map -> Filter -> Filter -> Map ->
> >> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime)
> ->
> >> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime)
> (1/4)
> >> (ee39866835f7b7387f518e352787a532) switched from DEPLOYING to RUNNING.
> >> 2019-08-21 20:00:39,809 INFO
> >> org.apache.flink.runtime.executiongraph.ExecutionGraph        - window:
> >> (TumblingGroupWindow('w$, 'rowtime, 60000.millis)), select: (COUNT(*) AS
> >> pv, COUNT(DISTINCT curuserid) AS uv, start('w$) AS w$start, end('w$) AS
> >> w$end, rowtime('w$) AS w$rowtime, proctime('w$) AS w$proctime) ->
> select:
> >> (pv, uv, utc2local(w$end) AS EXPR$2) -> to: Row (1/1)
> >> (0071e1c7c4f70646914329f81dcbd349) switched from DEPLOYING to RUNNING.
> >> 2019-08-21 20:00:39,991 INFO
> >> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink:
> Print
> >> to Std. Out (1/4) (0a206444b10553cce42fa34ea0b15497) switched from
> >> DEPLOYING to RUNNING.
> >> 2019-08-21 20:00:39,992 INFO
> >> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Sink:
> Print
> >> to Std. Out (2/4) (a5d1bb759ee6c8001313fe39c7982a8b) switched from
> >> DEPLOYING to RUNNING.
> >> 2019-08-21 20:00:39,996 INFO
> >> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map ->
> >> Sink: Print to Std. Out (1/4) (39d437bceb033f5bd37747e9142c7f0e)
> switched
> >> from DEPLOYING to RUNNING.
> >> 2019-08-21 20:00:39,997 INFO
> >> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Map ->
> >> Sink: Print to Std. Out (2/4) (aa4f64edc4f10da2e8bdbf6ebf0aeb83)
> switched
> >> from DEPLOYING to RUNNING.
> >> 2019-08-21 20:00:40,005 INFO
> >> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source:
> >> Custom Source -> Flat Map -> Filter -> Filter -> Map ->
> >> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime)
> ->
> >> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime)
> (4/4)
> >> (78f520582607e26b365fca483fc98d4c) switched from DEPLOYING to RUNNING.
> >> 2019-08-21 20:00:40,006 INFO
> >> org.apache.flink.runtime.executiongraph.ExecutionGraph        - Source:
> >> Custom Source -> Flat Map -> Filter -> Filter -> Map ->
> >> Timestamps/Watermarks -> from: (request, curuserid, timelong, rowtime)
> ->
> >> select: (rowtime, 0 AS $f1, curuserid) -> time attribute: (rowtime)
> (3/4)
> >> (4756a0450881325af5fb396a18dbb8ae) switched from DEPLOYING to RUNNING.
> >> 2019-08-21 20:00:41,605 INFO
> >> org.apache.flink.runtime.checkpoint.CheckpointCoordinator     -
> Triggering
> >> checkpoint 1 @ 1566388841597 for job e5ea83238d721e932b4645589660406c.
> >> 2019-08-21 20:00:42,466 INFO
> >> org.apache.flink.runtime.checkpoint.CheckpointCoordinator     -
> Completed
> >> checkpoint 1 for job e5ea83238d721e932b4645589660406c (7994 bytes in 821
> >> ms).
> >>
> >>
> >>
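To locate where the startup time goes in a full JobManager log like the one quoted above, it can help to scan for large jumps between consecutive timestamps. The following is a minimal sketch, not part of the original thread: the function name `find_gaps` and the threshold values are illustrative assumptions, and the only thing taken from the log is its timestamp format (`2019-08-21 20:00:28,316`).

```python
import re
from datetime import datetime

# Timestamp format as it appears in the quoted log, e.g. "2019-08-21 20:00:28,316".
TIMESTAMP = re.compile(r"\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2},\d{3}")


def find_gaps(lines, min_gap_seconds=60.0):
    """Return (previous, current, gap_seconds) for consecutive log timestamps
    that are at least min_gap_seconds apart. Hypothetical helper."""
    gaps = []
    previous = None
    for line in lines:
        match = TIMESTAMP.search(line)
        if match is None:
            continue  # continuation of a wrapped log entry, or a blank quote line
        current = datetime.strptime(match.group(0), "%Y-%m-%d %H:%M:%S,%f")
        if previous is not None:
            gap = (current - previous).total_seconds()
            if gap >= min_gap_seconds:
                gaps.append((previous, current, gap))
        previous = current
    return gaps


# Two timestamps copied from the quoted log; the quote prefix is ignored by the regex.
sample = [
    "> >> 2019-08-21 20:00:28,316 INFO",
    "> Requesting",
    "> >> 2019-08-21 20:00:33,749 INFO",
]
for before, after, seconds in find_gaps(sample, min_gap_seconds=5.0):
    print(f"{seconds:.3f}s between {before} and {after}")
```

Run over the complete log with a threshold of a few minutes, any reported gap marks the entries just before and after a long wait, which is where a slot request timeout would be expected to show up if that is the cause.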
> >> ------------------ Original Message ------------------
> >> From: "Yun Tang"<[hidden email]>;
> >> Sent: Thursday, August 22, 2019, 11:08 AM
> >> To: "user-zh"<[hidden email]>;
> >>
> >> Subject: Re: Flink startup waits 10 minutes
> >>
> >>
> >>
> >> Hi
> >>
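> >> A Flink on YARN job can take a long time to start for many reasons, for
> >> example waiting because resources are insufficient, or a container exiting
> >> right after it was requested. The default slot request timeout is 5 minutes,
> >> so I suspect your job hit a slot request timeout and then requested the
> >> slots again. Please provide the JobManager log so this can be analyzed
> >> further.
> >>
> >> Best regards,
> >> Yun Tang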
> >> ________________________________
> >> From: 々守护々 <[hidden email]>
> >> Sent: Thursday, August 22, 2019 11:04
> >> To: user-zh <[hidden email]>
> >> Subject: Flink startup waits 10 minutes
> >>
> >> Hello Flink community,
> >>       I am using Flink SQL (Flink 1.8.1) and adjusted the memory parameters in yarn-site.xml on my Hadoop cluster:
> >>         <property>
> >>                 <name>yarn.nodemanager.resource.memory-mb</name>
> >>                 <value>16384</value>
> >>         </property>
> >>         <property>
> >>                 <name>yarn.scheduler.minimum-allocation-mb</name>
> >>                 <value>1024</value>
> >>         </property>
> >>         <property>
> >>                 <name>yarn.scheduler.maximum-allocation-mb</name>
> >>                 <value>8192</value>
> >>         </property>
> >>
> >>
> >>
> >> Then I start the Flink on YARN job with the following command:
> >> ./flink-1.8.1/bin/flink run -m yarn-cluster -p 4 -d -c streaming.StreamKafkaMain -yn 4 -ys 2 -yjm 2048 -ytm 8192 -ynm bigdata6 ./flink_bigdata6.jar
> >> The job does eventually start successfully, but startup takes 10 minutes, which is far too long. Does anyone know how to fix this?
> >>
> >
>
>
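The numbers in the quoted message (yarn-site.xml values and the `-p`, `-ys`, `-yjm`, `-ytm` flags) can be checked against each other with some simple arithmetic. The sketch below is illustrative only: the helper `plan` is hypothetical, it assumes YARN rounds each request up to a multiple of `yarn.scheduler.minimum-allocation-mb` (scheduler dependent), and it assumes default slot sharing so that the number of slots needed equals the job parallelism.

```python
import math


def round_up(mb, step_mb):
    """Round a memory request up to the next multiple of step_mb."""
    return math.ceil(mb / step_mb) * step_mb


def plan(parallelism, slots_per_tm, tm_mb, jm_mb, node_mb, min_alloc_mb, max_alloc_mb):
    """Estimate container counts for a Flink on YARN job (hypothetical helper)."""
    tm_container = round_up(tm_mb, min_alloc_mb)
    jm_container = round_up(jm_mb, min_alloc_mb)
    return {
        # Assumes default slot sharing: slots needed == parallelism.
        "task_managers_needed": math.ceil(parallelism / slots_per_tm),
        "tm_container_mb": tm_container,
        "fits_max_allocation": tm_container <= max_alloc_mb,
        # TaskManager containers that fit on a NodeManager with nothing else on it.
        "tms_per_empty_node": node_mb // tm_container,
        # TaskManager containers that fit next to the JobManager container.
        "tms_on_jm_node": (node_mb - jm_container) // tm_container,
    }


# Values taken from the quoted message: -p 4 -ys 2 -ytm 8192 -yjm 2048,
# nodemanager memory 16384, minimum allocation 1024, maximum allocation 8192.
result = plan(4, 2, 8192, 2048, 16384, 1024, 8192)
for key, value in result.items():
    print(f"{key}: {value}")
```

Under these assumptions the job needs two TaskManager containers of 8192 MB each, which matches the two `<memory:8192, vCores:2>` requests in the quoted log, and the node hosting the JobManager has room for only one of them. Whether that contributed to the delay cannot be determined from this excerpt alone.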